PSIBLAST 2.11.0+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Stephen F.
Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005)
"Protein database searches using compositionally adjusted
substitution matrices", FEBS J. 272:5101-5109.


Reference for composition-based statistics starting in round 2:
Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei
Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and
Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST
protein database searches with composition-based statistics and
other refinements", Nucleic Acids Res. 29:2994-3005.



Database: nr30
           33,704,358 sequences; 7,716,583,296 total letters

Results from round 5


Query= Malomonas_qIce2__nr30

Length=630
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

WP_072908504.1 hypothetical protein [Malonomonas rubra]SHJ30007.1...  299     4e-90
PLY07819.1 hypothetical protein C0624_03270 [Desulfuromonas sp.]      154     1e-36
RME32505.1 tetratricopeptide repeat protein, partial [Gammaproteo...  147     2e-34
PYM54254.1 hypothetical protein DMD77_23985, partial [Candidatus ...  135     4e-31
NNJ91639.1 hypothetical protein [Gammaproteobacteria bacterium]       136     2e-30
WP_049721524.1 hypothetical protein [Gilvimarinus polysaccharolyt...  135     4e-30
PYN97354.1 hypothetical protein DMD91_18295 [Candidatus Rokubacte...  134     5e-30
PYN07539.1 hypothetical protein DME02_11495, partial [Candidatus ...  131     3e-29
MBE0616275.1 RDD family protein [Burkholderiales bacterium]           128     3e-28
WP_153498920.1 hypothetical protein [Alcanivorax sp. PA15-N-34]MQ...  127     9e-28
WP_094984345.1 hypothetical protein [Cellvibrio mixtus]OZY86742.1...  127     2e-27
PCJ44889.1 hypothetical protein COA99_06475, partial [Moraxellace...  124     9e-27
WP_198262883.1 hypothetical protein [sulfur-oxidizing endosymbion...  124     2e-26
RME67327.1 hypothetical protein D6778_03425 [Nitrospirae bacterium]   115     3e-26
PLX80586.1 hypothetical protein C0615_00685 [Desulfuromonas sp.]      121     5e-26
TNE28966.1 hypothetical protein EP349_07150 [Alphaproteobacteria ...  119     3e-25
NOY85388.1 hypothetical protein [Nitrospirae bacterium]               112     7e-25
WP_113955051.1 hypothetical protein [Arenicella xantha]RBP49185.1...  116     3e-24
WP_041551596.1 hypothetical protein [Cellvibrio japonicus]ACE8585...  116     4e-24
PPR15797.1 hypothetical protein CFH43_00866, partial [Proteobacte...  112     7e-24
PIR46206.1 hypothetical protein COV07_04540 [Candidatus Vogelbact...  112     3e-23
PLX78103.1 hypothetical protein C0616_15510 [Desulfuromonas sp.]      111     6e-23
WP_144392134.1 hypothetical protein [Pleionea sediminis]              112     7e-23
MAF31333.1 hypothetical protein [Magnetococcales bacterium]           111     8e-23
EKD24616.1 hypothetical protein ACD_80C00181G0002 [uncultured bac...  109     2e-22
NIS09268.1 hypothetical protein [Candidatus Dadabacteria bacteriu...  111     2e-22
NRB77098.1 hypothetical protein [Saccharospirillaceae bacterium]      109     5e-22
WP_008042065.1 hypothetical protein [Reinekea blandensis]EAR10730...  109     7e-22
MBI5694516.1 zinc-ribbon domain-containing protein [Nitrospirae b...  106     6e-21
WP_146504520.1 hypothetical protein [Rubinisphaera italica]TWT626...  105     1e-20
MBE9582320.1 HEAT repeat domain-containing protein [Proteobacteri...  104     2e-20
MBE9544953.1 zinc-ribbon domain-containing protein [Proteobacteri...  104     2e-20
PYM69976.1 hypothetical protein DME10_21885 [Candidatus Rokubacte...  103     2e-20
NIM06119.1 hypothetical protein [Armatimonadetes bacterium]NIO976...  102     8e-20
NNK01369.1 hypothetical protein [Desulfatitalea sp.]                  98.7    1e-19
MBI5624999.1 hypothetical protein [Elusimicrobia bacterium]           100     3e-19
MBI5641685.1 hypothetical protein [Nitrospirae bacterium]             99.8    4e-19
WP_150074101.1 hypothetical protein [Rhodopirellula sp. JC645]KAA...  99.1    8e-19
WP_198264656.1 hypothetical protein [sulfur-oxidizing endosymbion...  95.6    8e-19
CAB1070897.1 hypothetical protein D1AOALGA4SA_1060 [Olavius algar...  99.5    1e-18
MBC8870065.1 hypothetical protein [Planctomycetes bacterium]          95.6    2e-18
HAD58437.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    95.2    2e-17
MBI2670271.1 hypothetical protein [Candidatus Yanofskybacteria ba...  94.1    4e-17
MBC8871457.1 hypothetical protein [Planctomycetes bacterium]          92.1    2e-16
NIY14984.1 hypothetical protein [Nitrospinaceae bacterium]            89.4    2e-16
WP_167192080.1 hypothetical protein [Aestuariicella hydrocarbonica]   91.0    2e-16
MBI3648016.1 hypothetical protein [Actinobacteria bacterium]          87.5    3e-16
NJD55990.1 hypothetical protein [Nitrospirae bacterium]               91.0    3e-16
MBI5187560.1 pilus assembly protein PilP [Nitrospirae bacterium]      90.2    3e-16
RLC34648.1 hypothetical protein DRZ76_02340 [Candidatus Nealsonba...  88.3    3e-16
MBI2062359.1 hypothetical protein [Candidatus Yanofskybacteria ba...  90.2    3e-16
KKT50197.1 hypothetical protein UW40_C0008G0005 [Parcubacteria gr...  88.3    4e-16
MBI2677778.1 hypothetical protein [Candidatus Koribacter versatilis]  86.7    5e-16
MAG94652.1 hypothetical protein [Planctomycetaceae bacterium]         87.5    5e-16
OGD63017.1 hypothetical protein A2160_05215 [Candidatus Beckwithb...  88.3    6e-16
PYJ43452.1 hypothetical protein DME80_08840 [Verrucomicrobia bact...  85.2    1e-15
MBI4273046.1 hypothetical protein [Candidatus Uhrbacteria bacterium]  89.4    1e-15
WP_054977454.1 hypothetical protein [Paenibacillus sp. A3]            87.9    2e-15
NNK00149.1 hypothetical protein [Desulfatitalea sp.]                  89.1    2e-15
KKT47946.1 hypothetical protein UW39_C0004G0005 [Parcubacteria gr...  84.4    3e-15
OGZ50532.1 hypothetical protein A3C83_02435 [Candidatus Ryanbacte...  87.9    3e-15
KKR18156.1 hypothetical protein UT47_C0003G0168 [candidate divisi...  83.7    3e-15
RMG35661.1 hypothetical protein D6725_12135 [Planctomycetes bacte...  84.8    5e-15
PCJ63660.1 hypothetical protein COA73_05130 [Candidatus Hydrogene...  87.1    5e-15
OGZ79235.1 hypothetical protein A2358_03170 [Candidatus Staskawic...  85.2    8e-15
HBD04937.1 TPA: hypothetical protein [Candidatus Uhrbacteria bact...  83.7    1e-14
KKU46575.1 hypothetical protein UX65_C0002G0049 [Parcubacteria gr...  84.8    1e-14
MBD8978789.1 DUF975 family protein [Clostridiales bacterium]          85.2    2e-14
NIQ93538.1 hypothetical protein [Desulfuromonadales bacterium]NIR...  83.3    2e-14
MBC8087786.1 hypothetical protein [Phycisphaerae bacterium]           80.6    2e-14
MBI3653834.1 hypothetical protein [Acidobacteria bacterium]           84.1    2e-14
PIS41547.1 hypothetical protein COT25_02505 [Candidatus Kerfeldba...  84.4    2e-14
NNL75238.1 hypothetical protein [Desulfobacterales bacterium]         83.7    2e-14
ANZ76760.1 BA75_03591T0 [Komagataella pastoris]                       84.4    2e-14
WP_111889993.1 DUF975 family protein [Aerococcus urinae]RAV93935....  84.8    3e-14
PLX75151.1 hypothetical protein C0614_11220 [Desulfuromonas sp.]      84.1    5e-14
PYM54499.1 hypothetical protein DMD79_24525, partial [Candidatus ...  76.7    6e-14
OGY17246.1 hypothetical protein A2784_02080 [Candidatus Chisholmb...  80.2    6e-14
PIZ00505.1 hypothetical protein COY62_02255 [bacterium (Candidatu...  81.4    8e-14
KKW28605.1 hypothetical protein UY73_C0041G0005 [Parcubacteria gr...  82.9    8e-14
NLV54496.1 hypothetical protein [Acidimicrobiales bacterium]          81.7    8e-14
MBI3120608.1 hypothetical protein [Candidatus Kerfeldbacteria bac...  79.0    1e-13
QFU93549.1 hypothetical protein YIM_42065 [Amycolatopsis sp. YIM 10]  82.1    1e-13
OGS26675.1 hypothetical protein A2297_04140 [Elusimicrobia bacter...  82.5    1e-13
HDN01387.1 TPA: hypothetical protein [Candidatus Bathyarchaeota a...  79.8    2e-13
SFB51261.1 hypothetical protein SAMN05216266_11559 [Amycolatopsis...  79.4    2e-13
NLM70207.1 hypothetical protein [Firmicutes bacterium]                79.4    3e-13
WP_166080418.1 glycerophosphoryl diester phosphodiesterase membra...  81.4    3e-13
PIW91097.1 hypothetical protein COZ91_02280, partial [Candidatus ...  76.3    3e-13
WP_152053729.1 hypothetical protein [Aquisphaera sp. JC650]           81.4    4e-13
MBI4063975.1 hypothetical protein [Elusimicrobia bacterium]           77.9    4e-13
PWL61158.1 hypothetical protein DBY37_07050 [Desulfovibrionaceae ...  79.0    5e-13
TDJ55075.1 hypothetical protein E2O47_04950 [Gemmatimonadetes bac...  76.0    6e-13
PIT91442.1 hypothetical protein COU17_00420 [Candidatus Kaiserbac...  77.9    6e-13
WP_114370738.1 hypothetical protein [Blastopirellula cremea]RCS43...  80.2    8e-13
MBI4087172.1 hypothetical protein [Candidatus Kaiserbacteria bact...  76.3    1e-12
WP_158039999.1 hypothetical protein [Pseudoclavibacter chungangen...  78.7    2e-12
PJA45116.1 hypothetical protein CO174_04745 [Candidatus Uhrbacter...  75.6    2e-12
WP_171809514.1 hypothetical protein [Corallococcus exiguus]NPD286...  79.0    3e-12
MBF6612405.1 glycerophosphoryl diester phosphodiesterase membrane...  78.3    3e-12
MBI2903216.1 hypothetical protein [Candidatus Methylomirabilis ox...  75.6    3e-12
HGE14881.1 TPA: hypothetical protein [Candidatus Parcubacteria ba...  78.7    3e-12
WP_068790608.1 hypothetical protein [Phormidium willei]OAB55799.1...  75.2    3e-12
MBE6446274.1 hypothetical protein [Alphaproteobacteria bacterium]     78.3    4e-12
MBE0521809.1 DUF975 family protein [Candidatus Methanoperedenacea...  77.9    4e-12
MBI2166473.1 zinc ribbon domain-containing protein [Chloroflexi b...  75.2    4e-12
TMM16922.1 hypothetical protein E6F98_00490 [Actinobacteria bacte...  74.8    4e-12
NOZ86130.1 hypothetical protein [Deltaproteobacteria bacterium]       76.0    4e-12
NIO19370.1 hypothetical protein [Candidatus Aenigmarchaeota archa...  75.6    4e-12
WP_130023871.1 hypothetical protein [Emticicia sp. 17J42-9]RYU929...  77.5    5e-12
MBI5106072.1 glycerophosphoryl diester phosphodiesterase membrane...  74.8    7e-12
MBF0618578.1 hypothetical protein [Candidatus Omnitrophica bacter...  74.0    7e-12
MBJ7342504.1 hypothetical protein [Solirubrobacteraceae bacterium]    71.3    7e-12
HIF31776.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    76.3    8e-12
WP_166233540.1 glycerophosphoryl diester phosphodiesterase membra...  75.2    1e-11
NCO75413.1 hypothetical proteinNCO78273.1 hypothetical proteinNCQ...  72.9    1e-11
EQB62434.1 hypothetical protein RBG1_1C00001G0013 [candidate divi...  72.5    2e-11
WP_156892035.1 glycerophosphoryl diester phosphodiesterase membra...  76.0    2e-11
OGZ78234.1 hypothetical protein A2358_04180 [Candidatus Staskawic...  72.1    2e-11
WP_161390382.1 glycerophosphoryl diester phosphodiesterase membra...  71.7    2e-11
OGK18868.1 hypothetical protein A2799_02280 [Candidatus Roizmanba...  74.8    2e-11
MBI4415174.1 hypothetical protein [Candidatus Kerfeldbacteria bac...  74.0    2e-11
NHV97722.1 hypothetical protein [Thaumarchaeota archaeon]             72.1    3e-11
PTL75252.1 hypothetical protein DAT35_55950 [Vitiosangium sp. GDM...  74.4    3e-11
KXA89225.1 hypothetical protein AKJ57_05660 [candidate division M...  69.4    3e-11
NQW20074.1 hypothetical protein [Chloroflexi bacterium]               71.0    3e-11
OGY30342.1 hypothetical protein A3F35_00750 [Candidatus Woykebact...  72.5    3e-11
MXX63576.1 hypothetical protein [Acidimicrobiia bacterium]MYD0359...  71.7    4e-11
MAF79665.1 hypothetical protein [bacterium]                           74.4    4e-11
OQB13217.1 hypothetical protein BWY16_00266 [Candidatus Omnitroph...  71.0    5e-11
RTK93754.1 hypothetical protein EKI60_05420 [Candidatus Saccharib...  71.0    6e-11
MBM98530.1 hypothetical protein [Planctomycetaceae bacterium]         72.9    6e-11
WP_169451565.1 glycerophosphoryl diester phosphodiesterase membra...  70.2    6e-11
HGV67288.1 TPA: hypothetical protein [Candidatus Moranbacteria ba...  71.7    7e-11
PIS40798.1 hypothetical protein COT26_01430 [Candidatus Kerfeldba...  71.0    7e-11
NDV08681.1 PQQ-binding-like beta-propeller repeat protein [Rhodoc...  74.4    7e-11
ERH20767.1 hypothetical protein HMPREF1978_00028 [Actinomyces gra...  70.2    8e-11
CAB1321354.1 unnamed protein product [Coregonus sp. 'balchen']        74.4    8e-11
NLI34789.1 DUF975 family protein [Deltaproteobacteria bacterium]      72.1    8e-11
HBF67379.1 TPA: hypothetical protein [Candidatus Magasanikbacteri...  72.5    9e-11
PWM08302.1 hypothetical protein DBX98_00680 [Clostridiales bacter...  71.0    1e-10
MBA3723719.1 hypothetical protein [Candidatus Levybacteria bacter...  72.1    1e-10
PIS21728.1 hypothetical protein COT51_01250 [candidate division W...  72.1    1e-10
EFD92724.1 hypothetical protein BJBARM5_0577 [Candidatus Parvarch...  70.2    1e-10
MBI4281800.1 hypothetical protein [Candidatus Uhrbacteria bacterium]  71.0    1e-10
KKR53597.1 hypothetical protein UT90_C0006G0009 [Parcubacteria gr...  70.6    1e-10
HFH10638.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  72.5    1e-10
HGX05283.1 TPA: hypothetical protein [Gemmataceae bacterium]          71.7    1e-10
WP_115032115.1 glycerophosphoryl diester phosphodiesterase membra...  72.1    1e-10
TAN40314.1 hypothetical protein EPN25_08115 [Nitrospirae bacterium]   69.4    1e-10
OVE80851.1 hypothetical protein BVY04_04755 [bacterium M21]           71.0    2e-10
WP_013269169.1 hypothetical protein [Brevundimonas subvibrioides]...  71.3    2e-10
MBI2506872.1 hypothetical protein [Candidatus Colwellbacteria bac...  70.2    2e-10
PYQ65644.1 hypothetical protein DMF53_05085 [Acidobacteria bacter...  71.0    2e-10
MBA3550783.1 glycerophosphoryl diester phosphodiesterase membrane...  71.3    2e-10
HEV45185.1 TPA: hypothetical protein [Caulobacterales bacterium]      72.5    2e-10
MBA3422132.1 hypothetical protein [Thermoleophilaceae bacterium]      70.6    2e-10
PIZ48299.1 hypothetical protein COY32_00210 [candidate division W...  72.1    2e-10
RKY68944.1 hypothetical protein DRP97_05810 [Candidatus Latesciba...  69.8    2e-10
WP_050702637.1 EI24 domain-containing protein [Dysgonomonas sp. B...  70.6    2e-10
MBI5175733.1 hypothetical protein [Candidatus Melainabacteria bac...  71.3    2e-10
OQX00469.1 hypothetical protein BWK69_01345 [Candidatus Parcubact...  69.4    2e-10
AGW14098.1 hypothetical protein DGI_2345 [Desulfovibrio gigas DSM...  69.8    2e-10
WP_183343261.1 hypothetical protein [Conexibacter arvalis]MBB4663...  69.0    2e-10
CCY66643.1 putative uncharacterized protein [Clostridium sp. CAG:...  71.3    2e-10
MBI2910977.1 hypothetical protein [Chloroflexi bacterium]             72.1    2e-10
MBI4252880.1 hypothetical protein [Candidatus Uhrbacteria bacterium]  71.0    2e-10
EKE14382.1 hypothetical protein ACD_12C00540G0002 [uncultured bac...  69.4    3e-10
NQU78030.1 hypothetical protein [Candidatus Falkowbacteria bacter...  71.3    3e-10
RJQ14347.1 hypothetical protein C4553_01375 [Candidatus Parcubact...  70.6    3e-10
OGC78213.1 hypothetical protein A2619_02620 [candidate division W...  70.2    3e-10
OLD18896.1 hypothetical protein AUI91_09650 [Acidobacteria bacter...  70.2    3e-10
RDZ91371.1 hypothetical protein DEQ92_22740, partial [Haloferax s...  67.1    3e-10
MBC7853613.1 hypothetical protein [Pirellulaceae bacterium]           69.4    3e-10
MBA3787561.1 hypothetical protein [Actinobacteria bacterium]          68.6    3e-10
KKW00375.1 hypothetical protein UY34_C0030G0007 [Parcubacteria gr...  71.3    4e-10
WP_191349191.1 hypothetical protein [Candidatus Neoanaerotignum g...  69.4    4e-10
MBI5092810.1 hypothetical protein [Candidatus Hydrogenedentes bac...  69.4    4e-10
RJR30374.1 hypothetical protein C4564_00115 [Candidatus Microgeno...  71.0    4e-10
KKU10417.1 hypothetical protein UX13_C0012G0001, partial [Candida...  67.9    4e-10
NIM49202.1 hypothetical protein [Gemmatimonadales bacterium]NIN10...  69.0    4e-10
PIE49413.1 hypothetical protein CSA39_02750 [Flavobacteriales bac...  69.4    4e-10
MBI4252560.1 hypothetical protein [Candidatus Uhrbacteria bacterium]  69.0    4e-10
AHB41123.1 Integral membrane protein [candidate division SR1 bact...  69.0    5e-10
TLX97152.1 zinc ribbon domain-containing protein [Thaumarchaeota ...  66.7    5e-10
QNN23843.1 hypothetical protein HED60_16730 [Planctomycetales bac...  69.4    5e-10
KKS32243.1 hypothetical protein UU93_C0008G0005 [Candidatus Amesb...  68.6    5e-10
OGF21061.1 hypothetical protein A2257_01450 [Candidatus Falkowbac...  68.6    5e-10
MYH55686.1 hypothetical protein [Acidimicrobiia bacterium]            67.5    5e-10
HHY99918.1 TPA: hypothetical protein [Tissierellia bacterium]         71.0    5e-10
MPZ48671.1 hypothetical protein [Dehalococcoidia bacterium]           68.6    5e-10
MBI4010427.1 glycerophosphoryl diester phosphodiesterase membrane...  68.3    6e-10
MBC8521355.1 glycerophosphoryl diester phosphodiesterase membrane...  66.7    6e-10
WP_119319608.1 hypothetical protein [Capsulimonas corticalis]GCE5...  69.8    6e-10
PIS20176.1 hypothetical protein COT53_01965 [Zetaproteobacteria b...  71.0    6e-10
NCB78264.1 hypothetical protein [Negativicutes bacterium]             67.9    8e-10
PSQ32180.1 hypothetical protein BRD09_04410 [Halobacteriales arch...  67.9    1e-09
MBI3790869.1 hypothetical protein [Gemmatimonadetes bacterium]        67.9    1e-09
MBO10016.1 hypothetical protein [Planctomycetaceae bacterium]         69.0    1e-09
ELY56510.1 hypothetical protein C491_13262 [Natronococcus amyloly...  66.3    1e-09
WP_199556060.1 hypothetical protein [Sandaracinobacter sp. SZY PN-1]  67.5    1e-09
NIP23869.1 hypothetical protein [Phycisphaerae bacterium]NIX27684...  66.3    1e-09
KKR21395.1 hypothetical protein UT48_C0009G0027, partial [Parcuba...  64.8    1e-09
MAQ77366.1 hypothetical protein [Candidatus Campbellbacteria bact...  67.5    1e-09
WP_051192030.1 glycerophosphoryl diester phosphodiesterase membra...  69.8    1e-09
NPA67212.1 hypothetical protein [Chlorobi bacterium]                  67.5    2e-09
OVE77039.1 hypothetical protein BVX98_03860 [bacterium F11]           69.0    2e-09
MXZ07158.1 hypothetical protein [Acidimicrobiia bacterium]MYD0493...  67.1    2e-09
MBD0371881.1 hypothetical protein [Pyrinomonadaceae bacterium]        67.9    2e-09
WP_175193162.1 hypothetical protein [Achromobacter deleyi]CAB3719...  69.0    2e-09
WP_071163930.1 glycerophosphoryl diester phosphodiesterase membra...  68.6    2e-09
MBC7807260.1 hypothetical protein [Akkermansiaceae bacterium]         67.1    2e-09
MBI2084736.1 hypothetical protein [Candidatus Aenigmarchaeota arc...  65.9    2e-09
RYF91076.1 hypothetical protein EON95_15685 [Caulobacteraceae bac...  66.7    2e-09
MAG61737.1 cysteine--tRNA ligase [Candidatus Pacearchaeota archaeon]  69.0    2e-09
OHE89611.1 hypothetical protein A3G75_05400 [Verrucomicrobia bact...  69.0    3e-09
GBD34248.1 hypothetical protein HRbin34_00577 [bacterium HR34]        68.6    3e-09
PWL88827.1 hypothetical protein DBY14_02330 [Escherichia coli]        68.6    3e-09
OGI62560.1 hypothetical protein A2818_02395 [Candidatus Nomurabac...  67.5    3e-09
MBF0521684.1 hypothetical protein [Candidatus Omnitrophica bacter...  66.3    3e-09
KKW30364.1 hypothetical protein UY74_C0043G0008 [Candidatus Kaise...  67.5    3e-09
MBI1270537.1 hypothetical protein [bacterium]                         65.9    3e-09
MBI5170268.1 hypothetical protein [Candidatus Eisenbacteria bacte...  66.3    3e-09
HBP55355.1 TPA: hypothetical protein [Verrucomicrobiales bacterium]   65.2    3e-09
KKR48633.1 hypothetical protein UT86_C0004G0119 [Candidatus Magas...  67.5    3e-09
NOZ68443.1 DUF975 family protein [Deferribacteres bacterium]          66.7    4e-09
WP_133516618.1 hypothetical protein [Methanimicrococcus blatticol...  67.5    4e-09
WP_113962173.1 hypothetical protein [Roseimicrobium gellanilyticu...  66.7    4e-09
OGG86161.1 hypothetical protein A2392_01720 [Candidatus Kaiserbac...  65.6    4e-09
MBI5585589.1 DUF975 family protein [Deltaproteobacteria bacterium]    66.3    4e-09
NLK36502.1 hypothetical protein [Epulopiscium sp.]                    66.3    4e-09
OHB25146.1 hypothetical protein A2X84_09095 [Desulfuromonadaceae ...  66.7    4e-09
MSP93144.1 hypothetical protein [Myxococcales bacterium]              67.1    4e-09
RMF97190.1 DUF975 family protein [Candidatus Schekmanbacteria bac...  67.5    5e-09
MXY43355.1 hypothetical protein [Dehalococcoidia bacterium]MYD517...  66.3    5e-09
WP_145056105.1 hypothetical protein [Lignipirellula cremea]QDU973...  67.1    5e-09
MBI2036982.1 hypothetical protein [Candidatus Liptonbacteria bact...  65.6    5e-09
NIA05393.1 hypothetical protein [Proteobacteria bacterium]            65.2    5e-09
MAG82522.1 hypothetical protein [Candidatus Poribacteria bacterium]   66.7    5e-09
WP_145201323.1 hypothetical protein [Thalassoglobus polymorphus]Q...  66.7    6e-09
MBI4122325.1 hypothetical protein [Parcubacteria group bacterium]     65.9    6e-09
MBF0930940.1 glycerophosphoryl diester phosphodiesterase membrane...  67.5    6e-09
WP_095981871.1 hypothetical protein [Melittangium boletus]ATB3390...  65.9    6e-09
MYF31094.1 hypothetical protein [Gammaproteobacteria bacterium]MY...  65.2    6e-09
MBR62226.1 hypothetical protein [Dehalococcoidia bacterium]           65.6    7e-09
NTU91881.1 hypothetical protein [Chlorobiaceae bacterium]             65.2    7e-09
WP_098796000.1 hypothetical protein [Bacillus sp. AFS040349]PGT87...  67.5    7e-09
HGV36320.1 TPA: hypothetical protein [Spirochaetes bacterium]         64.0    7e-09
CAA9462944.1 hypothetical protein AVDCRST_MAG38-339 [uncultured S...  65.9    8e-09
WP_077913226.1 DUF975 family protein [Listeria floridensis]EUJ282...  67.1    8e-09
MBI4982381.1 hypothetical protein [Candidatus Omnitrophica bacter...  65.2    8e-09
KKS47526.1 hypothetical protein UV09_C0004G0015 [Candidatus Gotte...  65.9    8e-09
WP_128545667.1 hypothetical protein [Larkinella soli]                 65.6    8e-09
OHB18443.1 hypothetical protein A2666_05150 [Parcubacteria group ...  66.7    9e-09
OGG70203.1 hypothetical protein A2929_03920 [Candidatus Kaiserbac...  65.6    9e-09
TSC61666.1 hypothetical protein G01um1014106_720 [Parcubacteria g...  65.6    1e-08
MBI1317494.1 hypothetical protein [Candidatus Hydrogenedens sp.]      64.8    1e-08
WP_091743029.1 hypothetical protein [Marininema mesophilum]SDX574...  66.3    1e-08
NLO94611.1 hypothetical protein [Firmicutes bacterium]                65.6    1e-08
WP_133164692.1 hypothetical protein [Candidatus Sulfotelmatomonas...  65.9    1e-08
KKW24533.1 hypothetical protein UY66_C0001G0034 [Parcubacteria gr...  65.2    1e-08
MBV70577.1 hypothetical protein [Myxococcales bacterium]              66.3    1e-08
NUN53930.1 hypothetical protein [Planctomycetaceae bacterium]         65.9    1e-08
NQU99098.1 hypothetical protein [Parcubacteria group bacterium]       64.4    1e-08
TAN57580.1 hypothetical protein EPN15_03890 [Patescibacteria grou...  66.3    1e-08
WP_071067494.1 glycerophosphoryl diester phosphodiesterase membra...  65.9    1e-08
HEC97179.1 TPA: hypothetical protein [Nitrospirae bacterium]          64.0    1e-08
MBI4145956.1 hypothetical protein [Candidatus Woesearchaeota arch...  64.8    1e-08
MBI3158699.1 hypothetical protein [Chloroflexi bacterium]             64.4    2e-08
NJN27555.1 hypothetical protein [Cyclobacteriaceae bacterium]         64.0    2e-08
OHA80890.1 hypothetical protein A2675_02245 [Candidatus Yonathbac...  65.2    2e-08
NCU42827.1 hypothetical protein [Candidatus Falkowbacteria bacter...  64.0    2e-08
WP_088618176.1 hypothetical protein [Methylovulum psychrotolerans...  64.0    2e-08
WP_080064285.1 hypothetical protein [Ruminiclostridium hungatei]O...  64.0    2e-08
MBI4039457.1 hypothetical protein [Candidatus Daviesbacteria bact...  64.0    2e-08
VTU00964.1 Uncharacterized protein OS=Haliangium ochraceum (strai...  64.4    2e-08
OHA00147.1 hypothetical protein A3C07_00290 [Candidatus Sungbacte...  64.8    2e-08
PIS23345.1 hypothetical protein COT49_00665 [candidate division W...  62.5    2e-08
RYX82517.1 hypothetical protein EON83_18975 [bacterium]               65.9    2e-08
WP_179512173.1 MULTISPECIES: hypothetical protein [unclassified S...  63.3    2e-08
WP_184452173.1 hypothetical protein [Schaalia hyovaginalis]MBB633...  65.2    2e-08
MBE3596173.1 glycerophosphoryl diester phosphodiesterase membrane...  62.9    2e-08
PKL92426.1 hypothetical protein CVV21_03705 [Candidatus Goldbacte...  64.4    3e-08
TML36175.1 hypothetical protein E6G29_06075 [Actinobacteria bacte...  64.8    3e-08
WP_110520753.1 hypothetical protein [Bacillus lacisalsi]PYZ96814....  62.5    3e-08
MAG37794.1 hypothetical protein [Candidatus Pacearchaeota archaeon]   63.6    3e-08
WP_072854441.1 glycerophosphodiester phosphodiesterase [Lactonifa...  65.6    3e-08
PZU50759.1 hypothetical protein DI568_03145 [Sphingomonas sp.]        63.3    3e-08
MBI4668874.1 hypothetical protein [Elusimicrobia bacterium]           60.6    3e-08
MBE6284254.1 hypothetical protein [Mediterranea massiliensis]         62.9    3e-08
TMD13252.1 hypothetical protein E6J07_08815 [Chloroflexi bacterium]   61.3    3e-08
HDI11102.1 TPA: hypothetical protein [Candidatus Acetothermia bac...  62.9    3e-08
MBA3531544.1 hypothetical protein [Ardenticatenales bacterium]        63.6    3e-08
MBI5170267.1 hypothetical protein [Candidatus Eisenbacteria bacte...  63.3    3e-08
MBE6030738.1 DUF975 family protein [Clostridiales bacterium]          64.4    3e-08
MBA3630351.1 hypothetical protein [Actinobacteria bacterium]          60.2    3e-08
RLF61093.1 hypothetical protein DRN25_01195 [Thermoplasmata archa...  63.6    4e-08
HHH55202.1 TPA: hypothetical protein [Bacteroidetes bacterium]        63.6    4e-08
WP_073559828.1 hypothetical protein [Archangium sp. Cb G35]OJT257...  62.9    4e-08
WP_132698640.1 hypothetical protein [Reinekea marinisedimentorum]...  62.5    4e-08
MBA2301696.1 hypothetical protein [Acidobacteria bacterium]           62.9    4e-08
WP_055953470.1 hypothetical protein [Curtobacterium sp. Leaf261]      64.8    4e-08
WP_060992333.1 hypothetical protein [Aliivibrio sifiae]PQJ84747.1...  62.5    4e-08
WP_114585485.1 glycerophosphoryl diester phosphodiesterase membra...  59.8    5e-08
WP_086784362.1 glycerophosphoryl diester phosphodiesterase membra...  63.6    5e-08
MBI2900576.1 protein kinase [Planctomycetes bacterium]                64.8    5e-08
HCR55653.1 TPA: hypothetical protein [Candidatus Saccharibacteria...  62.5    5e-08
HAE32562.1 TPA: hypothetical protein [Dehalococcoidia bacterium]      61.7    5e-08
NLL64538.1 hypothetical protein [Clostridiaceae bacterium]            64.0    5e-08
WP_053241145.1 hypothetical protein [Clostridium sp. DMHC 10]KOF5...  62.5    6e-08
NIO16973.1 hypothetical protein [Deltaproteobacteria bacterium]NI...  63.3    6e-08
MBI5135168.1 hypothetical protein [Candidatus Uhrbacteria bacterium]  63.3    6e-08
MAO63522.1 hypothetical protein [Balneola sp.]                        62.9    6e-08
WP_182141828.1 glycerophosphoryl diester phosphodiesterase membra...  61.7    6e-08
MBI4366411.1 hypothetical protein [Deltaproteobacteria bacterium]     63.3    6e-08
MBE6812653.1 DUF975 family protein [Ruminococcaceae bacterium]        62.5    6e-08
RKY65317.1 hypothetical protein DRQ08_06045 [Candidatus Latesciba...  62.1    6e-08
HGT34381.1 TPA: hypothetical protein [bacterium]                      64.0    6e-08
NOX30874.1 hypothetical protein [Actinobacteria bacterium]            62.9    6e-08
MBA3901750.1 hypothetical protein [Bacteroidetes bacterium]           62.9    7e-08
WP_187590111.1 glycerophosphoryl diester phosphodiesterase membra...  63.3    8e-08
HIG36735.1 TPA: hypothetical protein [Oceanospirillaceae bacterium]   60.9    8e-08
MBI4492985.1 hypothetical protein [Chloroflexi bacterium]             62.5    8e-08
MYC65673.1 hypothetical protein [Acidobacteriia bacterium]            61.3    8e-08
WP_016444535.1 glycerophosphoryl diester phosphodiesterase membra...  63.6    9e-08
GED96985.1 hypothetical protein nbrc107697_10240 [Gordonia crocea]    61.3    9e-08
MBA3264083.1 hypothetical protein [Thermoleophilaceae bacterium]      60.9    9e-08
WP_054970279.1 hypothetical protein [Alicyclobacillus ferrooxydan...  62.9    9e-08
WP_026942390.1 hypothetical protein [Hellea balneolensis]             62.5    9e-08
KKS94333.1 hypothetical protein UV70_C0002G0042 [Parcubacteria gr...  62.9    1e-07
PKL90739.1 hypothetical protein CVV21_11340 [Candidatus Goldbacte...  63.3    1e-07
RLD98598.1 hypothetical protein DRI91_02770, partial [Aquificae b...  61.3    1e-07
MBA2735288.1 protein kinase [Pyrinomonadaceae bacterium]              63.6    1e-07
WP_030890737.1 MULTISPECIES: hypothetical protein, partial [Strep...  63.6    1e-07
MBI2442806.1 DUF975 family protein [Candidatus Levybacteria bacte...  60.9    1e-07
RZM09416.1 hypothetical protein EOP67_70085, partial [Sphingomona...  60.6    1e-07
MBI5948430.1 hypothetical protein [Chloroflexi bacterium]             61.7    1e-07
TLZ73523.1 hypothetical protein E6K14_05275 [Euryarchaeota archaeon]  62.9    1e-07
MBJ25907.1 hypothetical protein [Flavobacteriaceae bacterium]         60.9    1e-07
OIO48197.1 hypothetical protein AUJ33_00165 [Parcubacteria group ...  62.5    1e-07
MBB71806.1 hypothetical protein [Legionellales bacterium]             61.7    1e-07
RZA05935.1 hypothetical protein EOP11_11585 [Proteobacteria bacte...  60.9    1e-07
PSQ42289.1 hypothetical protein BRD17_09020 [Halobacteriales arch...  61.3    1e-07
NCT55999.1 hypothetical protein [bacterium]                           61.7    1e-07
WP_193327074.1 hypothetical protein [Trueperella sp. 19M2397]QOQ3...  62.9    1e-07
ESS70677.1 hypothetical protein MGMO_120c00640 [Methyloglobulus m...  59.8    1e-07
OGZ24322.1 hypothetical protein A2896_01665 [Candidatus Nealsonba...  60.9    1e-07
ANM29506.1 hypothetical protein ABI59_07795 [Acidobacteria bacter...  64.0    2e-07
WP_187994411.1 glycerophosphoryl diester phosphodiesterase membra...  59.8    2e-07
MBI2568879.1 hypothetical protein [Candidatus Schekmanbacteria ba...  61.7    2e-07
WP_003393144.1 hypothetical protein [Brevibacillus borstelensis]E...  61.3    2e-07
MQG02979.1 hypothetical protein [SAR202 cluster bacterium]            61.3    2e-07
WP_038672995.1 hypothetical protein [Pelosinus sp. UFO1]AIF53084....  61.3    2e-07
MAF18131.1 hypothetical protein [Oceanospirillaceae bacterium]        60.9    2e-07
MBC19125.1 hypothetical protein [Planctomycetaceae bacterium]         62.5    2e-07
MYE53787.1 hypothetical protein [Chloroflexi bacterium]               61.7    2e-07
MBI4249967.1 hypothetical protein [Candidatus Uhrbacteria bacterium]  61.7    2e-07
HHM02727.1 TPA: hypothetical protein [Caldithrix abyssi]              61.7    2e-07
HGN33993.1 TPA: zinc ribbon domain-containing protein [Candidatus...  62.9    2e-07
MBC7405857.1 hypothetical protein [Candidatus Parcubacteria bacte...  61.3    2e-07
WP_092816523.1 hypothetical protein [Afifella marina]RAI17562.1 h...  60.2    2e-07
MBD3231235.1 hypothetical protein [Candidatus Dependentiae bacter...  60.6    2e-07
WP_095133100.1 glycerophosphoryl diester phosphodiesterase membra...  62.1    2e-07
MYA18471.1 hypothetical protein [Gammaproteobacteria bacterium]       60.9    2e-07
WP_167605441.1 hypothetical protein [Maribellus sp. Y2-1-60]          61.3    2e-07
WP_145122374.1 hypothetical protein [Rosistilla oblonga]              60.6    2e-07
MBI5354740.1 hypothetical protein [Chloroflexi bacterium]             61.3    2e-07
WP_091526934.1 hypothetical protein [Microlunatus soli]SDS97780.1...  62.1    2e-07
SEV91228.1 Uncharacterized membrane protein [[Clostridium] fimeta...  61.7    2e-07
NES04179.1 hypothetical protein [Okeania sp. SIO2F4]                  60.6    3e-07
KPV49372.1 hypothetical protein SE17_33025 [Kouleothrix aurantiaca]   60.9    3e-07
MBC8229532.1 hypothetical protein [bacterium]                         60.9    3e-07
KPQ20317.1 Protein of unknown function (DUF3426)/zinc-ribbon doma...  55.9    3e-07
PQM43996.1 hypothetical protein C1Y40_05845 [Mycobacterium talmon...  58.6    3e-07
CUP52935.1 Protein of uncharacterised function (DUF975) [Roseburi...  58.6    3e-07
MSR46955.1 hypothetical protein [Planctomycetes bacterium]            60.9    3e-07
MBC7328698.1 hypothetical protein [bacterium]                         62.1    3e-07
NBX42790.1 thioredoxin [Rhodobacteraceae bacterium]                   55.5    3e-07
HHK85603.1 TPA: hypothetical protein [Candidatus Buchananbacteria...  60.9    3e-07
KKR16226.1 hypothetical protein UT44_C0017G0018 [Candidatus Levyb...  59.4    3e-07
CRH61016.1 Membrane domain of glycerophosphoryl diester phosphodi...  61.7    3e-07
WP_193485678.1 hypothetical protein [Anaerotignum lactatifermenta...  60.6    3e-07
HDH90286.1 TPA: zinc ribbon domain-containing protein [Candidatus...  60.6    3e-07
MYC93824.1 hypothetical protein [Caldilineaceae bacterium SB0661_...  60.6    4e-07
MBI5794293.1 hypothetical protein [Candidatus Uhrbacteria bacterium]  60.2    4e-07
MBE9605143.1 hypothetical protein [Acetobacteraceae bacterium H6797]  60.2    4e-07
MBM65180.1 hypothetical protein [Myxococcales bacterium]              59.4    4e-07
PSP67089.1 hypothetical protein BRC85_07690, partial [Halobacteri...  57.5    4e-07
KAA0205949.1 hypothetical protein EDM68_03710 [Candidatus Uhrbact...  60.2    4e-07
MBE6287268.1 hypothetical protein [Mediterranea massiliensis]         60.2    4e-07
MBE6816950.1 DUF975 family protein [Ruminococcaceae bacterium]        61.3    4e-07
MBE6292719.1 tetratricopeptide repeat protein [Bacteroidales bact...  62.1    4e-07
TAK89232.1 hypothetical protein EPO04_04015 [Patescibacteria grou...  60.2    4e-07
MBJ7597377.1 glycerophosphoryl diester phosphodiesterase membrane...  61.3    4e-07
NUN52367.1 hypothetical protein [Planctomycetaceae bacterium]         60.2    4e-07
EFB62692.1 SPFH/Band 7/PHB domain protein [Lactobacillus gasseri ...  61.7    4e-07
WP_163843582.1 hypothetical protein [Nocardia cyriacigeorgica]NEW...  62.1    4e-07
PYK40849.1 hypothetical protein DME60_06390 [Verrucomicrobia bact...  57.5    4e-07
HGX27654.1 TPA: hypothetical protein [Candidatus Woesearchaeota a...  59.4    4e-07
NER49523.1 hypothetical protein [Symploca sp. SIO1A3]                 58.6    4e-07
MAZ38870.1 hypothetical protein [Legionellales bacterium]             60.2    4e-07
HEY94520.1 TPA: hypothetical protein [Dehalococcoidia bacterium]      60.2    5e-07
OGL66433.1 hypothetical protein A2856_01940 [Candidatus Uhrbacter...  59.8    5e-07
NJK90035.1 hypothetical protein [Myxococcales bacterium]              55.2    5e-07
PYQ05917.1 hypothetical protein DMF82_07395 [Acidobacteria bacter...  60.6    5e-07
WP_188939660.1 hypothetical protein [Nakamurella endophytica]GGL8...  60.2    5e-07
RME80321.1 hypothetical protein D6785_10490 [Planctomycetes bacte...  58.6    5e-07
MBI3305221.1 hypothetical protein [Candidatus Parcubacteria bacte...  60.6    6e-07
PSP25025.1 hypothetical protein BRC55_05155 [Cyanobacteria bacter...  58.2    6e-07
WP_185343483.1 DUF975 family protein [Listeria rocourtiae]MBC1436...  60.6    6e-07
WP_158539300.1 glycerophosphoryl diester phosphodiesterase membra...  58.2    6e-07
MBI5090966.1 glycerophosphoryl diester phosphodiesterase membrane...  59.8    6e-07
WP_014682444.1 hypothetical protein [Solitalea canadensis]AFD0922...  60.2    6e-07
MAZ30238.1 hypothetical protein [bacterium]                           60.2    6e-07
WP_165202354.1 hypothetical protein [Roseimicrobium sp. ORNL1]QIF...  59.4    6e-07
MBA3567853.1 hypothetical protein [Pyrinomonadaceae bacterium]        60.2    7e-07
HDS28937.1 TPA: hypothetical protein [Candidatus Acetothermia bac...  57.9    7e-07
OGG13937.1 hypothetical protein A3D77_03470 [Candidatus Gottesman...  58.6    7e-07
WP_150120661.1 zinc-ribbon domain-containing protein, partial [Su...  58.2    7e-07
HBS52247.1 TPA: hypothetical protein [Coxiellaceae bacterium]         58.2    7e-07
WP_144301311.1 hypothetical protein [Desulfovibrio indonesiensis]...  59.8    7e-07
HAN31510.1 TPA: hypothetical protein [Myxococcales bacterium]         57.9    7e-07
OQB68089.1 hypothetical protein BWX91_01341 [Spirochaetes bacteri...  59.8    8e-07
OGO58413.1 hypothetical protein A2V85_06000, partial [Chloroflexi...  59.8    8e-07
WP_081629009.1 PQQ-binding-like beta-propeller repeat protein [Sm...  61.3    8e-07
WP_193642973.1 glycerophosphoryl diester phosphodiesterase membra...  60.6    8e-07
HBF66094.1 TPA: hypothetical protein [Clostridium sp.]                58.6    8e-07
MBA2760881.1 hypothetical protein [Segetibacter sp.]                  59.0    8e-07
WP_013314060.1 hypothetical protein [Spirochaeta thermophila]ADN0...  60.2    8e-07
WP_115373442.1 stage II sporulation protein M [Adhaeribacter pall...  60.9    8e-07
WP_110520752.1 hypothetical protein [Bacillus lacisalsi]PYZ96813....  58.6    8e-07
MBD3366060.1 hypothetical protein [candidate division WWE3 bacter...  60.2    8e-07
WP_152891139.1 hypothetical protein [Clostridium tarantellae]MPQ4...  59.4    9e-07
HDH01909.1 TPA: hypothetical protein [Nitrospirae bacterium]          57.9    9e-07
NLE89106.1 FHA domain-containing protein [Myxococcales bacterium]     60.6    9e-07
HIP40010.1 TPA: hypothetical protein [Desulfocapsa sulfexigens]       60.6    9e-07
PYK18012.1 hypothetical protein DME55_08090 [Verrucomicrobia bact...  57.9    9e-07
WP_133753375.1 hypothetical protein [Naumannella halotolerans]TDT...  60.2    9e-07
OGY28135.1 hypothetical protein A2802_01820 [Candidatus Woykebact...  59.8    1e-06
WP_196492246.1 zinc-ribbon domain-containing protein, partial [Er...  53.6    1e-06
EGT3615446.1 hypothetical protein [Clostridium perfringens]           58.2    1e-06
KPK43062.1 hypothetical protein AMK72_13880 [Planctomycetes bacte...  60.2    1e-06
HCO84956.1 TPA: hypothetical protein [Arenibacter sp.]                55.9    1e-06
OFT38584.1 hypothetical protein HMPREF3163_05675 [Actinomyces sp....  59.4    1e-06
WP_144831788.1 MULTISPECIES: glycerophosphoryl diester phosphodie...  59.4    1e-06
HCA46319.1 TPA: hypothetical protein [Armatimonadetes bacterium]      59.4    1e-06
NMB70371.1 hypothetical protein [candidate division WWE3 bacterium]   57.1    1e-06
MAK60876.1 hypothetical protein [Ponticaulis sp.]                     59.0    1e-06
RME15377.1 hypothetical protein D6801_07530, partial [Alphaproteo...  54.4    1e-06
NLT64447.1 zinc-ribbon domain-containing protein [Clostridiales b...  59.4    1e-06
WP_196103494.1 hypothetical protein [Pontivivens sp. MT2928]QPH54...  58.2    1e-06
WP_185800092.1 hypothetical protein [Parasphingopyxis sp. GrpM-11...  59.0    1e-06
MXW30800.1 hypothetical protein [Chloroflexi bacterium]               57.9    1e-06
HIE43473.1 TPA: hypothetical protein [Candidatus Omnitrophica bac...  58.6    1e-06
PYU97057.1 hypothetical protein DMG25_00030 [Acidobacteria bacter...  60.2    1e-06
PIT95734.1 hypothetical protein COT94_04060 [Candidatus Falkowbac...  58.6    1e-06
MBI4101263.1 hypothetical protein [Candidatus Microgenomates bact...  59.0    1e-06
NLW13745.1 hypothetical protein [Trueperella sp.]                     59.4    1e-06
HGT40621.1 TPA: hypothetical protein [Schlesneria paludicola]         57.9    1e-06
OGL26321.1 hypothetical protein A2708_00885 [Candidatus Saccharib...  58.2    1e-06
NOY65511.1 hypothetical protein [Nitrospirae bacterium]               57.9    1e-06
WP_075603862.1 hypothetical protein [Saccharicrinis aurantiacus]      59.0    1e-06
OGL95714.1 hypothetical protein A2348_03745 [Candidatus Uhrbacter...  59.0    1e-06
MBI1728251.1 hypothetical protein [Candidatus Rokubacteria bacter...  58.2    1e-06
HEB57611.1 TPA: hypothetical protein [Gammaproteobacteria bacterium]  58.6    1e-06
WP_011745950.1 hypothetical protein [Chlorobium phaeobacteroides]...  57.9    1e-06
NDO19887.1 zinc ribbon domain-containing protein [Lachnospiraceae...  58.2    2e-06
WP_082152352.1 zinc-ribbon domain-containing protein [Candidatus ...  58.6    2e-06
HDH91445.1 TPA: hypothetical protein [Candidatus Aenigmarchaeota ...  57.5    2e-06
WP_163952864.1 hypothetical protein [Paenibacillus sp. SYP-B3998]     59.4    2e-06
HEA68366.1 TPA: hypothetical protein [Desulfobacterales bacterium]    58.2    2e-06
ERH25412.1 hypothetical protein HMPREF1979_00521 [Actinomyces joh...  58.6    2e-06
NBV61863.1 thioredoxin [Rhodobacteraceae bacterium]                   57.5    2e-06
OGG50252.1 hypothetical protein A2763_04725 [Candidatus Kaiserbac...  57.9    2e-06
RXZ76672.1 hypothetical protein EBB07_34185 [Paenibacillaceae bac...  59.0    2e-06
OGF51174.1 hypothetical protein A2231_01940 [Candidatus Firestone...  58.2    2e-06
WP_005045810.1 hypothetical protein [Halococcus salifodinae]EMA49...  58.2    2e-06
MBP48220.1 hypothetical protein [Myxococcales bacterium]              57.9    2e-06
HCC71773.1 TPA: hypothetical protein [Bacteroidales bacterium]        55.5    2e-06
WP_180275835.1 zinc-ribbon domain-containing protein, partial [Sp...  53.2    2e-06
KKS76411.1 hypothetical protein UV50_C0022G0002 [Parcubacteria gr...  58.6    2e-06
WP_034901039.1 hypothetical protein [Erythrobacter litoralis]AOL2...  57.9    2e-06
TAL09222.1 hypothetical protein EPO00_06175 [Chloroflexi bacterium]   59.0    2e-06
MAG90267.1 hypothetical protein [Rhodobacteraceae bacterium]          53.6    2e-06
PSP47766.1 hypothetical protein BRC75_08735 [Halobacteriales arch...  57.5    2e-06
WP_184732161.1 hypothetical protein [Streptomyces netropsis]MBB48...  57.9    2e-06
MBI4600142.1 hypothetical protein [Candidatus Uhrbacteria bacterium]  58.6    2e-06
HBZ40362.1 TPA: hypothetical protein [Erysipelotrichaceae bacterium]  57.1    2e-06
CCH73280.1 membrane hypothetical protein [Tetrasphaera australien...  59.0    2e-06
WP_149346280.1 hypothetical protein [Pedobacter sp. BS3]TZF85665....  58.2    2e-06
RKX99572.1 hypothetical protein DRP77_13020 [Candidatus Poribacte...  57.9    2e-06
STO01854.1 Protein of uncharacterised function (DUF975) [[Eubacte...  58.6    2e-06
RLG27558.1 hypothetical protein DRO03_11900 [Methanosarcinales ar...  55.5    2e-06
GGA60510.1 hypothetical protein GCM10008025_00670 [Ornithinibacil...  59.4    2e-06
QDT06426.1 hypothetical protein K227x_48360 [Planctomycetes bacte...  57.9    2e-06
NMC56649.1 hypothetical protein [Eubacteriaceae bacterium]            58.6    2e-06
WP_036850679.1 hypothetical protein [Porphyromonas macacae]KGO006...  57.5    2e-06
KKT31394.1 hypothetical protein UW18_C0004G0051 [Microgenomates g...  57.5    2e-06
PIN91066.1 hypothetical protein COU57_02180 [Candidatus Pacearcha...  58.6    3e-06
HEG99323.1 TPA: hypothetical protein [Thermoleophilum album]          57.1    3e-06
WP_174134421.1 hypothetical protein [Sulfitobacter sp. 1151]NSX53...  57.9    3e-06
MBA2655141.1 hypothetical protein [Gammaproteobacteria bacterium]     57.5    3e-06
MSY37893.1 hypothetical protein [Actinobacteria bacterium]            57.1    3e-06
MAS95237.1 hypothetical protein [Verrucomicrobiales bacterium]        58.6    3e-06
CAD7185640.1 unnamed protein product [Sepia pharaonis]                59.4    3e-06
PKQ21473.1 hypothetical protein CVT65_18350, partial [Actinobacte...  55.5    3e-06
NNJ70809.1 DUF975 family protein [Kiritimatiellales bacterium]        57.1    3e-06
KTR86842.1 hypothetical protein NS354_02490 [Leucobacter chromiir...  58.6    3e-06
NWG92281.1 zinc-ribbon domain-containing protein [Parvularculacea...  52.9    3e-06
NNF63771.1 hypothetical protein [Acidimicrobiia bacterium]            57.5    3e-06
NIP30995.1 hypothetical protein [Candidatus Dadabacteria bacterium]   55.9    3e-06
WP_004099200.1 glycerophosphoryl diester phosphodiesterase membra...  57.5    3e-06
WP_138191041.1 glycerophosphodiester phosphodiesterase [Culicoidi...  59.0    3e-06
MBI1946177.1 glycerophosphoryl diester phosphodiesterase membrane...  57.9    3e-06
RHA44277.1 hypothetical protein D1825_02125 [Cellulomonas rhizosp...  55.9    3e-06
MBI5634317.1 hypothetical protein [Nitrospirae bacterium]             59.0    4e-06
MBI4705907.1 hypothetical protein [Deltaproteobacteria bacterium]     57.1    4e-06
SES89824.1 hypothetical protein SAMN05443572_101512 [Myxococcus f...  57.5    4e-06
WP_162601381.1 hypothetical protein [Occallatibacter savannae]        57.9    4e-06
TXH43255.1 hypothetical protein E6Q90_07385 [Actinobacteria bacte...  58.2    4e-06
MAG18385.1 hypothetical protein [Candidatus Diapherotrites archaeon]  58.2    4e-06
WP_167149738.1 glycerophosphoryl diester phosphodiesterase membra...  58.6    4e-06
WP_155913061.1 hypothetical protein [Mycolicibacterium sp. CBMA 3...  55.5    4e-06
MBE6875750.1 DUF975 family protein [Ruminococcus sp.]                 58.2    4e-06
MBD3176542.1 hypothetical protein [Armatimonadia bacterium]           56.7    4e-06
PIT97047.1 hypothetical protein COT77_03640 [Candidatus Berkelbac...  56.7    4e-06
WP_066055597.1 hypothetical protein [Bacillus korlensis]              57.1    4e-06
WP_039391209.1 hypothetical protein [Novosphingobium sp. MBES04]G...  56.3    4e-06
THB73204.1 hypothetical protein D3926_24290 [Desulfobacteraceae b...  56.7    4e-06
PZS01445.1 hypothetical protein DLM69_04945 [Chloroflexi bacterium]   57.9    4e-06
HIH89202.1 TPA: hypothetical protein [Candidatus Bathyarchaeota a...  57.5    4e-06
MBI2922678.1 hypothetical protein [Planctomycetes bacterium]          56.7    4e-06
PHQ62737.1 hypothetical protein COC10_09890 [Sphingobium sp.]         56.3    4e-06
WP_142094151.1 hypothetical protein [Propioniferax innocua]TQL583...  58.2    4e-06
WP_114575805.1 hypothetical protein [Saliphagus sp. LR7]              57.1    5e-06
WP_152391887.1 hypothetical protein [Paenibacillus guangzhouensis]    56.7    5e-06
MBI9051909.1 hypothetical protein [Anaerolineaceae bacterium]         56.7    5e-06
MBR97464.1 hypothetical protein [Dehalococcoidia bacterium]           57.9    5e-06
HAJ90839.1 TPA: hypothetical protein [Rhodospirillaceae bacterium]    52.9    5e-06
SMF15505.1 hypothetical protein SAMN02745866_01010 [Alteromonadac...  58.6    5e-06
MYE37998.1 hypothetical protein [Candidatus Spechtbacteria bacter...  57.9    5e-06
PIR67629.1 hypothetical protein COU50_02265 [bacterium CG10_big_f...  56.7    5e-06
MBI3887958.1 hypothetical protein [Candidatus Microgenomates bact...  55.5    5e-06
HFG18269.1 TPA: DUF3426 domain-containing protein [Deltaproteobac...  57.1    5e-06
OLP76399.1 Galectin-3-binding protein B [Symbiodinium microadriat...  59.0    5e-06
NUN14493.1 FHA domain-containing protein [Myxococcales bacterium]     58.2    6e-06
NYI57516.1 hypothetical protein [Cellulomonas soli]                   57.5    6e-06
KTT97616.1 hypothetical protein NS355_11030, partial [Sphingomona...  52.5    6e-06
WP_011194601.1 hypothetical protein [Symbiobacterium thermophilum...  57.1    6e-06
MBC8095534.1 protein kinase [Akkermansiaceae bacterium]               58.2    6e-06
GDX17703.1 hypothetical protein LBMAG05_09990 [Actinobacteria bac...  57.1    6e-06
MBI5844290.1 hypothetical protein [Deltaproteobacteria bacterium]     56.7    7e-06
WP_166292691.1 glycerophosphoryl diester phosphodiesterase membra...  57.5    7e-06
PIP97860.1 thioredoxin, partial [Rhodobacterales bacterium CG18_b...  51.3    7e-06
WP_003536493.1 MULTISPECIES: DUF975 family protein [Erysipelotric...  57.9    7e-06
NLY05342.1 hypothetical protein [Candidatus Atribacteria bacterium]   55.9    7e-06
NLA27476.1 hypothetical protein [Firmicutes bacterium]                56.7    7e-06
WP_067390131.1 DUF975 family protein [Enterococcus canis]OJG19460...  56.3    7e-06
HAC1236277.1 TPA: DUF975 family protein [Listeria monocytogenes]      56.3    7e-06
NMB76397.1 hypothetical protein [Myxococcales bacterium]              51.7    7e-06
MBI2551190.1 hypothetical protein [Candidatus Uhrbacteria bacterium]  56.7    8e-06
NBO29941.1 hypothetical protein [Synechococcaceae bacterium WB6_1...  53.2    8e-06
HAV61890.1 TPA: hypothetical protein [Verrucomicrobiales bacterium]   57.5    8e-06
PIS11650.1 hypothetical protein COT73_02845 [Bdellovibrio sp. CG1...  55.5    8e-06
KAF5813155.1 hypothetical protein HanXRQr2_Chr03g0094911 [Heliant...  56.3    8e-06
KYK30526.1 hypothetical protein AYK19_18230 [Theionarchaea archae...  56.7    8e-06
HBB03643.1 TPA: hypothetical protein [Patescibacteria group bacte...  54.4    8e-06
MBI3091688.1 hypothetical protein [Candidatus Tectomicrobia bacte...  55.9    8e-06
WP_191474837.1 hypothetical protein [Candidatus Neoanaerotignum t...  56.3    8e-06
WP_175475016.1 glycerophosphoryl diester phosphodiesterase membra...  55.9    9e-06
WP_124954075.1 hypothetical protein [Halomarina oriensis]RRJ32173...  56.3    9e-06
KIE04164.1 hypothetical protein NF27_JF00420 [Candidatus Jidaibac...  55.9    9e-06
WP_184342077.1 hypothetical protein [Prosthecobacter vanneervenii...  56.3    9e-06
TDJ55875.1 hypothetical protein E2O47_03395 [Gemmatimonadetes bac...  56.7    9e-06
NLE88109.1 hypothetical protein [Myxococcales bacterium]              54.4    9e-06
MBI4733040.1 hypothetical protein [Chloroflexi bacterium]             56.7    9e-06
WP_176476741.1 zinc-ribbon domain-containing protein, partial [Ya...  52.9    9e-06
RLB74892.1 hypothetical protein DRH06_09025, partial [Deltaproteo...  50.9    9e-06
RMG69868.1 hypothetical protein D6710_08015, partial [Nitrospirae...  53.2    9e-06
OGC51151.1 hypothetical protein A2982_03115 [candidate division W...  57.1    9e-06
NJO07805.1 hypothetical protein [Chloroflexaceae bacterium]           54.4    9e-06
KEP22454.1 hypothetical protein DA06_21770, partial [Georgenia sp...  53.6    9e-06
HHB81612.1 TPA: thioredoxin [Aliiroseovarius sp.]                     51.3    1e-05
MBC7537647.1 hypothetical protein [Bacteriovorax sp.]                 55.2    1e-05
NNL27683.1 hypothetical protein [Acidimicrobiia bacterium]            55.9    1e-05
WP_152604154.1 hypothetical protein [Vibrio tubiashii]                54.4    1e-05
NNM90246.1 hypothetical protein [Bacilli bacterium]                   56.3    1e-05
WP_166977408.1 MULTISPECIES: hypothetical protein [unclassified A...  56.7    1e-05
WP_102216465.1 glycerophosphoryl diester phosphodiesterase membra...  57.1    1e-05
WP_101552891.1 hypothetical protein [Bacillus sp. UMB0728]PLR7012...  55.9    1e-05
PIE56355.1 hypothetical protein CSA34_04545 [Desulfobulbus propio...  57.1    1e-05
MBI1309030.1 DUF3426 domain-containing protein [Proteobacteria ba...  55.9    1e-05
OFX33191.1 hypothetical protein A2Z07_03805, partial [Armatimonad...  54.4    1e-05
EKE01476.1 hypothetical protein ACD_21C00122G0007 [uncultured bac...  55.9    1e-05
QDU63139.1 hypothetical protein Pan216_40140 [Planctomycetes bact...  55.5    1e-05
WP_073005725.1 hypothetical protein [Clostridium amylolyticum]SHI...  56.3    1e-05
PYQ09707.1 hypothetical protein DMH00_12530 [Acidobacteria bacter...  55.9    1e-05
OGS54086.1 hypothetical protein A2Y20_10305 [Firmicutes bacterium...  55.9    1e-05
HBB65106.1 TPA: hypothetical protein [Candidatus Vogelbacteria ba...  55.5    1e-05
TRW99263.1 hypothetical protein FNJ84_00865 [Paracoccus sp. M683]     56.3    1e-05
RME69568.1 hypothetical protein D6778_00320 [Nitrospirae bacterium]   55.5    1e-05
MBD3250178.1 hypothetical protein [Candidatus Pacebacteria bacter...  55.5    1e-05
2NB9_A Solution structure of ZitP zinc finger [Caulobacter vibrio...  50.9    1e-05
PIR44495.1 hypothetical protein COV10_04550 [Candidatus Vogelbact...  54.8    1e-05
OUW89139.1 hypothetical protein CBD86_00810 [Gammaproteobacteria ...  55.9    1e-05
RMG72764.1 hypothetical protein D6710_04330, partial [Nitrospirae...  54.8    1e-05
TFH65499.1 hypothetical protein E4G91_02245 [candidate division Z...  55.5    1e-05
TSC58475.1 hypothetical protein Greene041662_736 [Candidatus Pere...  54.8    1e-05
OQX68453.1 hypothetical protein B6A08_09975 [Sorangiineae bacteri...  55.9    1e-05
NVK36973.1 hypothetical protein [Gammaproteobacteria bacterium]       55.5    1e-05
WP_084613712.1 zinc-ribbon domain-containing protein [Roseibacter...  55.2    1e-05
HGX21135.1 TPA: DUF4339 domain-containing protein [Verrucomicrobi...  55.9    1e-05
KKP69379.1 hypothetical protein UR67_C0007G0084 [candidate divisi...  55.5    1e-05
WP_125216393.1 zinc-ribbon domain-containing protein [Rickettsial...  55.2    1e-05
HID26986.1 TPA: hypothetical protein [Methanosarcinales archaeon]     56.3    1e-05
HDH96468.1 TPA: tetratricopeptide repeat protein [Proteobacteria ...  56.7    1e-05
MAX26220.1 hypothetical protein [Phycisphaeraceae bacterium]          57.1    1e-05
GFZ94078.1 hypothetical protein CYANOKiyG1_04740 [Okeania sp. KiyG1]  54.0    1e-05
NJQ97971.1 hypothetical protein [Hydrococcus sp. CSU_1_8]             52.9    1e-05
NTV33791.1 hypothetical protein [Deltaproteobacteria bacterium]       54.8    1e-05
KSW17292.1 hypothetical protein ATM99_18135 [Cellulomonas sp. B6]     55.9    1e-05
HAJ00054.1 TPA: hypothetical protein [Dehalococcoidia bacterium]      53.2    1e-05
MBA2627168.1 zinc-ribbon domain-containing protein [Gemmatimonada...  52.1    1e-05
XP_001639125.2 inner centromere protein [Nematostella vectensis]      57.1    1e-05
RMG15829.1 hypothetical protein D6731_07500 [Planctomycetes bacte...  55.5    1e-05
NOU39593.1 DUF975 family protein [Ferruginibacter sp.]                54.8    1e-05
MBE9520865.1 hypothetical protein [Proteobacteria bacterium]          55.5    1e-05
TAK07581.1 hypothetical protein EPO38_12535, partial [Rhizorhabdu...  52.9    2e-05
OYD09572.1 hypothetical protein CHM34_00725 [Paludifilum halophilum]  55.9    2e-05
MBA3895303.1 zinc-ribbon domain-containing protein [Gemmatimonada...  50.2    2e-05
TFG50702.1 hypothetical protein E4H37_08895, partial [Gemmatimona...  50.9    2e-05
AMB93864.1 hypothetical protein AWM72_03370 [Aerococcus sanguinic...  56.7    2e-05
MBE0636037.1 hypothetical protein [Candidatus Bipolaricaulota bac...  55.2    2e-05
RKH91544.1 hypothetical protein D7Y13_38620, partial [Corallococc...  50.2    2e-05
MAY79321.1 hypothetical protein [Deltaproteobacteria bacterium]       55.2    2e-05
MBK0399387.1 zinc-ribbon domain-containing protein [Limibaculum s...  56.3    2e-05
RVX03270.1 hypothetical protein CK203_020024 [Vitis vinifera]         56.3    2e-05
WP_180325520.1 zinc-ribbon domain-containing protein, partial [Rh...  55.2    2e-05
WP_187707700.1 glycerophosphoryl diester phosphodiesterase membra...  54.8    2e-05
MBF1305591.1 hypothetical protein [Oribacterium sinus]                56.3    2e-05
KEY99875.1 membrane protein, partial [Sphingomonas sp. BHC-A]         52.5    2e-05
MYD36770.1 hypothetical protein [Dehalococcoidia bacterium]           55.9    2e-05
HAH45700.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    55.2    2e-05
NDF13025.1 hypothetical protein [Proteobacteria bacterium]            54.8    2e-05
SUA44950.1 Uncharacterised protein [Nocardia africana]                56.7    2e-05
WP_133494935.1 hypothetical protein [Stakelama pacifica]              55.5    2e-05
MBD3160354.1 hypothetical protein [Candidatus Lokiarchaeota archa...  55.5    2e-05
MBA3394167.1 hypothetical protein [Deltaproteobacteria bacterium]     54.8    2e-05
OUO25217.1 hypothetical protein B5F87_18145 [Eubacterium sp. An3]     55.5    2e-05
NJS13976.1 hypothetical protein [Sphingopyxis sp.]                    53.6    2e-05
MBI4017641.1 hypothetical protein [Candidatus Aenigmarchaeota arc...  55.2    2e-05
CDE11622.1 uncharacterized conserved membrane protein [Clostridiu...  55.5    2e-05
TGU41990.1 hypothetical protein EN829_072995, partial [Mesorhizob...  51.3    2e-05
NLD60403.1 DUF975 family protein [Clostridiales bacterium]            54.4    2e-05
WP_129120554.1 hypothetical protein, partial [Deinococcus metalli...  52.9    2e-05
WP_073996011.1 hypothetical protein [Arcanobacterium urinimassili...  55.9    2e-05
NOZ34534.1 hypothetical protein [Chlorobi bacterium]                  53.2    2e-05
KPQ16331.1 zinc-ribbon domain [Rhodobacteraceae bacterium HLUCCO18]   55.9    2e-05
WP_116689066.1 hypothetical protein [Pelagibaculum spongiae]PVZ63...  55.2    2e-05
MAR30385.1 hypothetical protein [Candidatus Marinimicrobia bacter...  54.8    2e-05
RNJ62560.1 thioredoxin [Porphyrobacter sp. IPPAS B-1204]              55.5    2e-05
WP_124054136.1 glycerophosphoryl diester phosphodiesterase membra...  55.9    2e-05
NLE65142.1 hypothetical protein [Elusimicrobia bacterium]             55.2    2e-05
MBE9555140.1 zinc-ribbon domain-containing protein [Proteobacteri...  51.7    2e-05
WP_042268190.1 DUF975 family protein, partial [Clostridium perfri...  54.4    2e-05
WP_126643587.1 hypothetical protein [Embleya hyalina]GCE02021.1 m...  56.3    2e-05
WP_101754290.1 zinc-ribbon domain-containing protein [Paracoccus ...  54.0    2e-05
MBI4023262.1 hypothetical protein [Candidatus Berkelbacteria bact...  54.4    2e-05
NOU34840.1 hypothetical protein [Polyangiaceae bacterium]             49.8    3e-05
WP_166885743.1 hypothetical protein [Massilia sp. CCM 8734]NHZ950...  54.4    3e-05
HBH00242.1 TPA: hypothetical protein [Rhodobacteraceae bacterium]     50.2    3e-05
MBI2303021.1 hypothetical protein [Armatimonadetes bacterium]         54.4    3e-05
TNC74117.1 hypothetical protein FHG71_02655 [Rubellimicrobium ros...  55.5    3e-05
RME92470.1 DUF4339 domain-containing protein [Verrucomicrobia bac...  55.2    3e-05
CCX41659.1 uncharacterized protein BN454_00973 [Clostridium sp. C...  55.2    3e-05
WP_131283476.1 DUF975 family protein [Alloscardovia theropitheci]...  55.2    3e-05
HHB83669.1 TPA: hypothetical protein [Devosia sp.]                    50.9    3e-05
KYC30039.1 hypothetical protein A0J57_22825 [Sphingobium sp. 22B]     50.9    3e-05
PKN03838.1 hypothetical protein CVU75_00255 [Candidatus Dependent...  54.4    3e-05
GAX20217.1 hypothetical protein FisN_12Hu062 [Fistulifera solaris]    54.8    3e-05
TNE61745.1 hypothetical protein EP335_15095 [Alphaproteobacteria ...  55.5    3e-05
MBI2344541.1 hypothetical protein [Candidatus Dependentiae bacter...  54.4    3e-05
NKX48573.1 hypothetical protein [Rhodobacteraceae bacterium R_SAG8]   50.2    3e-05
KPJ72821.1 hypothetical protein AMJ52_05145, partial [candidate d...  50.5    3e-05
WP_153653054.1 hypothetical protein [Aeromicrobium sp. MF47]QGG41...  54.4    3e-05
WP_157151108.1 hypothetical protein [Brachyspira sp. SAP_772]         54.4    3e-05
MBG6121300.1 hypothetical protein [Corynebacterium aquatimens]        55.2    3e-05
WP_012626308.1 RDD family protein [Cyanothece sp. PCC 7425]ACL432...  55.9    3e-05
MBI3963363.1 hypothetical protein [Candidatus Kerfeldbacteria bac...  51.7    3e-05
MAQ70459.1 hypothetical protein [Flavobacteriales bacterium]          53.6    3e-05
WP_131156190.1 hypothetical protein [Egibacter rhizosphaerae]QBI2...  55.2    3e-05
MBI2410862.1 hypothetical protein [Candidatus Kerfeldbacteria bac...  54.4    3e-05
TQF77868.1 hypothetical protein FK498_10925, partial [Elioraea sp...  53.2    3e-05
NOZ34535.1 hypothetical protein [Chlorobi bacterium]                  54.8    3e-05
HBH89043.1 TPA: hypothetical protein [Hyphomonadaceae bacterium]      49.8    3e-05
WP_052433283.1 hypothetical protein [Streptacidiphilus carbonis]      55.5    3e-05
TET33779.1 hypothetical protein E3J61_03705 [Candidatus Dependent...  54.0    3e-05
WP_018632467.1 zinc-ribbon domain-containing protein [Neomegalone...  54.0    3e-05
MBC7540212.1 hypothetical protein [Bacteriovorax sp.]                 54.0    3e-05
MBI2252145.1 hypothetical protein [Armatimonadetes bacterium]         53.6    3e-05
MTI96510.1 hypothetical protein [Firmicutes bacterium]                54.4    4e-05
HHR82506.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  52.5    4e-05
WP_123193109.1 DUF975 family protein [Paraeggerthella hongkongens...  54.0    4e-05
HEN14014.1 TPA: zinc ribbon domain-containing protein [Schlesneri...  53.6    4e-05
TAM99469.1 hypothetical protein EPN39_06515, partial [Chitinophag...  53.6    4e-05
MBA3615465.1 hypothetical protein [Rubrobacteraceae bacterium]        52.9    4e-05
NBP71683.1 hypothetical protein [Alphaproteobacteria bacterium]       54.8    4e-05
PKL18483.1 hypothetical protein CVV49_05870 [Spirochaetae bacteri...  54.8    4e-05
NBU84785.1 thioredoxin [Sphingomonadaceae bacterium]                  50.5    4e-05
MBA3344587.1 zinc-ribbon domain-containing protein [Gemmatimonada...  50.9    4e-05
NNK61736.1 hypothetical protein [Gemmatimonadetes bacterium]          51.3    4e-05
MBE0592420.1 hypothetical protein [Gemmatimonadales bacterium]        52.9    4e-05
WP_021761059.1 hypothetical protein [Desulfovibrio gigas]AGW14099...  54.0    4e-05
VVB74047.1 Uncharacterised protein [uncultured archaeon]              54.4    4e-05
WP_156825759.1 hypothetical protein [Lewinella cohaerens]             54.4    4e-05
WP_131848433.1 hypothetical protein [Baia soyae]TCP69232.1 hypoth...  53.6    4e-05
EEY34820.1 hypothetical protein HMPREF0554_2324 [Leptotrichia goo...  53.6    4e-05
WP_153546430.1 zinc-ribbon domain-containing protein [Epibacteriu...  54.8    4e-05
MBA1148201.1 hypothetical protein [Ectothiorhodospiraceae bacteri...  54.4    4e-05
NDC14010.1 hypothetical protein [Synechococcaceae bacterium WB9_2...  53.2    4e-05
WP_048481137.1 glycerophosphoryl diester phosphodiesterase membra...  54.4    4e-05
MUL66471.1 hypothetical protein [Mycolicibacterium sp. CBMA 234]      53.6    4e-05
WP_180946190.1 zinc-ribbon domain-containing protein, partial [Co...  49.0    4e-05
WP_193183634.1 hypothetical protein [Nisaea sp. NBU1469]              54.0    4e-05
PCI01364.1 hypothetical protein COB76_01515 [Alphaproteobacteria ...  53.6    4e-05
NLT20123.1 zinc-ribbon domain-containing protein [Syntrophomonada...  55.5    4e-05
OGK62482.1 hypothetical protein A3K47_02720 [Candidatus Roizmanba...  53.6    4e-05
MBF0386199.1 hypothetical protein [Candidatus Omnitrophica bacter...  53.6    4e-05
TNE64350.1 hypothetical protein EP335_07760 [Alphaproteobacteria ...  54.0    4e-05
NNK85474.1 cyclic nucleotide-binding domain-containing protein [D...  54.4    5e-05
HCF16891.1 TPA: hypothetical protein [Rhodospirillum rubrum]          49.4    5e-05
KAF0121402.1 hypothetical protein FD151_1246, partial [bacterium]     49.4    5e-05
MBI4438882.1 hypothetical protein [Candidatus Woesearchaeota arch...  54.8    5e-05
KAF5178990.1 hypothetical protein FRX31_031418 [Thalictrum thalic...  52.5    5e-05
WP_113930654.1 hypothetical protein [Bacillus sp. P14.5]              54.4    5e-05
WP_034232473.1 hypothetical protein [Arcanobacterium sp. S3PF19]K...  54.8    5e-05
TFH27306.1 hypothetical protein E4H00_09665, partial [Myxococcale...  54.4    5e-05
HBC14709.1 TPA: thioredoxin [Erythrobacter sp.]                       51.3    5e-05
WP_174287002.1 zinc-ribbon domain-containing protein [Sphingomona...  53.6    5e-05
RJP42402.1 hypothetical protein C4547_00650 [Phycisphaerales bact...  54.8    5e-05
NNF52901.1 hypothetical protein [Acidimicrobiales bacterium]          54.0    5e-05
HAW80520.1 TPA: hypothetical protein [Balneola sp.]                   52.9    5e-05
MBI1179550.1 DUF3426 domain-containing protein [Alphaproteobacter...  54.4    5e-05
WP_198924911.1 zinc-ribbon domain-containing protein, partial [Ni...  48.6    5e-05
MAH04829.1 hypothetical protein [Alphaproteobacteria bacterium]       54.0    5e-05
OQC25098.1 hypothetical protein BWX68_01720 [Verrucomicrobia bact...  54.4    5e-05
PIW37039.1 hypothetical protein COW24_02120 [Candidatus Kerfeldba...  54.0    6e-05
WP_128145248.1 hypothetical protein [Nocardia africana]               54.8    6e-05
PZR11478.1 hypothetical protein DI536_17790 [Archangium gephyra]      54.0    6e-05
MAX12684.1 hypothetical protein [Candidatus Marinimicrobia bacter...  52.9    6e-05
WP_188872400.1 hypothetical protein [Halarchaeum rubridurum]          53.6    6e-05
MBI2792287.1 hypothetical protein [Gammaproteobacteria bacterium]     53.6    6e-05
PYV21798.1 hypothetical protein DMG24_18580 [Acidobacteria bacter...  54.8    6e-05
WP_025079669.1 hypothetical protein [Porphyromonas macacae]           52.5    6e-05
MBJ02401.1 hypothetical protein [Planctomycetes bacterium]            53.6    6e-05
RKX70378.1 hypothetical protein DRP53_05190 [candidate division W...  51.3    6e-05
TSA50649.1 zinc-ribbon domain-containing protein [archaeon]           54.0    6e-05
OYW46715.1 hypothetical protein B7Z36_04565 [Novosphingobium sp. ...  53.2    6e-05
HHR99629.1 TPA: hypothetical protein [Acidobacteria bacterium]        53.2    6e-05
KKU73052.1 hypothetical protein UX98_C0012G0012 [Parcubacteria gr...  53.2    6e-05
TXD43503.1 hypothetical protein FRC96_01695, partial [Bradymonada...  52.9    6e-05
MBC8791299.1 hypothetical protein [Tagaea sp. CACIAM 22H2]            54.4    6e-05
WP_066156678.1 hypothetical protein [Alkalihalobacillus krulwichi...  52.9    6e-05
WP_119895037.1 hypothetical protein [Pseudomonas sp. K2W31S-8]AYC...  52.9    6e-05
PTC38616.1 hypothetical protein CLJ1_0894 [Pseudomonas aeruginosa]    54.0    6e-05
NIM07516.1 hypothetical protein [Armatimonadetes bacterium]NIO989...  50.9    6e-05
OPZ67706.1 hypothetical protein BWY81_01183 [Firmicutes bacterium...  54.0    6e-05
TDI58471.1 hypothetical protein E2O92_09730 [Alphaproteobacteria ...  53.6    7e-05
PYT06496.1 hypothetical protein DMF60_09230 [Acidobacteria bacter...  54.4    7e-05
MBD3241668.1 hypothetical protein [Chitinivibrionales bacterium]      52.9    7e-05
WP_191430587.1 DUF975 family protein, partial [Lachnoclostridium ...  51.3    7e-05
WP_102153783.1 MULTISPECIES: glycerophosphoryl diester phosphodie...  53.6    7e-05
MSR31538.1 hypothetical protein [Gemmataceae bacterium]               54.0    7e-05
WP_173081207.1 hypothetical protein [Phytohabitans rumicis]GFJ942...  53.2    7e-05
WP_169701313.1 zinc-ribbon domain-containing protein [Planktomari...  52.9    7e-05
MBI2825313.1 hypothetical protein [Planctomycetia bacterium]          54.4    7e-05
VVB07961.1 unnamed protein product [Arabis nemorensis]                54.8    7e-05
MBA2298610.1 hypothetical protein [Actinobacteria bacterium]          52.1    7e-05
NND02970.1 DUF2510 domain-containing protein [Acidimicrobiia bact...  54.0    7e-05
NOY23820.1 hypothetical protein [Acidobacteria bacterium]             52.9    7e-05
HBJ76497.1 TPA: hypothetical protein [Porphyromonadaceae bacterium]   52.5    8e-05
WP_142094079.1 glycerophosphoryl diester phosphodiesterase membra...  54.4    8e-05
WP_146540482.1 zinc-ribbon domain-containing protein [Reyranella ...  52.5    8e-05
ABC77958.1 hypothetical transport protein [Syntrophus aciditrophi...  54.8    8e-05
TMK56128.1 hypothetical protein E6G51_10770 [Actinobacteria bacte...  52.9    8e-05
OGH68413.1 hypothetical protein A3D53_03145 [Candidatus Magasanik...  53.2    8e-05
MBI2415545.1 hypothetical protein [Candidatus Kerfeldbacteria bac...  53.2    8e-05
WP_199729054.1 zinc-ribbon domain-containing protein, partial [Co...  49.0    8e-05
OAS14804.1 hypothetical protein A8708_04695 [Paenibacillus oryzis...  52.1    8e-05
MBD3181089.1 hypothetical protein [Candidatus Poribacteria bacter...  53.6    8e-05
QOV90846.1 hypothetical protein IPV69_05665 [Phycisphaerales bact...  54.8    8e-05
MBA3259230.1 zinc-ribbon domain-containing protein [Gemmatimonada...  48.6    9e-05
HCG96087.1 TPA: hypothetical protein [Halieaceae bacterium]           53.2    9e-05
OHA53576.1 hypothetical protein A3A30_00260 [Candidatus Terrybact...  53.6    9e-05
NDC59313.1 DUF3426 domain-containing protein [Alphaproteobacteria...  54.0    9e-05
RMF67257.1 hypothetical protein D6742_07940 [Cyanobacteria bacter...  52.5    9e-05
NLM97551.1 hypothetical protein [Halanaerobiaceae bacterium]          53.6    9e-05
HBF67473.1 TPA: hypothetical protein [Candidatus Magasanikbacteri...  52.9    9e-05
MBI5354742.1 glycerophosphoryl diester phosphodiesterase membrane...  51.3    9e-05
PYV11776.1 hypothetical protein DMG23_03280, partial [Acidobacter...  53.2    1e-04
RKZ07008.1 hypothetical protein DRQ05_03820, partial [bacterium]      47.8    1e-04
PAV66383.1 hypothetical protein WR25_19351 [Diploscapter pachys]      53.6    1e-04
OFW84268.1 hypothetical protein A2018_07130 [Alphaproteobacteria ...  52.5    1e-04
OYW58129.1 hypothetical protein B7Z31_08445, partial [Rhodobacter...  52.5    1e-04
MBG85303.1 hypothetical protein [Verrucomicrobiales bacterium]        54.0    1e-04
KWV91702.1 hypothetical protein AUC45_10870 [Erythrobacter sp. YT30]  54.0    1e-04
WP_199693393.1 zinc-ribbon domain-containing protein, partial [So...  49.8    1e-04
HCI45959.1 TPA: thioredoxin [Rhodospirillaceae bacterium]             49.4    1e-04
PWM63029.1 hypothetical protein DBX63_02440 [Clostridia bacterium]    53.6    1e-04
WP_047581807.1 zinc-ribbon domain-containing protein, partial [Me...  49.8    1e-04
HCR64982.1 TPA: hypothetical protein [Oceanicaulis sp.]               47.8    1e-04
HBJ93531.1 TPA: hypothetical protein [Hyphomonadaceae bacterium]      49.8    1e-04
OYW53255.1 hypothetical protein B7Z31_12010, partial [Rhodobacter...  52.1    1e-04
MBA2360413.1 glycerophosphoryl diester phosphodiesterase membrane...  50.2    1e-04
NLH85803.1 DUF4339 domain-containing protein [Verrucomicrobia bac...  54.0    1e-04
OZB22600.1 hypothetical protein B7X51_14825, partial [Pseudomonas...  50.2    1e-04
MBC7768129.1 zinc-ribbon domain-containing protein [Phycisphaeral...  49.0    1e-04
KPJ73767.1 hypothetical protein AMS14_06350 [Planctomycetes bacte...  52.5    1e-04
ETR74400.1 anti-sigma B factor antagonist [Candidatus Magnetoglob...  51.7    1e-04
MBE3550789.1 hypothetical protein [Brockia lithotrophica]PTQ51735...  52.9    1e-04
NIR37268.1 hypothetical protein [Actinobacteria bacterium]NIS3173...  52.1    1e-04
MYD96811.1 hypothetical protein [Gammaproteobacteria bacterium]       53.2    1e-04
MBI5835749.1 hypothetical protein [Candidatus Eisenbacteria bacte...  53.2    1e-04
WP_144902450.1 hypothetical protein [Halobellus captivus]             51.3    1e-04
NLX73041.1 hypothetical protein [Bacteroidales bacterium]             53.2    1e-04
WP_196104241.1 zinc-ribbon domain-containing protein [Pontivivens...  53.6    1e-04
WP_187533636.1 hypothetical protein [Erysipelothrix inopinata]QNN...  52.1    1e-04
NOQ82797.1 hypothetical protein [Myxococcales bacterium]              54.4    1e-04
TML64728.1 hypothetical protein E6G22_03820 [Actinobacteria bacte...  52.1    1e-04
HBM60493.1 TPA: hypothetical protein [Citreicella sp.]                53.2    1e-04
VVC04255.1 Membrane domain of glycerophosphoryl diester phosphodi...  53.2    1e-04
ELZ44635.1 hypothetical protein C464_13085 [Halorubrum coriense D...  53.2    1e-04
HFD16665.1 TPA: DUF3426 domain-containing protein [Rhodospirillal...  52.9    1e-04
NUO63237.1 hypothetical protein [Gemmatimonadaceae bacterium]         47.8    1e-04
PYQ07758.1 hypothetical protein DMF83_08590 [Acidobacteria bacter...  52.5    1e-04
MBA2313169.1 hypothetical protein [Actinobacteria bacterium]          52.5    1e-04
ODT78704.1 hypothetical protein ABS71_01655 [bacterium SCN 62-11]     52.5    1e-04
OGY81431.1 hypothetical protein A3F54_02195 [Candidatus Kerfeldba...  52.5    1e-04
NTU42834.1 hypothetical protein [Nitrospirales bacterium]             52.5    1e-04
OIO19188.1 hypothetical protein AUJ23_02480 [Candidatus Magasanik...  52.5    1e-04
MAL57839.1 hypothetical protein [Brevundimonas sp.]                   47.5    1e-04
WP_182450389.1 hypothetical protein [Streptacidiphilus sp. P02-A3...  52.5    1e-04
MBA4032952.1 hypothetical protein [Planctomyces sp.]                  50.2    1e-04
SBW20251.1 putative membrane protein [Candidatus Frankia californ...  50.5    1e-04
WP_138465116.1 zinc-ribbon domain-containing protein [Poseidonoce...  53.2    1e-04
RLM60084.1 hypothetical protein DVK07_19755 [Halorubrum sp. Atlit...  49.8    1e-04
NWG72642.1 zinc-ribbon domain-containing protein [Parvularculacea...  53.2    1e-04
WP_131900554.1 hypothetical protein [Jiangella asiatica]TDD99113....  50.2    1e-04
PKP92064.1 thioredoxin, partial [Alphaproteobacteria bacterium HG...  47.1    1e-04
MBI3971862.1 hypothetical protein [Chloroflexi bacterium]             52.1    1e-04
WP_184195778.1 glycerophosphoryl diester phosphodiesterase membra...  52.1    1e-04
CAN74577.1 hypothetical protein VITISV_009110 [Vitis vinifera]        54.0    1e-04
NLN04541.1 hypothetical protein [Clostridiaceae bacterium]            52.9    1e-04
NBO18873.1 DUF4339 domain-containing protein [Proteobacteria bact...  52.5    2e-04
WP_145030715.1 hypothetical protein [Caulifigura coniformis]QDT54...  53.2    2e-04
MBR89700.1 hypothetical protein [Verrucomicrobiales bacterium]        52.9    2e-04
MBN33850.1 hypothetical protein [Rhodospirillaceae bacterium]         49.0    2e-04
MAF67884.1 hypothetical protein [Micavibrio sp.]                      52.1    2e-04
OYU42076.1 hypothetical protein CFE44_26090 [Burkholderiales bact...  50.5    2e-04
WP_137105721.1 zinc-ribbon domain-containing protein [Azospirillu...  51.7    2e-04
OGW32305.1 hypothetical protein A2X59_00640 [Nitrospirae bacteriu...  52.9    2e-04
RLE93661.1 hypothetical protein DRN04_06455 [Thermoprotei archaeon]   53.6    2e-04
KMT09158.1 hypothetical protein BVRB_6g132640 [Beta vulgaris subs...  50.5    2e-04
HIF06606.1 TPA: hypothetical protein [Gemmatimonadetes bacterium]     48.2    2e-04
MBI1339012.1 hypothetical protein [bacterium]                         52.9    2e-04
NVM56621.1 zinc-ribbon domain-containing protein [Desulfobacteral...  51.7    2e-04
WP_174541478.1 hypothetical protein [Methyloligella sp. GL2]QKP77...  52.9    2e-04
NMD37062.1 hypothetical protein [Christensenellaceae bacterium]       52.5    2e-04
MBD5440315.1 DUF975 family protein [Treponema sp.]                    50.5    2e-04
PYT09483.1 hypothetical protein DMF49_01935 [Acidobacteria bacter...  51.7    2e-04
HIM47199.1 TPA: hypothetical protein [Alphaproteobacteria bacterium]  49.0    2e-04
ARU42874.1 hypothetical protein CCB81_01385 [Armatimonadetes bact...  52.1    2e-04
MXW26595.1 hypothetical protein [Dehalococcoidia bacterium]MYA528...  52.1    2e-04
WP_044217345.1 hypothetical protein [Flammeovirga pacifica]OHX647...  52.5    2e-04
NLX23555.1 YIP1 family protein [Phycisphaerae bacterium]              53.2    2e-04
WP_183346504.1 zinc-ribbon domain-containing protein [Geomonas pa...  53.2    2e-04
RXH69783.1 hypothetical protein DVH24_007039 [Malus domestica]        53.6    2e-04
MSY59729.1 hypothetical protein [Actinobacteria bacterium]            52.5    2e-04
NND43154.1 hypothetical protein [Silicimonas sp.]                     52.1    2e-04
MBI4951725.1 zinc-ribbon domain-containing protein [Myxococcales ...  51.3    2e-04
KKU18356.1 hypothetical protein UX28_C0001G0213 [Candidatus Paceb...  52.5    2e-04
HHV09035.1 TPA: DUF975 family protein [Clostridiales bacterium]       51.7    2e-04
KAF9613703.1 hypothetical protein IFM89_010145 [Coptis chinensis]     52.9    2e-04
MBI2594007.1 hypothetical protein [Candidatus Daviesbacteria bact...  51.7    2e-04
PWT71969.1 hypothetical protein C5B60_10110 [Chloroflexi bacterium]   52.9    2e-04
RZM07946.1 hypothetical protein EOP67_72485, partial [Sphingomona...  49.4    2e-04
OGP62797.1 hypothetical protein A2170_16600, partial [Deltaproteo...  52.1    2e-04
NTU99436.1 hypothetical protein [Candidatus Falkowbacteria bacter...  51.7    2e-04
OGK09556.1 hypothetical protein A2767_06010 [Candidatus Roizmanba...  52.9    2e-04
HHS69818.1 TPA: zinc ribbon domain-containing protein [Candidatus...  52.1    2e-04
OYV09983.1 hypothetical protein CG446_1323, partial [Methanosaeta...  51.3    2e-04
WP_182098911.1 glycerophosphoryl diester phosphodiesterase membra...  51.3    2e-04
NLW78468.1 DUF975 family protein [Ruminococcaceae bacterium]          52.5    2e-04
CAA9225334.1 hypothetical protein AVDCRST_MAG93-630, partial [unc...  51.7    2e-04
HHD11436.1 TPA: DUF3426 domain-containing protein [Deltaproteobac...  52.9    2e-04
MBF0519763.1 hypothetical protein [Nitrospirae bacterium]             51.7    2e-04
TEY47489.1 hypothetical protein Saspl_039123 [Salvia splendens]       52.9    2e-04
GEV76394.1 hypothetical protein CTI12_AA110850 [Tanacetum cinerar...  50.2    2e-04
WP_193388356.1 zinc-ribbon domain-containing protein, partial [An...  49.4    2e-04
HBE82154.1 TPA: hypothetical protein [Blastocatellia bacterium]       53.2    2e-04
MAM76783.1 hypothetical protein [Tistrella sp.]                       52.1    2e-04
HGW15319.1 TPA: hypothetical protein [Geobacteraceae bacterium]       52.1    2e-04
WP_075726472.1 hypothetical protein [Corynebacterium aquilae]APT8...  52.5    2e-04
WP_142407150.1 hypothetical protein [Mycobacterium sp. EPG1]          52.9    2e-04
MBE2197726.1 DUF4013 domain-containing protein [Anaerolinea sp.]      52.9    2e-04
TVQ98381.1 hypothetical protein EA398_13415 [Deltaproteobacteria ...  52.9    2e-04
RLC26940.1 hypothetical protein DRH56_03745 [Deltaproteobacteria ...  52.5    2e-04
MBI4393811.1 protein kinase [Euryarchaeota archaeon]                  52.9    2e-04
WP_137150880.1 zinc-ribbon domain-containing protein [Devosia sp....  52.5    2e-04
TAL36625.1 hypothetical protein EPN93_07315 [Spirochaetes bacterium]  52.9    3e-04
TNE54099.1 hypothetical protein EP341_05890, partial [Sphingomona...  51.7    3e-04
KJV06413.1 hypothetical protein VZ94_11375 [Methylocucumis oryzae]    52.5    3e-04
MBC8559302.1 hypothetical protein [Clostridiaceae bacterium NSJ-3...  52.5    3e-04
HGF33636.1 TPA: hypothetical protein [Desulfobacca acetoxidans]       48.2    3e-04
PIZ30407.1 hypothetical protein COY40_04730 [Alphaproteobacteria ...  52.9    3e-04
NJO84589.1 tetratricopeptide repeat protein [Blastochloris sp.]       52.5    3e-04
QHI70525.1 hypothetical protein GT409_14115 [Kiritimatiellaeota b...  51.7    3e-04
HIE06348.1 TPA: hypothetical protein [Candidatus Stahlbacteria ba...  50.5    3e-04
WP_107584647.1 hypothetical protein [Alkalicoccus saliphilus]PTL3...  51.7    3e-04
WP_014373691.1 hypothetical protein [Saprospira grandis]AFC23448....  51.7    3e-04
HGF76395.1 TPA: hypothetical protein [Firmicutes bacterium]           52.1    3e-04
MXU63985.1 hypothetical protein [Rhodobacteraceae bacterium KN286]    52.5    3e-04
NOX39819.1 hypothetical protein [Alphaproteobacteria bacterium]       52.1    3e-04
QDP19568.1 hypothetical protein FMM02_06090 [Sphingomonas sp. AE3]    52.1    3e-04
TXD38845.1 hypothetical protein FRC98_00125 [Bradymonadales bacte...  52.9    3e-04
KQC13745.1 hypothetical protein APR63_07605 [Desulfuromonas sp. SDB]  51.7    3e-04
KAF7842967.1 putative transmembrane protein [Senna tora]              52.9    3e-04
TLY28689.1 hypothetical protein E6K62_09855, partial [Nitrospirae...  51.7    3e-04
WP_144864987.1 hybrid sensor histidine kinase/response regulator ...  52.9    3e-04
PAW78819.1 hypothetical protein B9S32_05405 [Verrucomicrobia bact...  52.1    3e-04
TMQ34936.1 hypothetical protein E6K70_04975 [Planctomycetes bacte...  51.3    3e-04
OQX01670.1 hypothetical protein BWK80_59455 [Desulfobacteraceae b...  51.7    3e-04
WP_134724441.1 zinc-ribbon domain-containing protein [Paracoccus ...  51.3    3e-04
HBE00133.1 TPA: hypothetical protein [Gemmatimonadetes bacterium]     49.8    3e-04
HAC57592.1 TPA: hypothetical protein [Rhodobiaceae bacterium]         52.1    3e-04
MBB24536.1 hypothetical protein [Geminicoccus sp.]                    50.9    3e-04
RMG21182.1 hypothetical protein D6729_01370, partial [Deltaproteo...  47.8    3e-04
NUQ91842.1 DUF3426 domain-containing protein [Gemmatimonadaceae b...  47.8    3e-04
KAB2879190.1 hypothetical protein F9K33_10200 [bacterium]             52.5    3e-04
MBI4544038.1 zinc-ribbon domain-containing protein [Gemmatimonade...  49.4    3e-04
NJN47961.1 hypothetical protein [Candidatus Competibacteraceae ba...  52.5    3e-04
EGT3616849.1 DUF975 family protein [Clostridium perfringens]          51.7    3e-04
MBI5241206.1 hypothetical protein [Elusimicrobia bacterium]           51.7    3e-04
RJL00941.1 thioredoxin, partial [Paracoccus siganidrum]               46.7    3e-04
WP_148240793.1 hypothetical protein [Nocardioides sp. S-1144]QCW5...  50.2    3e-04
RLG70713.1 hypothetical protein DRO04_01305 [Candidatus Diapherot...  51.3    3e-04
WP_090520267.1 zinc-ribbon domain-containing protein [Paracoccus ...  52.1    3e-04
WP_194189952.1 DUF975 family protein [Clostridium sp. PT]             51.7    3e-04
PYV00698.1 hypothetical protein DMG26_14940 [Acidobacteria bacter...  52.5    3e-04
WP_144846936.1 DUF975 family protein [Lactobacillus gasseri]TVV01...  51.3    3e-04
KAA0218568.1 hypothetical protein EDM80_00105 [bacterium]RIK65599...  52.1    3e-04
WP_108103268.1 zinc-ribbon domain-containing protein [Geobacter s...  51.7    3e-04
MBI4184865.1 zinc-ribbon domain-containing protein [Proteobacteri...  52.5    3e-04
WP_169849444.1 zinc-ribbon domain-containing protein, partial [Co...  49.4    3e-04
PLX91758.1 hypothetical protein C0621_10775, partial [Desulfuromo...  47.8    3e-04
HBL12542.1 TPA: hypothetical protein [Cyanobacteria bacterium UBA...  52.5    3e-04
WP_181180903.1 zinc-ribbon domain-containing protein, partial [Pa...  48.6    3e-04
OMO95110.1 hypothetical protein COLO4_16060 [Corchorus olitorius]     52.1    4e-04
MBD3311948.1 hypothetical protein [archaeon]                          51.7    4e-04
NQZ57222.1 hypothetical protein [Lentisphaeraceae bacterium]          52.1    4e-04
WP_172344592.1 hypothetical protein [Prevotella sp. PCHR]NPE25109...  51.7    4e-04
MBT40490.1 hypothetical protein [Deltaproteobacteria bacterium]       51.3    4e-04
MBI3010989.1 hypothetical protein [Candidatus Omnitrophica bacter...  50.9    4e-04
KAF5935985.1 hypothetical protein HYC85_027114 [Camellia sinensis]    50.9    4e-04
VUT24393.1 hypothetical protein MOIL_00483 [Candidatus Methanolli...  50.9    4e-04
MBD3788613.1 zinc-ribbon domain-containing protein [Sphingomonada...  49.4    4e-04
NCW14115.1 hypothetical protein [Rhodobacteraceae bacterium]          50.9    4e-04
MBI5810497.1 zinc-ribbon domain-containing protein [Deltaproteoba...  47.1    4e-04
NRA02540.1 zinc-ribbon domain-containing protein [Myxococcales ba...  52.1    4e-04
RJL06555.1 hypothetical protein D3P06_03555, partial [Paracoccus ...  49.0    4e-04
OUX70770.1 hypothetical protein CBD00_02145 [Rhodospirillaceae ba...  51.3    4e-04
MBC1391767.1 DUF975 family protein [Listeria welshimeri]              49.8    4e-04
WP_158010726.1 hypothetical protein [Tardibacter chloracetimidivo...  51.3    4e-04
HAS78877.1 TPA: hypothetical protein [Ruminococcus sp.]               52.1    4e-04
MAP90714.1 hypothetical protein [Candidatus Poribacteria bacterium]   51.7    4e-04
NOK61900.1 hypothetical protein [Chloroflexi bacterium AL-N1]NOK6...  51.7    4e-04
HEB78997.1 TPA: thioredoxin [Rhodospirillales bacterium]              45.9    4e-04
MBE6554379.1 DUF975 family protein [Ruminococcaceae bacterium]        51.7    4e-04
HIA92278.1 TPA: hypothetical protein [Candidatus Saccharibacteria...  50.9    4e-04
MBH2006988.1 hypothetical protein [Candidatus Saccharibacteria ba...  51.7    4e-04
MBA3761007.1 zinc-ribbon domain-containing protein [Gemmatimonada...  49.8    4e-04
WP_028099953.1 hypothetical protein [Dongia sp. URHE0060]             51.3    4e-04
TDJ12461.1 hypothetical protein E2O66_07375 [Deltaproteobacteria ...  52.1    4e-04
SHW22474.1 proline and glycine rich transmembrane protein [Mycoba...  48.2    4e-04
PKL36072.1 hypothetical protein CVV44_17785 [Spirochaetae bacteri...  52.1    4e-04
KPJ48918.1 hypothetical protein AMJ41_03980 [candidate division Z...  50.5    4e-04
WP_120068225.1 hypothetical protein [Halococcus sp. IIIV-5B]RJT08...  49.0    4e-04
NQY39059.1 zinc-ribbon domain-containing protein [Henriciella sp.]    50.5    4e-04
GAF40281.1 integral membrane protein [Agrilactobacillus composti ...  50.5    4e-04
NNC43223.1 hypothetical protein [Acidimicrobiia bacterium]            49.0    4e-04
MBC8277666.1 zinc-ribbon domain-containing protein [FCB group bac...  46.7    5e-04
MBI2415304.1 hypothetical protein [Candidatus Kerfeldbacteria bac...  50.9    5e-04
OLA64591.1 hypothetical protein BHW56_06520 [Acetobacter sp. 46_36]   51.7    5e-04
PIE74612.1 hypothetical protein CSA18_04495, partial [Deltaproteo...  50.2    5e-04
HBW20264.1 TPA: hypothetical protein [Actinobacteria bacterium]       51.7    5e-04
WP_121876654.1 hypothetical protein [Umboniibacter marinipuniceus...  50.5    5e-04
NLL50801.1 hypothetical protein [Eubacteriaceae bacterium]            51.7    5e-04
WP_152142453.1 zinc-ribbon domain-containing protein [Amylibacter...  52.1    5e-04
MBJ7725272.1 putative Zn finger-like uncharacterized protein [Cau...  48.6    5e-04
NEE07014.1 hypothetical protein [Streptomyces sp. SID7499]            49.4    5e-04
PME47739.1 hypothetical protein BCV34_17405, partial [Vibrio lent...  49.4    5e-04
PIN70052.1 hypothetical protein COV93_03080 [Candidatus Woesearch...  50.9    5e-04
MBI3021119.1 hypothetical protein [Candidatus Omnitrophica bacter...  48.6    5e-04
TAJ25767.1 hypothetical protein EPO67_20125, partial [Reyranella ...  49.4    5e-04
MSP52232.1 hypothetical protein [Alphaproteobacteria bacterium]       50.9    5e-04
MBI3494833.1 hypothetical protein [Candidatus Saccharibacteria ba...  51.3    5e-04
WP_167175055.1 zinc-ribbon domain-containing protein [Brevundimon...  51.3    5e-04
PYP42224.1 hypothetical protein DMD43_03640 [Gemmatimonadetes bac...  49.4    5e-04
HHW31036.1 TPA: hypothetical protein [Clostridiaceae bacterium]       50.5    5e-04
NTU70650.1 hypothetical protein [Coriobacteriia bacterium]            52.1    5e-04
OUZ99439.1 hypothetical protein BVC80_1801g4 [Macleaya cordata]       48.6    5e-04
KAF9687129.1 hypothetical protein SADUNF_Sadunf02G0061600 [Salix ...  51.7    5e-04
NIT13175.1 hypothetical protein [Candidatus Dadabacteria bacterium]   49.4    5e-04
ACC98177.1 hypothetical protein Emin_0622 [Elusimicrobium minutum...  51.3    5e-04
GAK33662.1 family finger-like domain protein [alpha proteobacteri...  51.7    5e-04
OQA95757.1 hypothetical protein BWY23_02291 [Spirochaetes bacteri...  51.7    5e-04
PLX89717.1 hypothetical protein C0614_01625 [Desulfuromonas sp.]      48.6    5e-04
MBE6462622.1 hypothetical protein [Alphaproteobacteria bacterium]     50.9    5e-04
HIJ45883.1 TPA: thioredoxin [Rhodospirillaceae bacterium]             47.1    5e-04
OYT39889.1 hypothetical protein B6U86_04785 [Candidatus Altiarcha...  50.9    5e-04
HAQ0966128.1 TPA: glycerophosphodiester phosphodiesterase [Entero...  51.7    5e-04
RZB42193.1 hypothetical protein D0Y65_052964 [Glycine soja]           52.1    5e-04
MBF0344757.1 hypothetical protein [Nitrospirae bacterium]             51.3    5e-04
WP_199753933.1 zinc-ribbon domain-containing protein, partial [Co...  49.0    5e-04
MBE6700725.1 hypothetical protein [Ruminococcaceae bacterium]         50.5    5e-04
TET29854.1 hypothetical protein E3J69_12685 [Anaerolineales bacte...  50.5    5e-04
OGF21504.1 hypothetical protein A2Y83_04475 [Candidatus Falkowbac...  50.9    5e-04
MBC7870492.1 hypothetical protein [Chitinophagaceae bacterium]        51.3    6e-04
KPN30121.1 hypothetical protein SY89_00843 [Halolamina pelagica]      49.8    6e-04
GBD36071.1 hypothetical protein HRbin36_01191 [bacterium HR36]        51.3    6e-04
WP_006063008.1 hypothetical protein [Corynebacterium durum]EKX913...  50.5    6e-04
MTI08817.1 hypothetical protein [Rhodospirillaceae bacterium RKSG...  51.7    6e-04
HAI09426.1 TPA: hypothetical protein [Dehalococcoidia bacterium]      50.2    6e-04
OJW54001.1 hypothetical protein BGO67_08015 [Alphaproteobacteria ...  50.2    6e-04
MBE9516992.1 hypothetical protein [Bacteroidetes bacterium]           49.4    6e-04
MBD0865688.1 zinc-ribbon domain-containing protein [Rhodobacterac...  49.8    6e-04
KAF8358351.1 hypothetical protein PRIPAC_93346 [Pristionchus paci...  52.1    6e-04
MSW87328.1 hypothetical protein [Actinobacteria bacterium]            50.9    6e-04
NLN62172.1 hypothetical protein [Myxococcales bacterium]              50.5    6e-04
RLB05769.1 hypothetical protein DRG50_06725 [Deltaproteobacteria ...  46.7    6e-04
PYV69652.1 hypothetical protein DMG97_21290 [Acidobacteria bacter...  50.5    6e-04
NQZ85032.1 hypothetical protein [Nanoarchaeales archaeon]             50.9    6e-04
HBY13721.1 TPA: hypothetical protein [Rhodobacteraceae bacterium]     48.2    6e-04
WP_155987575.1 hypothetical protein [Acidobacterium ailaaui]          50.9    6e-04
WP_172819116.1 zinc-ribbon domain-containing protein, partial [Co...  47.8    6e-04
NNE45683.1 hypothetical protein [Rhodothermales bacterium]            50.9    6e-04
QDT58231.1 hypothetical protein SV7mr_07200 [Planctomycetes bacte...  51.7    6e-04
HCS52016.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    50.9    6e-04
RDE16307.1 hypothetical protein C4K48_01870 [Candidatus Thorarcha...  49.0    6e-04
OGQ12530.1 hypothetical protein A2138_07660 [Deltaproteobacteria ...  50.5    6e-04
OLS27509.1 hypothetical protein HeimC3_02870 [Candidatus Heimdall...  51.3    6e-04
WP_088217549.1 zinc-ribbon domain-containing protein [Haematobact...  48.6    7e-04
BAU82378.1 integral membrane protein [Streptomyces laurentii]         48.6    7e-04
TGV64675.1 hypothetical protein EN792_068835, partial [Mesorhizob...  49.8    7e-04
MBA4366017.1 hypothetical protein [Desulfobacterium sp.]              50.9    7e-04
WP_152420686.1 hypothetical protein [Haloferax sulfurifontis]         49.8    7e-04
MBR32394.1 hypothetical protein [Spirochaetaceae bacterium]           50.9    7e-04
MBF1339007.1 DUF975 family protein [Mogibacterium diversum]           50.2    7e-04
RME50712.1 hypothetical protein D6795_09580 [Deltaproteobacteria ...  50.9    7e-04
RLI93329.1 hypothetical protein DRO89_00300 [Candidatus Altiarcha...  51.3    7e-04
ORB58330.1 hypothetical protein BST43_09855 [Mycobacteroides saop...  49.4    7e-04
WP_023451178.1 hypothetical protein [Asticcacaulis sp. AC402]ESQ7...  50.2    7e-04
OLS15241.1 hypothetical protein RBG13Loki_1127 [Candidatus Lokiar...  51.3    7e-04
HCF56487.1 TPA: hypothetical protein [Myxococcales bacterium]         49.0    7e-04
KAD5317672.1 hypothetical protein E3N88_17618 [Mikania micrantha]     51.7    7e-04
NJM62597.1 hypothetical protein [Oscillatoriales cyanobacterium R...  49.0    7e-04
MAG22987.1 hypothetical protein [Rhodospirillaceae bacterium]HAQ3...  49.8    7e-04
MBI3392904.1 hypothetical protein [Nitrospirae bacterium]             50.9    7e-04
NIA23486.1 hypothetical protein [Proteobacteria bacterium]            50.5    7e-04
TMF60332.1 hypothetical protein E6I20_14375 [Chloroflexi bacterium]   48.6    7e-04
WP_181040972.1 glycerophosphoryl diester phosphodiesterase membra...  50.5    7e-04
WP_064441743.1 DUF2510 domain-containing protein [Hoyosella altam...  50.9    7e-04
MBI5448614.1 hypothetical protein [Gammaproteobacteria bacterium]     50.2    7e-04
TES90920.1 hypothetical protein E3J87_08890 [Candidatus Cloacimon...  50.5    7e-04
RMD51407.1 hypothetical protein D6827_02300 [Candidatus Parcubact...  49.8    8e-04
TML15946.1 hypothetical protein E6G39_06320 [Actinobacteria bacte...  50.9    8e-04
MBC7792898.1 zinc-ribbon domain-containing protein [Clostridia ba...  47.8    8e-04
NVM26697.1 zinc-ribbon domain-containing protein [Desulfobacteral...  46.3    8e-04
NNG04887.1 DUF3426 domain-containing protein [Inquilinus sp.]         49.8    8e-04
HIB11376.1 TPA: hypothetical protein [Dehalococcoidia bacterium]      46.7    8e-04
RME62938.1 DUF3426 domain-containing protein [Alphaproteobacteria...  50.5    8e-04
NWF77212.1 hypothetical protein [Chloroflexi bacterium]               50.2    8e-04
MBJ7525797.1 zinc-ribbon domain-containing protein [Sphingomonada...  47.5    8e-04
HBE54169.1 TPA: hypothetical protein [Cyanobacteria bacterium UBA...  50.5    8e-04
MAI61605.1 hypothetical protein [Micavibrio sp.]OUT91955.1 hypoth...  50.9    8e-04
MBD0313472.1 glycerophosphoryl diester phosphodiesterase membrane...  47.5    8e-04
PPR29948.1 hypothetical protein CFH31_00124 [Alphaproteobacteria ...  47.1    8e-04
MBI2615337.1 zinc-ribbon domain-containing protein [Gemmatimonade...  47.8    8e-04
KKW35289.1 hypothetical protein UY81_C0046G0002 [Candidatus Giova...  51.3    9e-04
SFE08120.1 Uncharacterized membrane protein [Peptostreptococcacea...  50.9    9e-04
HDI61330.1 TPA: hypothetical protein [Desulfobacteraceae bacterium]   50.9    9e-04
MBA2765166.1 hypothetical protein [Thermoleophilaceae bacterium]      50.5    9e-04
HDW96168.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  45.9    9e-04
MBI1319779.1 hypothetical protein [Candidatus Hydrogenedens sp.]      50.2    9e-04
MBI1274627.1 hypothetical protein [bacterium]                         50.2    9e-04
MSR02063.1 hypothetical protein [Gemmatimonadetes bacterium]          45.1    9e-04
OYO04906.1 hypothetical protein CGZ95_03285 [Propionibacteriaceae...  50.2    0.001
WP_066805025.1 hypothetical protein [Sphingomonas asaccharolytica]    49.8    0.001
TIU27954.1 hypothetical protein E5W34_02840, partial [Mesorhizobi...  48.6    0.001
HHI88810.1 TPA: thioredoxin [Hellea balneolensis]                     46.3    0.001
MBI3758476.1 hypothetical protein [Deltaproteobacteria bacterium]     50.2    0.001
WP_180272178.1 glycerophosphoryl diester phosphodiesterase membra...  47.5    0.001
NED07194.1 hypothetical protein [Streptomyces sp. SID6648]            45.9    0.001
OFW87939.1 hypothetical protein A3B66_01025 [Alphaproteobacteria ...  49.8    0.001
PLY04071.1 hypothetical protein C0624_06265 [Desulfuromonas sp.]      51.3    0.001
MBI3830845.1 hypothetical protein [Planctomycetes bacterium]          50.2    0.001
WP_191750981.1 hypothetical protein [Clostridium sp. Sa3CUN1]MBD7...  50.2    0.001
XP_008466772.1 PREDICTED: uncharacterized protein LOC103504102 [C...  48.2    0.001
RMI04990.1 hypothetical protein D6681_08750, partial [Calditricha...  49.8    0.001
MXW90477.1 hypothetical protein [Rhodospirillaceae bacterium]MYB1...  48.2    0.001
MBK67525.1 hypothetical protein [Rickettsiales bacterium]             49.8    0.001
RKU30297.1 hypothetical protein C6497_04880 [Candidatus Poribacte...  50.5    0.001
HIJ38483.1 TPA: hypothetical protein [Rhodospirillaceae bacterium]    49.4    0.001
WP_191518733.1 DUF4013 domain-containing protein [Candidatus Adam...  49.8    0.001
WP_183640183.1 hypothetical protein [Neomicrococcus lactis]MBB559...  50.5    0.001
MSR54880.1 hypothetical protein [Gemmataceae bacterium]               50.5    0.001
HCI61646.1 TPA: thioredoxin [Erythrobacter sp.]                       49.0    0.001
MBI4065111.1 hypothetical protein [Candidatus Gottesmanbacteria b...  49.4    0.001
OQW46457.1 hypothetical protein A4S16_10670 [Proteobacteria bacte...  50.2    0.001
KEO86939.1 hypothetical protein EH30_04705 [Erythrobacter sp. JL475]  49.0    0.001
WP_081969459.1 zinc-ribbon domain-containing protein [Paracoccus ...  50.5    0.001
WP_099594523.1 zinc-ribbon domain-containing protein [Amylibacter...  50.9    0.001
SBW01284.1 membrane hypothetical protein [uncultured delta proteo...  50.2    0.001
HGL81338.1 TPA: hypothetical protein [Deltaproteobacteria bacteri...  47.1    0.001
WP_038471065.1 hypothetical protein [Candidatus Izimaplasma sp. H...  49.8    0.001
EGS5729550.1 DUF975 family protein [Clostridium perfringens]          47.8    0.001
HAQ34615.1 TPA: hypothetical protein [Alphaproteobacteria bacterium]  49.0    0.001
HGS39381.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  50.5    0.001
HBM67623.1 TPA: hypothetical protein [Rhodobacteraceae bacterium]     48.2    0.001
VWX52028.1 conserved membrane hypothetical protein [Novosphingobi...  49.0    0.001
RPH65632.1 tetratricopeptide repeat protein, partial [Myxococcace...  50.9    0.001
WP_172814629.1 zinc-ribbon domain-containing protein, partial [Co...  49.0    0.001
NMM43559.1 hypothetical protein [Rhodospirillaceae bacterium KN72]    50.2    0.001
MBF0470824.1 hypothetical protein [Gammaproteobacteria bacterium]     50.5    0.001
MBN35005.1 hypothetical protein [Rhodospirillaceae bacterium]         50.2    0.001
PKK92391.1 hypothetical protein CVV62_00570 [Tenericutes bacteriu...  48.6    0.001
WP_073630711.1 hypothetical protein [Scytonema sp. HK-05]OKH58728...  49.4    0.001
WP_176874803.1 hypothetical protein [Parasphingopyxis sp. CP4]QLC...  49.4    0.001
WP_155875301.1 zinc-ribbon domain-containing protein [Desulfuromo...  50.5    0.001
WP_102908277.1 hypothetical protein [Streptomyces sp. 13K301]PNG2...  50.2    0.001
MBI5639594.1 zinc-ribbon domain-containing protein [Nitrospirae b...  49.8    0.001
WP_171778011.1 DUF975 family protein [Bacillus megaterium]QJX8002...  49.8    0.001
NLE48147.1 tetratricopeptide repeat protein [Sandaracinaceae bact...  50.9    0.001
MBF0613188.1 zinc-ribbon domain-containing protein [Magnetococcal...  49.8    0.001
NLE42758.1 hypothetical protein [Lentisphaerae bacterium]             49.4    0.001
NIQ37400.1 DUF3426 domain-containing protein [Proteobacteria bact...  50.2    0.001
WP_182297868.1 glycerophosphoryl diester phosphodiesterase membra...  49.8    0.001
OWJ84013.1 hypothetical protein CDV51_14660 [Haematobacter massil...  50.9    0.001
EAH0363985.1 DUF975 family protein [Listeria monocytogenes]           49.0    0.001
WP_083768574.1 zinc-ribbon domain-containing protein [Geobacter l...  45.1    0.001
MBI3506340.1 zinc-ribbon domain-containing protein [Proteobacteri...  49.8    0.001
KPL81400.1 hypothetical protein SE18_22410 [Herpetosiphon geyseri...  49.8    0.001
WP_130630585.1 hypothetical protein [Janibacter limosus]QBF47395....  47.5    0.001
HGA37708.1 TPA: hypothetical protein [Candidatus Aenigmarchaeota ...  49.4    0.001
WP_199539243.1 hypothetical protein [Desertihabitans brevis]          50.5    0.001
MBO85307.1 hypothetical protein [Deltaproteobacteria bacterium]HC...  49.4    0.001
WP_156260253.1 zinc-ribbon domain-containing protein [Oceanobacil...  49.8    0.001
MAJ22042.1 hypothetical protein [Marinovum sp.]OUU07406.1 hypothe...  49.0    0.001
HGQ55783.1 TPA: hypothetical protein [Candidatus Aenigmarchaeota ...  49.0    0.001
PZP85762.1 hypothetical protein DI582_04970 [Azospirillum brasile...  49.0    0.001
OYV43294.1 hypothetical protein B7Z75_08885 [Acidocella sp. 20-57...  49.4    0.002
MRG70484.1 hypothetical protein [Alphaproteobacteria bacterium HT...  49.8    0.002
AWX92788.1 hypothetical protein DPM13_05265 [Paracoccus mutanolyt...  49.8    0.002
TMA38529.1 hypothetical protein E6J82_17630 [Deltaproteobacteria ...  48.6    0.002
RME48588.1 hypothetical protein D6795_12775 [Deltaproteobacteria ...  50.2    0.002
MBL91501.1 hypothetical protein [Myxococcales bacterium]              50.5    0.002
HBO69709.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  48.2    0.002
HAS10806.1 TPA: hypothetical protein [Acidimicrobiaceae bacterium]    49.8    0.002
MBC8206348.1 hypothetical protein [Kiritimatiellaeota bacterium]      49.4    0.002
WP_198926574.1 efflux RND transporter permease subunit, partial [...  50.5    0.002
WP_117591346.1 hypothetical protein [Haloprofundus halophilus]        49.4    0.002
RYZ02686.1 hypothetical protein EOO73_30980, partial [Myxococcale...  44.8    0.002
TMK30199.1 hypothetical protein E6G69_10350, partial [Alphaproteo...  45.5    0.002
WP_171183858.1 zinc ribbon domain-containing protein [Alienimonas...  50.2    0.002
PKL14862.1 hypothetical protein CVV50_02165 [Spirochaetae bacteri...  49.0    0.002
NJK90111.1 hypothetical protein [Myxococcales bacterium]              47.5    0.002
NBR40721.1 hypothetical protein [Alphaproteobacteria bacterium]       47.5    0.002
WP_172472239.1 DUF975 family protein [[Clostridium] cocleatum]GFI...  49.4    0.002
MBG77668.1 hypothetical protein [Alphaproteobacteria bacterium]HC...  49.8    0.002
PWU13708.1 hypothetical protein C5B50_18770 [Verrucomicrobia bact...  50.2    0.002
TNN37383.1 hypothetical protein EYF80_052451 [Liparis tanakae]        49.0    0.002
WP_083225497.1 zinc-ribbon domain-containing protein [Neptunicocc...  50.2    0.002
NCO61614.1 hypothetical protein [bacterium]OIP37934.1 hypothetica...  49.4    0.002
NLB53179.1 hypothetical protein [Syntrophomonadaceae bacterium]       49.0    0.002
HBD83533.1 TPA: hypothetical protein [Dehalococcoidia bacterium]      47.8    0.002
WP_129204334.1 DUF4129 domain-containing protein [Xylanimonas all...  50.2    0.002
WP_027175982.1 zinc-ribbon domain-containing protein [Desulfovibr...  49.4    0.002
HHB90317.1 TPA: hypothetical protein [Anaerolineae bacterium]         49.8    0.002
WP_162658013.1 hypothetical protein [Tuwongella immobilis]VIP0288...  49.8    0.002
NJO84766.1 hypothetical protein [Blastochloris sp.]                   49.4    0.002
MYE60404.1 hypothetical protein [Alphaproteobacteria bacterium]       48.6    0.002
MBA3502893.1 hypothetical protein [Deltaproteobacteria bacterium]     49.4    0.002
HBW19175.1 TPA: hypothetical protein [Actinobacteria bacterium]       49.0    0.002
MAJ63489.1 hypothetical protein [Alphaproteobacteria bacterium]MA...  49.4    0.002
RYD76756.1 hypothetical protein EOP53_14130 [Sphingobacteriales b...  49.4    0.002
HAO79507.1 TPA: hypothetical protein [Verrucomicrobia subdivision...  49.4    0.002
MBI1322290.1 hypothetical protein [bacterium]                         49.4    0.002
HHN39454.1 TPA: AgmX/PglI C-terminal domain-containing protein [D...  50.2    0.002
WP_074826113.1 zinc-ribbon domain-containing protein [Paracoccus ...  49.4    0.002
OQA96685.1 hypothetical protein BWY22_01645 [Bacteroidetes bacter...  49.0    0.002
KAB2879169.1 hypothetical protein F9K33_10095 [bacterium]             49.0    0.002
OGV63804.1 hypothetical protein A2498_10680 [Lentisphaerae bacter...  48.6    0.002
NIV28282.1 hypothetical protein [Anaerolineae bacterium]              47.5    0.002
TGN46554.1 hypothetical protein E4L95_19345, partial [Paracoccus ...  44.8    0.002
PYP10035.1 hypothetical protein DMD59_06995 [Gemmatimonadetes bac...  49.8    0.002
MBI5228627.1 hypothetical protein [Candidatus Micrarchaeota archa...  49.4    0.002
NNF76756.1 hypothetical protein [Rhizobiales bacterium]               49.4    0.002
QLH43106.1 hypothetical protein HWD59_10515 [Coxiellaceae bacterium]  47.8    0.002
KKT24208.1 hypothetical protein UW09_C0001G0271 [candidate divisi...  49.0    0.002
CCZ83064.1 putative uncharacterized protein [Ruminococcus sp. CAG...  49.0    0.002
MBE6365153.1 hypothetical protein [Lentisphaerae bacterium]           49.4    0.002
TMQ50188.1 hypothetical protein E6K71_03085, partial [Candidatus ...  46.7    0.002
MBB71708.1 hypothetical protein [Legionellales bacterium]             49.0    0.002
MBI83248.1 hypothetical protein [Planctomycetaceae bacterium]MBP6...  49.4    0.002
RMC22904.1 hypothetical protein DUI87_00090 [Hirundo rustica rust...  50.2    0.002
MAQ70683.1 hypothetical protein [Alphaproteobacteria bacterium]       49.0    0.002
KIP51965.1 hypothetical protein SD72_12085 [Leucobacter komagatae]    49.4    0.002
WP_004059610.1 hypothetical protein [Haloferax mediterranei]AFK19...  49.0    0.002
MBF1069313.1 hypothetical protein [Prevotellaceae bacterium]          48.6    0.002
PKL41312.1 hypothetical protein CVV44_01370 [Spirochaetae bacteri...  49.8    0.002
KPJ81436.1 hypothetical protein AMJ58_05070 [Gammaproteobacteria ...  49.0    0.002
KAA8535468.1 hypothetical protein F0562_030471 [Nyssa sinensis]       49.0    0.002
WP_011240330.1 zinc-ribbon domain-containing protein [Zymomonas m...  49.4    0.002
HCP47820.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  44.4    0.002
MBC7792681.1 hypothetical protein [Clostridia bacterium]              47.8    0.002
WP_016670306.1 hypothetical protein [Propionibacterium sp. oral t...  49.4    0.002
HHU43896.1 TPA: DUF975 family protein [Clostridiales bacterium]       46.3    0.002
NQW61576.1 zinc-ribbon domain-containing protein [Deltaproteobact...  49.8    0.002
WP_088259214.1 hypothetical protein [Fimbriiglobus ruber]OWK36161...  48.6    0.002
MAU40852.1 hypothetical protein [Kordiimonas sp.]                     49.0    0.002
NOZ85968.1 DUF3426 domain-containing protein [Deltaproteobacteria...  49.8    0.002
HHS12604.1 TPA: hypothetical protein [bacterium]                      46.3    0.002
NUQ65953.1 hypothetical protein [Pirellulales bacterium]              45.5    0.002
NOQ52509.1 hypothetical protein [Desulfuromonadaceae bacterium]       48.2    0.002
OGY84421.1 hypothetical protein A3F54_00590 [Candidatus Kerfeldba...  48.6    0.002
WP_108261701.1 zinc-ribbon domain-containing protein [Mangrovicoc...  47.5    0.002
MBI3183634.1 zinc-ribbon domain-containing protein [Myxococcales ...  49.8    0.002
BAL53766.1 hypothetical conserved protein [uncultured Acetothermi...  48.2    0.002
TXD32378.1 hypothetical protein FRC96_17635, partial [Bradymonada...  48.2    0.002
WP_116716332.1 hypothetical protein [Euzebya tangerina]               49.0    0.002
NLN63514.1 hypothetical protein [Myxococcales bacterium]              49.4    0.002
WP_020949270.1 zinc-ribbon domain-containing protein [Paracoccus ...  49.4    0.002
HBY73909.1 TPA: hypothetical protein [Candidatus Kerfeldbacteria ...  48.2    0.002
ESW95477.1 hypothetical protein X769_28690 [Mesorhizobium sp. LSJ...  47.1    0.002
WP_191978909.1 hypothetical protein [Lactobacillus fructivorans]      48.2    0.003
MBE7075657.1 hypothetical protein [Clostridiales bacterium]           49.0    0.003
WP_124329045.1 zinc-ribbon domain-containing protein [Desulfonema...  49.4    0.003
NIQ94740.1 hypothetical protein [Desulfuromonadales bacterium]        44.0    0.003
WP_188660446.1 zinc-ribbon domain-containing protein [Terasakiell...  49.0    0.003
HIF44654.1 TPA: hypothetical protein [Dehalococcoidia bacterium]      49.4    0.003
RWX46182.1 MJ0042 family finger-like domain-containing protein [C...  49.4    0.003
WP_067487144.1 hypothetical protein [Actinomadura hibisca]            49.4    0.003
OQA80769.1 hypothetical protein BWY31_03911 [Lentisphaerae bacter...  49.8    0.003
EKD32722.1 hypothetical protein ACD_76C00161G0021 [uncultured bac...  48.2    0.003
WP_158820944.1 MULTISPECIES: hypothetical protein [unclassified S...  49.4    0.003
MBM85706.1 hypothetical protein [Rhodospirillaceae bacterium]         49.0    0.003
TMA21638.1 hypothetical protein E6J85_07075 [Deltaproteobacteria ...  48.6    0.003
AHE52920.1 hypothetical protein NX02_05925 [Sphingomonas sanxanig...  48.6    0.003
TMQ73111.1 hypothetical protein E6K81_05550 [Candidatus Eisenbact...  47.5    0.003
PIE89581.1 hypothetical protein CR997_10220 [Acidobacteria bacter...  49.0    0.003
MBI2410695.1 hypothetical protein [Candidatus Kerfeldbacteria bac...  49.0    0.003
WP_097279092.1 zinc-ribbon domain-containing protein [Caenispiril...  49.0    0.003
MBI3755643.1 zinc-ribbon domain-containing protein [Deltaproteoba...  46.3    0.003
NJL62767.1 hypothetical protein [Methylacidiphilales bacterium]NJ...  47.5    0.003
WP_165167952.1 hypothetical protein [Nordella sp. HKS 07]QIG48598...  49.0    0.003
WP_162938119.1 hypothetical protein [Kiloniella sp. EL199]            48.6    0.003
TAK29963.1 tetratricopeptide repeat protein [Myxococcaceae bacter...  49.8    0.003
MBI2931399.1 zinc-ribbon domain-containing protein [Planctomycete...  44.8    0.003
HAJ85267.1 TPA: hypothetical protein [Rhodobacteraceae bacterium]     45.9    0.003
GFS46009.1 hypothetical protein Acr_00g0099540 [Actinidia rufa]       47.8    0.003
MBI3818273.1 hypothetical protein [Planctomycetes bacterium]          48.6    0.003
MBI4836386.1 hypothetical protein [Candidatus Abawacabacteria bac...  48.6    0.003
KXI13466.1 cation diffusion facilitator family transporter [Pepto...  49.0    0.003
NUS34197.1 hypothetical protein [Gemmatimonadaceae bacterium]         44.4    0.003
WP_146842793.1 glycerophosphoryl diester phosphodiesterase membra...  49.4    0.003
XP_026450656.1 uncharacterized protein LOC113350748 [Papaver somn...  48.2    0.003
WP_191390328.1 DUF975 family protein [Candidatus Alangreenwoodia ...  49.0    0.003
OQX56310.1 hypothetical protein B5M53_02150 [Candidatus Cloacimon...  48.6    0.003
TMB02236.1 hypothetical protein E6J64_17535 [Deltaproteobacteria ...  49.4    0.003
KAF8675194.1 hypothetical protein HU200_047860 [Digitaria exilis]     49.8    0.003
MBF0622691.1 zinc-ribbon domain-containing protein [Magnetococcal...  49.0    0.003
WP_175478831.1 zinc-ribbon domain-containing protein [Rubrimonas ...  47.8    0.003
WP_146675064.1 hypothetical protein [Pirellula sp. SH-Sr6A]AMV306...  49.4    0.003
MBF0530743.1 zinc-ribbon domain-containing protein [Deltaproteoba...  45.5    0.003
NMO23222.1 hypothetical protein [Pyxidicoccus fallax]                 47.5    0.003
QNJ06529.1 putative membrane protein [Synechococcus sp. MEDNS5]       45.5    0.003
KIF00650.1 hypothetical protein PL81_40040 [Streptomyces sp. RSD-27]  45.5    0.003
NCU25591.1 hypothetical protein [Candidatus Nomurabacteria bacter...  48.6    0.003
MBI2002721.1 zinc-ribbon domain-containing protein [candidate div...  44.0    0.003
MBI4854132.1 zinc-ribbon domain-containing protein [Acidobacteria...  49.8    0.003
WP_199260036.1 zinc-ribbon domain-containing protein, partial [Pa...  48.2    0.003
WP_124057337.1 DUF975 family protein [Vaginisenegalia massiliensis]   48.6    0.003
MBA3550358.1 hypothetical protein [Nannocystis sp.]                   48.2    0.003
WP_057624075.1 glycerophosphoryl diester phosphodiesterase membra...  48.6    0.003
WP_026699054.1 hypothetical protein [Bacillus chagannorensis]         48.6    0.003
MBC6444239.1 zinc-ribbon domain-containing protein [Alphaproteoba...  47.8    0.003
NIT04273.1 thioredoxin [Candidatus Saccharibacteria bacterium]        44.4    0.003
PPD77655.1 hypothetical protein GOBAR_DD25419 [Gossypium barbadense]  49.4    0.003
TMA25139.1 hypothetical protein E6J78_18525 [Deltaproteobacteria ...  47.8    0.003
MBI4173361.1 hypothetical protein [Candidatus Aenigmarchaeota arc...  47.8    0.003
KKP91379.1 hypothetical protein UR97_C0002G0045 [Candidatus Nomur...  48.2    0.003
PPR28260.1 hypothetical protein CFH38_00110 [Alphaproteobacteria ...  45.9    0.003
HEA94367.1 TPA: hypothetical protein [Chloroflexi bacterium]HEI10...  48.6    0.003
MBI4411734.1 hypothetical protein [Deltaproteobacteria bacterium]     48.2    0.003
WP_146678041.1 zinc ribbon domain-containing protein [Pirellula s...  48.6    0.003
WP_081994941.1 zinc-ribbon domain-containing protein [Paracoccus ...  47.5    0.003
WP_175481698.1 zinc-ribbon domain-containing protein [Maribius pe...  49.0    0.003
MAX17452.1 hypothetical protein [Nitrospina sp.]                      44.8    0.004
MBI5583343.1 ankyrin repeat domain-containing protein [Deltaprote...  49.0    0.004
NNE56980.1 DUF3426 domain-containing protein [Hellea sp.]             48.2    0.004
KPJ79127.1 hypothetical protein AMJ54_00235 [Deltaproteobacteria ...  49.0    0.004
NLY88175.1 hypothetical protein [Firmicutes bacterium]                48.6    0.004
WP_020918113.1 zinc-ribbon domain-containing protein [Cystobacter...  49.4    0.004
WP_183377250.1 hypothetical protein [Helcobacillus massiliensis]M...  48.6    0.004
EUA73961.1 hypothetical protein I540_0458 [Mycobacteroides absces...  44.8    0.004
HCN04818.1 TPA: hypothetical protein [Bacteroidetes bacterium]        48.2    0.004
WP_009108026.1 zinc-ribbon domain-containing protein [Desulfovibr...  48.6    0.004
MBI4666505.1 zinc-ribbon domain-containing protein [Nitrospinae b...  48.2    0.004
TMA29198.1 hypothetical protein E6J88_07330, partial [Deltaproteo...  48.6    0.004
XP_037917966.1 glutenin, high molecular weight subunit DX5-like [...  49.4    0.004
MBI3014852.1 zinc-ribbon domain-containing protein [Candidatus Te...  48.6    0.004
MBI1948748.1 zinc-ribbon domain-containing protein [Deltaproteoba...  49.4    0.004
MBF0371418.1 zinc-ribbon domain-containing protein [Magnetococcal...  44.4    0.004
WP_072832764.1 hypothetical protein [Clostridium collagenovorans]...  48.2    0.004
WP_101357826.1 hypothetical protein [Raineya orbicola]PKQ70336.1 ...  48.6    0.004
MBI4821319.1 zinc-ribbon domain-containing protein [Deltaproteoba...  49.4    0.004
WP_014812726.1 hypothetical protein [Desulfomonile tiedjei]AFM276...  49.0    0.004
WP_197444114.1 RDD family protein [Maioricimonas rarisocia]QDU368...  49.0    0.004
MAH84786.1 hypothetical protein [Magnetovibrio sp.]OUT49876.1 hyp...  48.6    0.004
GFM37166.1 hypothetical protein DSM19430T_18500 [Desulfovibrio ps...  48.6    0.004
WP_078788646.1 zinc-ribbon domain-containing protein [Geobacter t...  48.2    0.004
WP_108102137.1 zinc-ribbon domain-containing protein [Geobacter s...  48.6    0.004
MBA2542066.1 hypothetical protein [Deltaproteobacteria bacterium]     48.2    0.004
WP_176424889.1 zinc-ribbon domain-containing protein, partial [My...  48.6    0.004
OIP57164.1 signal peptidase I [Candidatus Levybacteria bacterium ...  49.0    0.004
MBI4745440.1 zinc-ribbon domain-containing protein [Deltaproteoba...  44.0    0.004
WP_002619262.1 zinc-ribbon domain-containing protein, partial [St...  45.1    0.004
WP_084454530.1 hypothetical protein [Mycobacterium interjectum]       47.8    0.004
WP_102237581.1 glycerophosphoryl diester phosphodiesterase membra...  48.2    0.004
MYK17757.1 hypothetical protein [Candidatus Poribacteria bacterium]   46.7    0.004
PZQ46111.1 hypothetical protein DI551_05745 [Micavibrio aeruginos...  48.2    0.004
OYT56359.1 hypothetical protein B6U68_03535 [Candidatus Aenigmarc...  45.5    0.004
EFE80924.1 integral membrane protein [Streptomyces albidoflavus]      47.5    0.004
OQB43884.1 hypothetical protein BWY03_00525 [Parcubacteria group ...  48.2    0.004
MBF0127708.1 zinc-ribbon domain-containing protein [Magnetococcal...  43.6    0.004
MBI4391176.1 zinc-ribbon domain-containing protein [candidate div...  47.5    0.004
MBC3843528.1 hypothetical protein [Streptacidiphilus sp. 4-A2]        48.2    0.004
MBC7564201.1 hypothetical protein [Gemmatimonadaceae bacterium]       48.2    0.004
HBZ69447.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  44.4    0.005
RMG16541.1 hypothetical protein D6729_10685, partial [Deltaproteo...  44.4    0.005
WP_066022326.1 MULTISPECIES: DUF975 family protein [Clostridium]P...  47.8    0.005
MBA3430563.1 hypothetical protein [Actinobacteria bacterium]          47.8    0.005
HHJ03931.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  49.0    0.005
MBM04113.1 hypothetical protein [Chloroflexi bacterium]               47.8    0.005
MBE5763511.1 hypothetical protein [Clostridiales bacterium]           48.2    0.005
MBI2798498.1 hypothetical protein [Candidatus Saccharibacteria ba...  47.8    0.005
MBE7415897.1 zinc-ribbon domain-containing protein [Deltaproteoba...  48.2    0.005
PIN92876.1 hypothetical protein COU54_05125 [Candidatus Pacearcha...  48.2    0.005
MXZ63771.1 hypothetical protein [Chloroflexi bacterium]               46.3    0.005
MBE7453632.1 zinc-ribbon domain-containing protein [Kofleriaceae ...  45.9    0.005
WP_181754756.1 hypothetical protein [Paenactinomyces guangxiensis...  48.2    0.005
OEU53617.1 hypothetical protein BA868_07285 [Desulfobacterales ba...  48.6    0.005
MBE7465897.1 hypothetical protein [Planctomycetes bacterium]          48.6    0.005
NIQ97137.1 hypothetical protein [Desulfuromonadales bacterium]NIS...  44.0    0.005
XP_012858037.1 PREDICTED: uncharacterized protein LOC105977276 [E...  49.0    0.005
NIS18204.1 hypothetical protein [candidate division Zixibacteria ...  43.2    0.005
SCJ14231.1 Protein of uncharacterised function (DUF975) [uncultur...  48.6    0.005
PZN29980.1 hypothetical protein DIU80_08415, partial [Chloroflexi...  47.8    0.005
WP_146007727.1 hypothetical protein [Brachybacterium sp. UMB0905]     47.8    0.005
RZC64493.1 hypothetical protein C5167_008185 [Papaver somniferum]     48.6    0.005
GEY15929.1 Ty3/gypsy retrotransposon protein [Tanacetum cinerarii...  49.0    0.005
OPX36131.1 hypothetical protein B1H12_07605 [Desulfobacteraceae b...  49.0    0.005
HFK85759.1 TPA: hypothetical protein [Chloroflexi bacterium]          48.2    0.005
WP_025022160.1 glycerophosphoryl diester phosphodiesterase membra...  48.2    0.005
KRT62392.1 Uncharacterized protein XU10_C0029G0008 [Chloroflexi b...  47.8    0.005
MBI9020351.1 glycerophosphoryl diester phosphodiesterase membrane...  47.5    0.005
HDY19326.1 TPA: hypothetical protein [Gemmata sp.]HEJ50530.1 hypo...  47.5    0.005
HCC98734.1 TPA: hypothetical protein [Rhodobacteraceae bacterium]     47.8    0.005
BAS27211.1 hypothetical protein LIP_1360 [Limnochorda pilosa]         47.8    0.005
ALS89603.1 transposase zinc-ribbon domain protein, partial [uncul...  45.9    0.005
OGG19237.1 hypothetical protein A2721_00160 [Candidatus Gottesman...  47.5    0.005
MBD3679054.1 zinc-ribbon domain-containing protein [Rhodobacterac...  48.6    0.005
HAB91770.1 TPA: hypothetical protein [Pseudomonas sp.]                47.1    0.005
NOY64066.1 hypothetical protein [Nitrospirae bacterium]               44.0    0.005
WP_125933312.1 glycerophosphoryl diester phosphodiesterase membra...  45.9    0.005
MBE5763510.1 hypothetical protein [Clostridiales bacterium]           47.8    0.005
NCO17101.1 hypothetical protein [Alphaproteobacteria bacterium]       42.8    0.005
RKH55340.1 hypothetical protein D7W81_36500, partial [Corallococc...  48.2    0.005
MBD5085539.1 DUF975 family protein [Clostridiales bacterium]          47.8    0.005
MBC7978701.1 zinc-ribbon domain-containing protein [Myxococcales ...  47.1    0.006
MAR57047.1 hypothetical protein [Rickettsiales bacterium]             47.8    0.006
PHS29425.1 hypothetical protein COA85_00785 [Robiginitomaculum sp.]   47.8    0.006
TSC72604.1 hypothetical protein G01um101438_321 [Parcubacteria gr...  47.5    0.006
WP_115603384.1 zinc-ribbon domain-containing protein [Lujinxingia...  48.2    0.006
NNG01870.1 hypothetical protein [Desulfobacteraceae bacterium]        44.8    0.006
WP_068084195.1 zinc-ribbon domain-containing protein [Pseudovibri...  47.8    0.006
TDX43691.1 hypothetical protein C7959_1617, partial [Orenia maris...  47.8    0.006
QDT38887.1 hypothetical protein Pan189_32860 [Planctomycetes bact...  48.2    0.006
HEW91442.1 TPA: hypothetical protein [Thermotogaceae bacterium]       46.7    0.006
WP_091571326.1 hypothetical protein [Melghirimyces thermohalophil...  46.7    0.006
HCN23773.1 TPA: hypothetical protein [Candidatus Marinimicrobia b...  45.9    0.006
MBD3675420.1 hypothetical protein [Planctomycetaceae bacterium]       48.2    0.006
PIR22608.1 hypothetical protein COV44_07220 [Deltaproteobacteria ...  48.6    0.006
MAZ95616.1 hypothetical protein [Planctomycetaceae bacterium]         48.6    0.006
TMA22728.1 hypothetical protein E6J85_03870 [Deltaproteobacteria ...  48.2    0.006
OJV14073.1 hypothetical protein BGO27_01125 [Alphaproteobacteria ...  47.1    0.006
HBQ14507.1 TPA: hypothetical protein [Myxococcales bacterium]         48.2    0.006
NIS62866.1 hypothetical protein [Proteobacteria bacterium]            42.8    0.006
KRK27772.1 glycerophosphodiester phosphodiesterase [Lactobacillus...  48.2    0.006
WP_013628776.1 RDD family protein [Rubinisphaera brasiliensis]ADY...  48.6    0.006
GEU36801.1 activating signal cointegrator 1 complex subunit like ...  48.6    0.006
NOZ34417.1 hypothetical protein [Chlorobi bacterium]                  46.7    0.006
NNL65406.1 histidine kinase [Myxococcales bacterium]                  42.8    0.006
NLZ01476.1 hypothetical protein [Pirellulaceae bacterium]             48.6    0.006
HAD27950.1 TPA: hypothetical protein [Rhodobacteraceae bacterium]     46.3    0.006
HER25631.1 TPA: hypothetical protein [Rhodospirillales bacterium]     47.5    0.007
MBC6497248.1 zinc-ribbon domain-containing protein [Alphaproteoba...  45.9    0.007
MBI5286697.1 zinc-ribbon domain-containing protein [Deltaproteoba...  45.9    0.007
MSR80878.1 hypothetical protein [Gemmataceae bacterium]               47.8    0.007
WP_078790124.1 zinc-ribbon domain-containing protein [Geobacter t...  48.6    0.007
WP_028640253.1 hypothetical protein [Novosphingobium acidiphilum]     47.1    0.007
MBC7329443.1 hypothetical protein [bacterium]                         45.9    0.007
HEC14613.1 TPA: hypothetical protein [Rhodospirillales bacterium]     44.0    0.007
PIE16743.1 hypothetical protein CSA66_07135 [Proteobacteria bacte...  46.3    0.007
PYX29213.1 hypothetical protein DMG77_13235 [Acidobacteria bacter...  47.8    0.007
WP_181402181.1 zinc-ribbon domain-containing protein, partial [No...  46.3    0.007
MBF0496839.1 zinc-ribbon domain-containing protein [Deltaproteoba...  42.4    0.007
WP_191401559.1 DUF975 family protein [Candidatus Gallispira edinb...  47.5    0.007
NCX85299.1 hypothetical protein [Rhodobacteraceae bacterium]          46.7    0.007
HHG89683.1 TPA: hypothetical protein [Devosia sp.]                    47.5    0.007
PIR32800.1 hypothetical protein COV36_03655 [Alphaproteobacteria ...  47.1    0.007
PIY08491.1 hypothetical protein COZ18_12015 [Flexibacter sp. CG_4...  47.5    0.007
NVL90362.1 zinc-ribbon domain-containing protein [Desulfobacteral...  45.5    0.007
REJ62356.1 hypothetical protein DWQ28_11840, partial [Proteobacte...  44.8    0.007
OYT41231.1 hypothetical protein B6U86_02785 [Candidatus Altiarcha...  47.8    0.007
KAF7359109.1 hypothetical protein [Mycena sanguinolenta]              48.2    0.007
WP_020875731.1 zinc-ribbon domain-containing protein [Desulfococc...  47.1    0.007
NBQ17330.1 hypothetical protein [bacterium]                           47.5    0.007
HIB69216.1 TPA: hypothetical protein [Phycisphaerales bacterium]      47.1    0.007
WP_041977924.1 zinc-ribbon domain-containing protein [Pyrinomonas...  47.1    0.007
HGS20096.1 TPA: tetratricopeptide repeat protein [Deltaproteobact...  48.6    0.007
WP_144995878.1 hypothetical protein [Polystyrenella longa]QDU8061...  47.8    0.007
RNC80015.1 hypothetical protein ED557_12860 [Balneola sp.]            47.1    0.007
OGQ79628.1 hypothetical protein A2289_10340 [Deltaproteobacteria ...  48.2    0.007
OGL89228.1 hypothetical protein A3I45_01350 [Candidatus Uhrbacter...  47.8    0.007
KAF0092883.1 hypothetical protein FD128_2729, partial [Hyphomonad...  47.1    0.007
OIP39291.1 hypothetical protein AUK47_10290 [Deltaproteobacteria ...  48.2    0.007
WP_199388304.1 zinc-ribbon domain-containing protein [Geomonas sp...  48.2    0.007
WP_027749292.1 hypothetical protein [Streptomyces sp. CNH287]         47.8    0.007
HIC85926.1 TPA: hypothetical protein [Desulfobacterales bacterium]    46.7    0.007
NJL60346.1 hypothetical protein [Desulfobacteraceae bacterium]        43.6    0.007
NQV33576.1 zinc-ribbon domain-containing protein [Phycisphaeracea...  48.2    0.007
HIJ64731.1 TPA: hypothetical protein [Candidatus Hydrogenedentes ...  47.8    0.008
MBI2796380.1 zinc-ribbon domain-containing protein [Gemmatimonade...  48.2    0.008
WP_097115055.1 zinc-ribbon domain-containing protein [Alysiella f...  47.1    0.008
RZO67559.1 hypothetical protein EVA70_04065 [Parvularculaceae bac...  47.5    0.008
HCK07909.1 TPA: hypothetical protein [Rhodobacter sp.]                43.6    0.008
WP_139209220.1 hypothetical protein [Aquisalimonas asiatica]          46.7    0.008
NLW83980.1 hypothetical protein [Phycisphaerae bacterium]             43.6    0.008
KAF9624612.1 hypothetical protein IFM89_012034, partial [Coptis c...  47.8    0.008
TVQ85144.1 hypothetical protein EA357_01125 [Micavibrio sp.]          47.1    0.008
TMQ67476.1 hypothetical protein E6K78_04575, partial [Candidatus ...  44.4    0.008
NJC89222.1 hypothetical protein [Desulfuromonas sp.]                  47.8    0.008
RME02878.1 hypothetical protein D6805_08625 [Planctomycetes bacte...  47.8    0.008
HAU30617.1 TPA: hypothetical protein [Candidatus Dependentiae bac...  46.7    0.008
NQV35227.1 hypothetical protein [Phycisphaeraceae bacterium]          46.3    0.008
WP_172677062.1 zinc-ribbon domain-containing protein, partial [De...  43.2    0.008
NLT60994.1 hypothetical protein [Candidatus Hydrogenedentes bacte...  47.8    0.008
PKN92012.1 hypothetical protein CVU44_17110 [Chloroflexi bacteriu...  47.1    0.008
WP_198373755.1 zinc-ribbon domain-containing protein, partial [Ro...  43.2    0.009
RME62854.1 hypothetical protein D6778_10480 [Nitrospirae bacterium]   47.5    0.009
MBA3501508.1 zinc-ribbon domain-containing protein [Deltaproteoba...  47.5    0.009
NLJ30869.1 DUF2628 domain-containing protein [Clostridiales bacte...  47.5    0.009
WP_025291210.1 hypothetical protein [Sphingomonas sanxanigenens]A...  46.7    0.009
RKU28095.1 hypothetical protein C6499_10485 [Candidatus Poribacte...  48.2    0.009
QLH40229.1 zinc-ribbon domain-containing protein [Defluviicoccus ...  45.9    0.009
NDU92133.1 hypothetical protein [Ferrovum sp.]                        47.5    0.009
HBV58079.1 TPA: hypothetical protein [Candidatus Magasanikbacteri...  47.1    0.009
RMG14255.1 gliding motility protein, partial [Deltaproteobacteria...  44.0    0.009
SCI66204.1 Protein of uncharacterised function (DUF975) [uncultur...  46.7    0.009
HHO52398.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  46.7    0.009
NBX74576.1 hypothetical protein [Alphaproteobacteria bacterium]       47.1    0.009
QFR39297.1 hypothetical protein A9Q91_03610 [Candidatus Graciliba...  47.5    0.009
MBA4076763.1 hypothetical protein [Cyanobacteria bacterium PR.023]    47.1    0.009
MBE6990307.1 DUF975 family protein [Ruminococcaceae bacterium]        46.7    0.009
MAS87997.1 hypothetical protein [Micavibrio sp.]                      46.7    0.009
RZC46976.1 hypothetical protein C5167_039954 [Papaver somniferum]     47.8    0.010
NBO64891.1 hypothetical protein [Acidobacteria bacterium]             47.1    0.010
MBF0475646.1 zinc-ribbon domain-containing protein [Deltaproteoba...  47.5    0.010
PIR39501.1 hypothetical protein COV35_03025 [Alphaproteobacteria ...  46.3    0.010
NOS67550.1 hypothetical protein [Candidatus Peribacteraceae bacte...  47.1    0.010
TVQ02204.1 hypothetical protein EA381_03675 [Planctomycetaceae ba...  47.8    0.010
MSP71201.1 tetratricopeptide repeat protein [Myxococcales bacterium]  48.2    0.010
KKJ71346.1 glycerophosphodiester phosphodiesterase, partial [Ente...  46.3    0.010
OYV40783.1 hypothetical protein B7Z80_03680 [Rhodospirillales bac...  46.7    0.010
MBI1300350.1 hypothetical protein [Alphaproteobacteria bacterium]     47.1    0.010
RPJ64716.1 hypothetical protein EHM20_18110 [Alphaproteobacteria ...  47.5    0.010
WP_180490098.1 zinc-ribbon domain-containing protein, partial [Es...  42.8    0.010
MBI5826910.1 zinc-ribbon domain-containing protein [Deltaproteoba...  47.5    0.010
MBI1363544.1 hypothetical protein [Proteobacteria bacterium]          46.7    0.010
HGB07418.1 TPA: DUF3426 domain-containing protein [Deltaproteobac...  47.5    0.010
MBA3821522.1 zinc-ribbon domain-containing protein [Deltaproteoba...  47.5    0.010
PCJ60240.1 MSEP-CTERM sorting domain-containing protein [Planctom...  47.8    0.010
WP_178132558.1 hypothetical protein [Limnoglobus roseus]QEL16899....  47.5    0.010
MBI5071026.1 zinc-ribbon domain-containing protein [Deltaproteoba...  46.7    0.010
TMA29364.1 tetratricopeptide repeat protein [Deltaproteobacteria ...  47.5    0.011
MBI4701479.1 zinc-ribbon domain-containing protein [Deltaproteoba...  47.8    0.011
HHA58865.1 TPA: hypothetical protein [Syntrophobacterales bacterium]  45.5    0.011
WP_006501536.1 hypothetical protein [Austwickia chelonae]GAB76785...  47.5    0.011
KKT35684.1 hypothetical protein UW24_C0005G0032 [Parcubacteria gr...  46.7    0.011
OQX60845.1 hypothetical protein B5M51_09690 [Anaerolinea sp. 4484...  46.7    0.011
OGV19256.1 hypothetical protein A2X47_02255 [Lentisphaerae bacter...  47.8    0.011
CDW80596.1 wd-40 repeat protein [Stylonychia lemnae]                  47.8    0.011
MBF0571846.1 hypothetical protein [Candidatus Omnitrophica bacter...  46.3    0.011
NVN98292.1 hypothetical protein [Geobacteraceae bacterium]            47.8    0.011
RDW79714.1 hypothetical protein BP6252_04352 [Coleophoma cylindro...  47.8    0.011
ARU42875.1 hypothetical protein CCB81_01390 [Armatimonadetes bact...  45.5    0.011
WP_115937294.1 hypothetical protein [Aestuariispira insulae]RED49...  46.7    0.011
TAH00994.1 hypothetical protein EAZ17_05775 [Sphingobacteriales b...  47.1    0.011
HCM98355.1 TPA: hypothetical protein [Rhodobacter sp.]                44.8    0.011
WP_174403930.1 zinc-ribbon domain-containing protein [Desulfovibr...  46.7    0.011
HAA72950.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    47.5    0.011
PIT94560.1 hypothetical protein COT98_02950, partial [Candidatus ...  44.0    0.012
WP_013627030.1 hypothetical protein [Rubinisphaera brasiliensis]A...  47.5    0.012
MBQ71941.1 hypothetical protein [Planctomycetaceae bacterium]         47.5    0.012
TFH30533.1 hypothetical protein E4G97_05570, partial [Deltaproteo...  44.0    0.012
SMO97843.1 hypothetical protein SAMN06265173_13929 [Lutimaribacte...  46.3    0.012
MSP94805.1 DUF3426 domain-containing protein [Alphaproteobacteria...  46.7    0.012
MBI2374423.1 zinc-ribbon domain-containing protein [Deltaproteoba...  42.4    0.012
KWT82062.1 hypothetical protein ASN18_2548 [Nitrospirae bacterium...  46.3    0.012
VDO31919.1 unnamed protein product [Brugia timori]                    47.8    0.012
WP_135244332.1 hypothetical protein [Polymorphobacter arshaanensi...  46.7    0.012
WP_121500478.1 hypothetical protein, partial [Pseudomonas aerugin...  45.9    0.012
CDD09235.1 unknown [Clostridium sp. CAG:349]                          47.1    0.012
RMF19123.1 tetratricopeptide repeat protein [Candidatus Dadabacte...  47.8    0.012
MBA3954950.1 hypothetical protein [Candidatus Dependentiae bacter...  46.3    0.012
PYN59115.1 thiol reductase thioredoxin, partial [Candidatus Rokub...  42.4    0.012
WP_150448468.1 glycerophosphoryl diester phosphodiesterase membra...  47.1    0.012
MBF0572433.1 zinc-ribbon domain-containing protein [Desulfamplus ...  45.9    0.013
PIE56796.1 hypothetical protein CSA34_02200 [Desulfobulbus propio...  42.4    0.013
NLE57325.1 hypothetical protein [Planctomycetes bacterium]            44.4    0.013
PHQ80397.1 hypothetical protein COB66_04855 [Coxiella sp.] [Coxie...  46.7    0.013
RPJ23292.1 hypothetical protein EHM26_00200, partial [Desulfobact...  46.7    0.013
MBL42700.1 hypothetical protein [Rhodospirillaceae bacterium]         46.7    0.013
NBU28287.1 hypothetical protein [Caulobacteraceae bacterium]          46.7    0.013
RME97676.1 hypothetical protein D6773_15455, partial [Alphaproteo...  46.7    0.013
WP_161808505.1 hypothetical protein [Methyloglobulus morosus]         44.0    0.013
HHI68480.1 TPA: hypothetical protein [Planctomycetes bacterium]       46.7    0.013
WP_198265140.1 efflux RND transporter permease subunit [sulfur-ox...  47.5    0.013
WP_013759537.1 DUF975 family protein [Treponema brennaborense]AEE...  46.7    0.013
WP_089343722.1 zinc-ribbon domain-containing protein [Paracoccus ...  46.7    0.013
HIJ62604.1 TPA: thioredoxin [Rhodospirillaceae bacterium]             42.8    0.013
WP_002772529.1 glycerophosphoryl diester phosphodiesterase membra...  46.7    0.013
NOY87574.1 DUF3426 domain-containing protein [Deltaproteobacteria...  47.1    0.014
KAF7127327.1 hypothetical protein RHSIM_Rhsim11G0075400 [Rhododen...  47.5    0.014
TDJ02067.1 hypothetical protein E2O73_03155 [Deltaproteobacteria ...  46.7    0.014
NOU32819.1 hypothetical protein [Polyangiaceae bacterium]             46.7    0.014
HAH08606.1 TPA: hypothetical protein [Alphaproteobacteria bacterium]  47.1    0.014
WP_058273742.1 hypothetical protein [Ruegeria atlantica]CUH43713....  43.6    0.014
KIV61629.1 hypothetical protein SZ55_4949 [Pseudomonas sp. FeS53a]    44.4    0.014
HIN76518.1 TPA: hypothetical protein [Rhodospirillales bacterium]     46.7    0.014
WP_162175824.1 zinc-ribbon domain-containing protein [Dongia sp. ...  46.3    0.014
OGP60032.1 hypothetical protein A2V67_08745 [Deltaproteobacteria ...  47.5    0.014
PAV71546.1 hypothetical protein WR25_17868 [Diploscapter pachys]      45.5    0.015
NJL08224.1 hypothetical protein [Methylacidiphilales bacterium]       44.0    0.015
VVB56624.1 Uncharacterised protein [uncultured archaeon]              46.3    0.015
PIR04431.1 hypothetical protein COV59_01120 [Candidatus Magasanik...  46.7    0.015
WP_172187921.1 glycerophosphoryl diester phosphodiesterase membra...  47.1    0.015
NLI33605.1 hypothetical protein [Deltaproteobacteria bacterium]       42.8    0.015
NBC58701.1 hypothetical protein [Bacteroidetes bacterium]             44.0    0.015
WP_088259215.1 DUF3566 domain-containing protein [Fimbriiglobus r...  46.3    0.015
NQV33552.1 hypothetical protein [Phycisphaeraceae bacterium]          47.1    0.015
OBQ27600.1 hypothetical protein AN483_19895, partial [Aphanizomen...  45.9    0.015
XP_001430535.1 hypothetical protein [Paramecium tetraurelia strai...  47.1    0.015
HHO51860.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  47.1    0.015
RLB29263.1 hypothetical protein DRG66_02270, partial [Deltaproteo...  43.6    0.015
NOY64397.1 hypothetical protein [Nitrospirae bacterium]               42.1    0.015
PVX25248.1 hypothetical protein CW691_05195 [Candidatus Bathyarch...  45.9    0.015
WP_141332527.1 zinc-ribbon domain-containing protein, partial [My...  47.5    0.015
WP_194869038.1 zinc-ribbon domain-containing protein, partial [My...  44.4    0.016
WP_126401825.1 zinc-ribbon domain-containing protein [Blastochlor...  46.7    0.016
RKZ30952.1 hypothetical protein DRQ36_03750 [bacterium]               47.1    0.016
HCF59164.1 TPA: hypothetical protein [Myxococcales bacterium]         47.5    0.016
THU71494.1 hypothetical protein C4D60_Mb04t02030 [Musa balbisiana]    46.7    0.016
WP_011985291.1 zinc-ribbon domain-containing protein [Anaeromyxob...  47.1    0.016
WP_021693655.1 zinc-ribbon domain-containing protein [Limimaricol...  47.1    0.016
WP_068041938.1 MULTISPECIES: zinc-ribbon domain-containing protei...  45.5    0.016
PIN85665.1 hypothetical protein COV47_01090 [Candidatus Diapherot...  46.3    0.016
WP_173743586.1 DUF975 family protein [Blautia wexlerae]NSF74130.1...  46.7    0.016
RLA85112.1 hypothetical protein DRG31_03810 [Deltaproteobacteria ...  45.5    0.016
WP_027308803.1 hypothetical protein [Caloramator sp. ALD01]           46.3    0.016
OYV07719.1 Uncharacterized protein CG444_193 [Methanosaeta sp. AS...  45.1    0.016
HGO75033.1 TPA: STAS domain-containing protein [Phycisphaerae bac...  46.3    0.017
PIP81684.1 hypothetical protein COR54_18990 [Elusimicrobia bacter...  46.3    0.017
WP_166820645.1 DUF4190 domain-containing protein [Rubinisphaera s...  45.5    0.017
NBQ82794.1 hypothetical protein [Alphaproteobacteria bacterium]       45.5    0.017
QIQ85562.1 hypothetical protein G9473_01845 [Erythrobacter sp.]       46.3    0.017
GEU92936.1 hypothetical protein [Tanacetum cinerariifolium]           46.7    0.017
MBI1209912.1 hypothetical protein [Alphaproteobacteria bacterium]     46.7    0.017
WP_161636928.1 hypothetical protein [Erysipelothrix rhusiopathiae...  45.5    0.017
WP_176437060.1 zinc-ribbon domain-containing protein, partial [My...  47.1    0.017
OHV87881.1 hypothetical protein ORS3428_03620 [Mesorhizobium sp. ...  46.3    0.017
WP_049757760.1 zinc-ribbon domain-containing protein [Magnetococc...  47.5    0.017
HCU59267.1 TPA: hypothetical protein [Alphaproteobacteria bacterium]  45.9    0.017
MBC7266751.1 hypothetical protein [Coriobacteriia bacterium]          46.3    0.017
WP_176423706.1 adventurous gliding motility protein GltJ, partial...  46.7    0.017
NLK39116.1 DUF975 family protein [Clostridiales bacterium]            46.3    0.017
WP_025028154.1 spore cortex biosynthesis protein YabQ [Bacillus m...  45.5    0.017
MBA2115874.1 hypothetical protein [Planctomycetes bacterium FF15]     46.3    0.018
MBI1949005.1 zinc-ribbon domain-containing protein [Deltaproteoba...  46.3    0.018
WP_152644732.1 hypothetical protein [Corynebacterium argentoratense]  46.7    0.018
PJC73259.1 hypothetical protein CO013_07260, partial [Syntrophoba...  44.0    0.018
MBE6357106.1 hypothetical protein [Lentisphaerae bacterium]           45.1    0.018
MBJ22024.1 hypothetical protein [Deltaproteobacteria bacterium]       46.3    0.019
WP_045387631.1 zinc-ribbon domain-containing protein [Falsirhodob...  45.1    0.019
NNF08013.1 hypothetical protein [Candidatus Eisenbacteria bacterium]  44.8    0.019
NNL00831.1 hypothetical protein [Eudoraea sp.]                        45.5    0.019
OEU61243.1 hypothetical protein BA870_11650 [Desulfuromonadales b...  46.7    0.019
EHI60156.1 hypothetical protein HMPREF9473_01828 [ [Hungatella ha...  44.4    0.019
WP_109355172.1 hypothetical protein [Sphingorhabdus sp. EL138]        44.8    0.019
WP_197430411.1 zinc-ribbon domain-containing protein, partial [Me...  41.3    0.019
WP_086052668.1 zinc-ribbon domain-containing protein [Pseudoruege...  46.7    0.019
MBI5846300.1 zinc-ribbon domain-containing protein [Deltaproteoba...  47.1    0.019
OQB87135.1 hypothetical protein BWX88_00545 [Planctomycetes bacte...  46.7    0.019
MBC5827089.1 hypothetical protein [Candidatus Eremiobacteraeota b...  46.3    0.019
PYV23996.1 hypothetical protein DMG27_14115 [Acidobacteria bacter...  45.5    0.020
OYT54045.1 hypothetical protein B6U72_04025 [Candidatus Altiarcha...  46.3    0.020
MBE6224901.1 hypothetical protein [Bacteroidales bacterium]           45.5    0.020
MBA3392926.1 zinc-ribbon domain-containing protein [Deltaproteoba...  47.1    0.020
WP_182076551.1 zinc-ribbon domain-containing protein [Deefgea sp....  43.6    0.020
WP_131925204.1 hypothetical protein [Hazenella coriacea]TCS93869....  46.3    0.020
NCA13882.1 zinc-ribbon domain-containing protein [Proteobacteria ...  46.7    0.020
MBK50175.1 hypothetical protein [Chloroflexi bacterium]MQG39568.1...  46.3    0.020
OUS04399.1 hypothetical protein A9Q96_15750 [Rhodobacterales bact...  46.3    0.020
MBA3701820.1 hypothetical protein [Rubrobacteraceae bacterium]        43.6    0.020
NET71271.1 hypothetical protein [Sphaerospermopsis sp. SIO1G2]        46.3    0.021
WP_148921784.1 zinc-ribbon domain-containing protein [Oceanicella...  46.3    0.021
WP_152889271.1 DUF975 family protein [Clostridium tarantellae]MPQ...  45.9    0.021
MBI4613963.1 hypothetical protein [Planctomycetes bacterium]          43.2    0.021
WP_144997406.1 hypothetical protein [Polystyrenella longa]QDU8178...  46.7    0.021
RDV37346.1 hypothetical protein DV096_15350 [Bradymonadaceae bact...  45.5    0.021
WP_169691242.1 zinc-ribbon domain-containing protein, partial [Vi...  41.7    0.021
MBI5186362.1 zinc-ribbon domain-containing protein [Nitrospinae b...  46.7    0.021
MBI5485606.1 zinc-ribbon domain-containing protein [Deltaproteoba...  45.9    0.021
HHW99952.1 TPA: DUF975 family protein [Acholeplasmataceae bacterium]  45.5    0.021
HCI62569.1 TPA: hypothetical protein [Erythrobacter sp.]              42.8    0.021
MBI4603044.1 hypothetical protein [Planctomycetes bacterium]          46.3    0.021
HDY19325.1 TPA: hypothetical protein [Gemmata sp.]                    44.0    0.022
MBI2128803.1 hypothetical protein [Candidatus Woesearchaeota arch...  44.4    0.022
HCR85950.1 TPA: hypothetical protein [Alphaproteobacteria bacterium]  42.4    0.022
WP_149254325.1 hypothetical protein [Labrys sp. KNU-23]QEN90133.1...  45.9    0.022
RKY52211.1 hypothetical protein DRP93_08535, partial [Candidatus ...  40.9    0.022
HBC53542.1 TPA: hypothetical protein [Alphaproteobacteria bacterium]  42.8    0.022
HGH09376.1 TPA: hypothetical protein [bacterium]                      45.1    0.022
XP_010680474.1 PREDICTED: uncharacterized protein LOC104895612 [B...  43.6    0.022
KAB2603648.1 hypothetical protein D8674_004653 [Pyrus ussuriensis...  46.3    0.022
XP_022318699.1 uncharacterized protein LOC111121634 isoform X2 [C...  46.3    0.022
XP_028991998.1 trace amine-associated receptor 13c-like [Betta sp...  46.3    0.023
KAA3632145.1 hypothetical protein DWP97_11630 [Calditrichaeota ba...  45.5    0.023
TKR67062.1 hypothetical protein L596_023271 [Steinernema carpocap...  47.1    0.023
WP_042278998.1 MULTISPECIES: hypothetical protein [Nonlabens]ALM2...  45.9    0.023
RLG14639.1 hypothetical protein DRN66_01465 [Candidatus Nanohaloa...  45.9    0.023
WP_161790246.1 hypothetical protein, partial [Streptacidiphilus c...  45.9    0.023
WP_181885661.1 zinc-ribbon domain-containing protein, partial [Tr...  44.4    0.023
WP_136799889.1 hypothetical protein [Desulfopila sp. IMCC35004]       45.9    0.023
OAI45068.1 hypothetical protein AYO44_13170 [Planctomycetaceae ba...  46.7    0.023
WP_068135767.1 glycerophosphoryl diester phosphodiesterase membra...  46.7    0.023
WP_051533940.1 hypothetical protein [Desulfitibacter alkalitolerans]  45.5    0.023
MBA4071726.1 hypothetical protein [Gemmatimonas sp.]                  46.3    0.023
WP_191346790.1 DUF975 family protein [Candidatus Gallimonas merdae]   45.5    0.023
KAF2556170.1 hypothetical protein F2Q68_00012976 [Brassica cretica]   47.1    0.024
MBL91301.1 hypothetical protein [Myxococcales bacterium]              46.7    0.024
KAB2836129.1 hypothetical protein F9K49_02430, partial [Caedimona...  44.8    0.024
OQA12877.1 hypothetical protein BWY64_03851 [bacterium ADurb.Bin363]  45.9    0.024
HHJ03925.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  46.3    0.024
KPA16453.1 magnetosome protein Mad28-2 [Candidatus Magnetomorum s...  46.7    0.024
WP_163302419.1 zinc-ribbon domain-containing protein [Desulfovibr...  46.7    0.025
HGU84329.1 TPA: hypothetical protein [Gemmatimonadetes bacterium]     44.0    0.025
NBB94226.1 hypothetical protein [Planctomycetes bacterium]            46.7    0.025
PZR07111.1 hypothetical protein DI536_29080 [Archangium gephyra]      46.7    0.025
NBX66208.1 hypothetical protein [Proteobacteria bacterium]            45.1    0.025
MBI3550627.1 hypothetical protein [Elusimicrobia bacterium]           45.9    0.026
MAB62548.1 hypothetical protein [Marinovum sp.]MBU12724.1 hypothe...  42.4    0.026
NOR85605.1 hypothetical protein [archaeon]                            42.4    0.026
GAB06159.1 hypothetical protein GOAMR_48_00700 [Gordonia amarae N...  45.5    0.026
HIH43664.1 TPA: hypothetical protein [Candidatus Methanoperedenac...  45.9    0.026
HCF62182.1 TPA: hypothetical protein [Myxococcales bacterium]         44.8    0.026
CCG41759.1 conserved hypothetical protein [Phaeospirillum molisch...  44.4    0.026
WP_013321181.1 hypothetical protein [Gloeothece verrucosa]ADN1307...  45.9    0.027
MBH23226.1 hypothetical protein [Myxococcales bacterium]              46.3    0.027
XP_007224075.2 uncharacterized protein LOC18792349 [Prunus persica]   46.3    0.027
WP_085440340.1 hypothetical protein [Magnetofaba australis]           42.4    0.027
PYT06293.1 hypothetical protein DMF49_11630 [Acidobacteria bacter...  46.3    0.027
PID73240.1 hypothetical protein CSB33_04800 [Desulfobacterales ba...  46.7    0.027
TVP89066.1 hypothetical protein EA347_04860 [Thioalkalivibrio sp.]    45.5    0.027
NOY91010.1 hypothetical protein [Deltaproteobacteria bacterium]       45.9    0.027
WP_193388349.1 zinc-ribbon domain-containing protein, partial [An...  45.1    0.027
WP_123779737.1 DUF975 family protein [Aerococcus sp. SJQ22]RPA607...  45.5    0.027
WP_161779186.1 hypothetical protein, partial [Proteus sp. G2675]N...  44.8    0.028
KKB96300.1 hypothetical protein SZ25_00622 [Candidatus Arcanobact...  45.1    0.028
TAF43635.1 hypothetical protein EAZ64_08585 [Sphingobacteriales b...  45.5    0.028
MBF0634602.1 zinc-ribbon domain-containing protein [Nitrospinae b...  41.3    0.028
MBI5681715.1 zinc-ribbon domain-containing protein [Deltaproteoba...  45.9    0.028
HIP24123.1 TPA: hypothetical protein [Rhodobacteraceae bacterium]     45.5    0.028
HHO51993.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  46.3    0.028
PIP53302.1 hypothetical protein COX08_01740, partial [Candidatus ...  42.4    0.028
WP_125119873.1 DUF975 family protein [Intestinibaculum porci]BBH2...  45.5    0.028
HHA47197.1 TPA: hypothetical protein [Armatimonadetes bacterium]      45.5    0.028
PIE32553.1 hypothetical protein CSA56_15230 [candidate division K...  46.3    0.029
WP_095975748.1 zinc-ribbon domain-containing protein [Melittangiu...  42.1    0.029
MBI5527366.1 zinc-ribbon domain-containing protein [Deltaproteoba...  46.3    0.029
ELU13346.1 hypothetical protein CAPTEDRAFT_219079 [Capitella teleta]  46.7    0.029
MBC8334379.1 hypothetical protein [Anaerolineales bacterium]          45.5    0.029
WP_116391669.1 zinc-ribbon domain-containing protein [Parvularcul...  45.9    0.029
HHU06987.1 TPA: hypothetical protein [Clostridiaceae bacterium]       45.1    0.029
WP_187281031.1 hypothetical protein [Nonomuraea sp. C10]              46.3    0.029
HCB52327.1 TPA: hypothetical protein [Rhodobacter sp.]                45.5    0.030
HAO22669.1 TPA: hypothetical protein [Desulfobacteraceae bacterium]   43.6    0.030
HIJ41512.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  41.7    0.030
ABB37917.1 MJ0042 family finger-like protein [Desulfovibrio alask...  45.9    0.031
WP_165356614.1 glycerophosphoryl diester phosphodiesterase membra...  44.8    0.031
QNN22952.1 hypothetical protein HED60_11965 [Planctomycetales bac...  45.5    0.031
NLB29755.1 DUF975 family protein [Clostridiales bacterium]            44.8    0.031
WP_154431035.1 GGDEF domain-containing protein [Roseburia sp. MUC...  46.3    0.032
TDI61717.1 hypothetical protein E2O91_03075 [Alphaproteobacteria ...  45.1    0.032
NVJ93742.1 zinc-ribbon domain-containing protein [Methylocystacea...  44.4    0.032
RLN03787.1 hypothetical protein C2845_PM13G00480 [Panicum miliaceum]  45.9    0.032
TMQ51863.1 hypothetical protein E6K73_04420 [Candidatus Eisenbact...  43.6    0.032
WP_020887907.1 zinc-ribbon domain-containing protein [Desulfohalo...  46.3    0.032
TMG17295.1 hypothetical protein E6H98_07720 [Chloroflexi bacterium]   44.8    0.032
GAK57210.1 hypothetical protein U27_04175 [Candidatus Vecturithri...  45.5    0.032
MBF0283138.1 zinc-ribbon domain-containing protein [Magnetococcal...  42.1    0.032
QDS92289.1 hypothetical protein FF011L_10310 [Planctomycetes bact...  45.9    0.032
MBF6559520.1 zinc-ribbon domain-containing protein [Candidatus Bi...  45.9    0.033
WP_131155613.1 hypothetical protein [Egibacter rhizosphaerae]QBI2...  45.9    0.033
OGW98200.1 hypothetical protein A2Z81_02485 [Omnitrophica WOR_2 b...  45.1    0.033
NOR60995.1 cell envelope integrity protein TolA [Rhodobacteraceae...  45.9    0.033
TMB13093.1 hypothetical protein E6J66_03980 [Deltaproteobacteria ...  45.5    0.033
WP_144971118.1 hypothetical protein [Bremerella volcania]             44.8    0.033
WP_124330109.1 zinc-ribbon domain-containing protein [Desulfonema...  46.3    0.033
HAG52565.1 TPA: hypothetical protein [Alphaproteobacteria bacterium]  44.8    0.033
PHS00622.1 hypothetical protein COA78_23775 [Blastopirellula sp.]     45.5    0.035
RKY07547.1 hypothetical protein DRP56_05520 [Planctomycetes bacte...  42.8    0.035
MBF0295426.1 zinc-ribbon domain-containing protein [Magnetococcal...  44.4    0.035
WP_187139776.1 hypothetical protein [Sphingopyxis sp. OPL5]QNO265...  45.5    0.035
TMB08409.1 hypothetical protein E6J64_01945 [Deltaproteobacteria ...  45.9    0.036
WP_189575605.1 hypothetical protein [Parvularcula lutaonensis]        45.9    0.036
PSQ08636.1 hypothetical protein BRC93_14835 [Halobacteriales arch...  43.6    0.036
MBD5405312.1 hypothetical protein [bacterium]                         41.3    0.036
MBI5543635.1 zinc-ribbon domain-containing protein [Deltaproteoba...  45.9    0.036
MSQ40701.1 hypothetical protein [Dehalococcoidia bacterium]           44.8    0.036
MBD3286106.1 hypothetical protein [candidate division WOR-3 bacte...  45.1    0.037
WP_007741265.1 glycerophosphoryl diester phosphodiesterase membra...  44.8    0.037
MBF0150403.1 zinc-ribbon domain-containing protein [Magnetococcal...  44.4    0.037
NIA19101.1 hypothetical protein [Xanthomonadaceae bacterium]          44.8    0.037
SDB61763.1 MJ0042 family finger-like domain-containing protein [B...  44.0    0.037
WP_157758574.1 zinc-ribbon domain-containing protein [Cystobacter...  44.8    0.038
TET10455.1 hypothetical protein E3J86_05635, partial [Candidatus ...  45.1    0.038
HBT83852.1 TPA: hypothetical protein [Desulfuromonas sp.]             43.6    0.038
RMG19694.1 tetratricopeptide repeat protein, partial [Deltaproteo...  45.9    0.038
RKU36333.1 hypothetical protein C6496_13835 [Candidatus Poribacte...  45.5    0.038
RMG61088.1 DUF3426 domain-containing protein [Deltaproteobacteria...  45.9    0.038
MBI2376544.1 zinc-ribbon domain-containing protein [Deltaproteoba...  45.9    0.038
MBE7200905.1 zinc-ribbon domain-containing protein [Parafilimonas...  43.2    0.038
MBA3247973.1 zinc-ribbon domain-containing protein [Pyrinomonadac...  40.5    0.038
MBE74794.1 hypothetical protein [Rhodopirellula sp.]                  45.1    0.038
RLC30587.1 hypothetical protein DRH32_05550, partial [Deltaproteo...  44.0    0.039
RKX61650.1 hypothetical protein DRP41_08160 [Thermodesulfobacteri...  45.1    0.039
OPX25013.1 hypothetical protein B1H04_00955 [Planctomycetales bac...  45.9    0.039
MBC7288743.1 zinc ribbon domain-containing protein [Armatimonadet...  45.5    0.039
NOZ83095.1 hypothetical protein [Euryarchaeota archaeon]              45.1    0.039
HBJ38725.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    45.9    0.039
MBC7708400.1 hypothetical protein [Polaromonas sp.]                   45.5    0.039
NCB49684.1 hypothetical protein [Alphaproteobacteria bacterium]       44.8    0.039
PKI55621.1 hypothetical protein CRG98_024014 [Punica granatum]        44.4    0.039
NCO33277.1 hypothetical protein [Armatimonadetes bacterium]NCO903...  44.8    0.040
NIS75466.1 hypothetical protein [Deltaproteobacteria bacterium]       44.0    0.040
NCD24537.1 DUF3426 domain-containing protein [Deltaproteobacteria...  45.9    0.040
OLB24717.1 hypothetical protein AUH95_02275 [Nitrospirae bacteriu...  43.2    0.040
GDX64267.1 hypothetical protein LBMAG35_11050 [Chlorobi bacterium]    44.8    0.041
XP_001024639.2 Ibr domain protein [Tetrahymena thermophila SB210]...  45.5    0.041
PID97948.1 hypothetical protein CSA83_02005 [Actinomycetales bact...  45.5    0.042
PJA23892.1 hypothetical protein COX57_11340 [Alphaproteobacteria ...  45.5    0.042
WP_092870459.1 DUF975 family protein [Acetitomaculum ruminis]SFA8...  44.8    0.042
MBI5870759.1 hypothetical protein [Actinobacteria bacterium]          45.5    0.042
RLC05449.1 hypothetical protein DRI57_27550, partial [Deltaproteo...  45.5    0.042
XP_005789164.1 hypothetical protein EMIHUDRAFT_420859 [Emiliania ...  44.8    0.042
HHS83215.1 TPA: hypothetical protein [Devosia sp.]                    44.4    0.042
OQB43596.1 tRNA_anti-like protein [Candidatus Hydrogenedentes bac...  44.4    0.042
WP_171420597.1 zinc-ribbon domain-containing protein, partial [Co...  40.9    0.042
PMP79746.1 hypothetical protein C0184_09735 [Chloroflexus aggregans]  44.0    0.043
TKJ40567.1 hypothetical protein CEE36_09265 [candidate division T...  45.1    0.043
WP_167050864.1 glycerophosphoryl diester phosphodiesterase membra...  45.5    0.043
OUT67844.1 hypothetical protein CBB70_08095 [Planctomycetaceae ba...  44.4    0.043
RLB96685.1 hypothetical protein DRH90_24405, partial [Deltaproteo...  45.5    0.043
HBI52056.1 TPA: hypothetical protein [Ruminococcaceae bacterium]      45.1    0.043
RHQ11090.1 DUF975 family protein [Lachnospiraceae bacterium AM48-...  44.8    0.043
WP_182096764.1 zinc-ribbon domain-containing protein [Enhydrobact...  44.0    0.043
TFH07157.1 hypothetical protein E4H08_09965 [Candidatus Atribacte...  45.1    0.043
MBI4817803.1 glycerophosphoryl diester phosphodiesterase membrane...  45.1    0.044
OLS24189.1 hypothetical protein HeimC3_20880 [Candidatus Heimdall...  45.5    0.044
RLC31491.1 hypothetical protein DRH37_02940 [Deltaproteobacteria ...  45.1    0.044
PPR31196.1 hypothetical protein CFH36_02276 [Alphaproteobacteria ...  44.8    0.044
WP_145211101.1 hypothetical protein [Gimesia alba]QDT40707.1 hypo...  45.5    0.044
MBI3760084.1 zinc-ribbon domain-containing protein [Deltaproteoba...  44.8    0.044
XP_030940958.1 uncharacterized protein LOC115965797 [Quercus lobata]  43.6    0.044
NJD92159.1 hypothetical protein [Geobacter sp.]                       45.1    0.044
BBK39348.1 MJ0042 family finger-like domain protein [Stella sp. A...  44.4    0.045
ALP05310.1 Glycerophosphoryl diester phosphodiesterase [Clostridi...  45.5    0.045
HBI44919.1 TPA: hypothetical protein [Planctomycetales bacterium]     44.4    0.045
WP_194538646.1 hypothetical protein [Thermogemmata fonticola]MBA2...  44.0    0.045
MBC7863755.1 hypothetical protein [Bacteroidia bacterium]             45.1    0.045
MBE6369220.1 DUF975 family protein [Lentisphaerae bacterium]          44.8    0.046
MBI4723487.1 zinc-ribbon domain-containing protein [Rhodomicrobiu...  45.1    0.046
WP_097115037.1 hypothetical protein [Alysiella filiformis]QMT3195...  44.8    0.046
NWG23342.1 hypothetical protein [Pseudorhodoplanes sp.]               45.1    0.046
MBC6406805.1 zinc-ribbon domain-containing protein [Rhodobacterac...  44.4    0.046
NNM31535.1 hypothetical protein [Gemmatimonadetes bacterium]          42.8    0.047
QKF93524.1 chitin synthase [Fadolivirus 1]                            45.9    0.048
RII26264.1 hypothetical protein CXR31_11245 [Geobacter sp.]           45.1    0.048
MBJ7471536.1 hypothetical protein [Solirubrobacteraceae bacterium]    44.8    0.048
TAL03411.1 DUF3426 domain-containing protein [Rhodospirillaceae b...  45.9    0.048
ETR71671.1 fimbriae-associated protein [Candidatus Magnetoglobus ...  45.9    0.048
WP_191282923.1 glycerophosphoryl diester phosphodiesterase membra...  45.1    0.049
HDH90011.1 TPA: hypothetical protein [Candidatus Bathyarchaeota a...  45.1    0.049
QHQ37057.1 hypothetical protein GO499_18660 [Rhodobacteraceae bac...  44.8    0.049
HAC91574.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    45.1    0.050
MBF0263282.1 zinc-ribbon domain-containing protein [Magnetococcal...  44.8    0.051
OLA82105.1 hypothetical protein BHW58_02080 [Azospirillum sp. 51_20]  44.8    0.051
MBI5528023.1 zinc-ribbon domain-containing protein [Deltaproteoba...  45.5    0.051
NOR25893.1 hypothetical protein [Desulforhopalus sp.]                 44.4    0.052
NOU26329.1 hypothetical protein [Polyangiaceae bacterium]             40.5    0.052
TNF35770.1 hypothetical protein EP329_05890 [Deltaproteobacteria ...  45.1    0.052
MAQ02903.1 hypothetical protein [Rhodospirillaceae bacterium]         45.1    0.053
CDB39928.1 mJ0042 finger-like region protein [Azospirillum sp. CA...  44.8    0.053
TMA15853.1 hypothetical protein E6J85_19310, partial [Deltaproteo...  45.5    0.053
QKK10258.1 hypothetical protein HND59_00175 [Proteobacteria bacte...  44.4    0.053
NIY18218.1 hypothetical protein [Nitrospinaceae bacterium]            44.0    0.053
HBT83656.1 TPA: hypothetical protein [Desulfuromonas sp.]             41.7    0.053
WP_171982531.1 glycerophosphoryl diester phosphodiesterase membra...  44.8    0.054
OZB73339.1 thiol reductase thioredoxin, partial [Thiomonas sp. 14...  39.8    0.054
KAA3653043.1 DUF3426 domain-containing protein, partial [Proteoba...  41.7    0.055
MBA2603838.1 hypothetical protein [Acidobacteria bacterium]           42.8    0.055
WP_053550135.1 zinc-ribbon domain-containing protein [Desulfuromo...  43.6    0.055
HGM97971.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  43.6    0.055
WP_054966758.1 zinc-ribbon domain-containing protein [Thiohalorha...  44.8    0.055
MBC8145691.1 hypothetical protein [bacterium]                         43.6    0.055
MBI3467178.1 hypothetical protein [Planctomycetes bacterium]          44.0    0.056
RLS58209.1 hypothetical protein DWH91_02825 [Planctomycetes bacte...  44.8    0.056
TNE52213.1 hypothetical protein EP343_02090 [Deltaproteobacteria ...  45.1    0.056
OGP84803.1 hypothetical protein A2Y95_07645 [Deltaproteobacteria ...  45.1    0.056
RME19960.1 hypothetical protein D6806_17360, partial [Deltaproteo...  44.8    0.056
SEH05063.1 Uncharacterised protein [Thiotrichales bacterium HS_08]    42.4    0.057
MXV76647.1 hypothetical protein [Candidatus Poribacteria bacterium]   44.0    0.057
MBU48998.1 hypothetical protein [Deltaproteobacteria bacterium]       44.8    0.057
MBF0308179.1 zinc-ribbon domain-containing protein [Magnetococcal...  45.5    0.057
NUO54404.1 hypothetical protein [Polyangiaceae bacterium]             44.8    0.058
PYP89631.1 hypothetical protein DMF61_02885 [Blastocatellia bacte...  44.8    0.058
EYU18031.1 hypothetical protein MIMGU_mgv1a013845mg [Erythranthe ...  44.0    0.058
TNE54541.1 hypothetical protein EP338_06910 [Bacteroidetes bacter...  44.4    0.059
MBA4077435.1 hypothetical protein [Cyanobacteria bacterium PR.023]    44.4    0.059
CDO74879.1 hypothetical protein BN946_scf185004.g29 [Trametes cin...  45.5    0.059
TLM67218.1 hypothetical protein FDZ69_04835 [Deltaproteobacteria ...  44.0    0.059
KRP05744.1 hypothetical protein ABS25_02925 [Cryomorphaceae bacte...  44.8    0.060
TVW80792.1 glycerophosphodiester phosphodiesterase, partial [Stre...  44.8    0.060
NXV60213.1 KRA43 protein [Molothrus ater]                             44.0    0.060
WP_089966368.1 hypothetical protein [Lihuaxuella thermophila]SEM9...  44.8    0.060
HCC69002.1 TPA: hypothetical protein [Nitrospiraceae bacterium]       43.6    0.061
MBC7793367.1 zinc-ribbon domain-containing protein [Clostridia ba...  43.2    0.061
NIR99057.1 hypothetical protein [Gammaproteobacteria bacterium]       41.3    0.061
HIF96739.1 TPA: hypothetical protein [Myxococcales bacterium]         44.8    0.061
XP_029967131.1 basic proline-rich protein-like [Salarias fasciatus]   45.5    0.061
KUK51213.1 75k gamma secalin [candidate division TA06 bacterium 3...  45.1    0.061
MBO87689.1 hypothetical protein [Deltaproteobacteria bacterium]HC...  45.1    0.061
KAA0205950.1 hypothetical protein EDM68_03715 [Candidatus Uhrbact...  44.8    0.062
QNR22563.1 hypothetical protein H4K34_09190 [Cryomorphaceae bacte...  44.8    0.062
TMB08116.1 tetratricopeptide repeat protein [Deltaproteobacteria ...  45.5    0.062
NOG92060.1 hypothetical protein [Armatimonadetes bacterium]           44.8    0.062
NHZ47757.1 hypothetical protein [Desulfovibrio sp. XJ01]              44.0    0.062
HBP93529.1 TPA: hypothetical protein [Alcanivorax sp.]                40.9    0.062
WP_084235725.1 DUF975 family protein [Papillibacter cinnamivorans...  44.8    0.063
MBJ26091.1 hypothetical protein [Alphaproteobacteria bacterium]MB...  44.0    0.063
CDQ74224.1 unnamed protein product [Oncorhynchus mykiss]              45.5    0.064
WP_072697496.1 zinc-ribbon domain-containing protein [Desulfovibr...  45.1    0.064
MBI5895757.1 zinc-ribbon domain-containing protein [Desulfobacter...  40.9    0.064
RLA88716.1 hypothetical protein DRG20_05730 [Deltaproteobacteria ...  42.1    0.065
WP_149285000.1 glycerophosphoryl diester phosphodiesterase membra...  45.1    0.065
HGZ68260.1 TPA: hypothetical protein [Deltaproteobacteria bacteri...  42.1    0.065
EKK01429.1 hypothetical protein RBSH_03254 [Rhodopirellula baltic...  44.8    0.065
MBA4391882.1 hypothetical protein [Syntrophus sp. (in: Bacteria)]     40.1    0.066
WP_144067417.1 zinc-ribbon domain-containing protein [Ferrovibrio...  44.8    0.066
WP_073709516.1 hypothetical protein [Actinomyces liubingyangii]OK...  44.8    0.067
MBI2551191.1 hypothetical protein [Candidatus Uhrbacteria bacterium]  44.4    0.067
WP_185884924.1 zinc-ribbon domain-containing protein [Croceicoccu...  42.8    0.068
MBE0564325.1 zinc-ribbon domain-containing protein [Krumholzibact...  40.9    0.068
MBI3099836.1 zinc-ribbon domain-containing protein [Planctomycete...  43.6    0.069
WP_189585541.1 hypothetical protein [Litorimonas cladophorae]GGX7...  44.4    0.069
WP_008319499.1 hypothetical protein [Haloferax mucosum]ELZ95956.1...  44.4    0.069
TMQ32887.1 hypothetical protein E6K70_16135, partial [Planctomyce...  44.0    0.069
HGM99368.1 TPA: response regulator [Deltaproteobacteria bacterium]    44.8    0.069
MST98034.1 hypothetical protein [Victivallaceae bacterium BBE-744...  44.8    0.070
MBI2117027.1 zinc-ribbon domain-containing protein [candidate div...  44.4    0.070
WP_060929981.1 glycerophosphoryl diester phosphodiesterase membra...  44.4    0.071
WP_103017879.1 hypothetical protein [Salinibacter ruber]              44.4    0.071
TND08756.1 hypothetical protein FD123_1972 [Bacteroidetes bacterium]  44.4    0.071
WP_149108714.1 hypothetical protein [Limnoglobus roseus]QEL13773....  45.1    0.071
WP_052608971.1 DUF975 family protein [Candidatus Izimaplasma sp. ...  44.4    0.072
WP_012470329.1 zinc-ribbon domain-containing protein [Geobacter l...  44.4    0.072
NCB30924.1 hypothetical protein [Clostridia bacterium]                40.1    0.072
TMI14652.1 DUF3426 domain-containing protein, partial [Betaproteo...  40.9    0.072
HDM25517.1 TPA: hypothetical protein [Thermoplasmatales archaeon]     43.6    0.073
HBN26829.1 TPA: hypothetical protein [Desulfobacteraceae bacterium]   44.8    0.073
HDM09293.1 TPA: hypothetical protein [Desulfobacteraceae bacterium]   39.8    0.074
HIM68373.1 TPA: hypothetical protein [Verrucomicrobia bacterium]      45.1    0.074
MBI5695645.1 zinc-ribbon domain-containing protein [Nitrospirae b...  45.1    0.074
WP_082025357.1 zinc-ribbon domain-containing protein [Methylocean...  44.8    0.075
MBI4774594.1 zinc-ribbon domain-containing protein [Deltaproteoba...  44.0    0.075
WP_197528571.1 hypothetical protein [Aeoliella mucimassa]QDU58229...  44.4    0.075
HAC90890.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    44.8    0.077
MBA2279251.1 hypothetical protein [Candidatus Saccharibacteria ba...  44.4    0.077
MBE6381682.1 hypothetical protein [Lentisphaerae bacterium]           44.4    0.077
MBA04561.1 hypothetical protein [Gammaproteobacteria bacterium]HA...  44.0    0.077
PCJ64762.1 hypothetical protein COA61_18855 [Zetaproteobacteria b...  44.0    0.078
OJV85663.1 hypothetical protein BGO43_02970 [Gammaproteobacteria ...  44.0    0.078
KKU86576.1 hypothetical protein UY17_C0043G0002 [Candidatus Beckw...  45.1    0.078
MBE9521216.1 zinc-ribbon domain-containing protein [Proteobacteri...  39.4    0.079
MBA4419182.1 hypothetical protein [Syntrophus sp. (in: Bacteria)]     40.5    0.080
MBF0186832.1 hypothetical protein [Magnetococcales bacterium]         45.1    0.080
NYZ76501.1 hypothetical protein [Candidatus Micrarchaeota archaeon]   44.0    0.080
MBI3070741.1 zinc-ribbon domain-containing protein [Deltaproteoba...  44.8    0.080
MSR31338.1 hypothetical protein [Gemmataceae bacterium]               44.0    0.081
MBF0285537.1 zinc-ribbon domain-containing protein [Magnetococcal...  45.1    0.081
MBF0310654.1 zinc-ribbon domain-containing protein [Magnetococcal...  44.8    0.082
RQD73129.1 hypothetical protein D5S03_13350, partial [Desulfonatr...  43.2    0.083
HBH06630.1 TPA: hypothetical protein [Flavobacteriales bacterium]     44.4    0.083
MBF0476205.1 zinc-ribbon domain-containing protein [Deltaproteoba...  42.4    0.083
WP_182054359.1 DUF975 family protein, partial [Escherichia coli]M...  41.7    0.083
QKJ99168.1 hypothetical protein HND40_06140 [Ignavibacteriae bact...  43.6    0.084
MBC8067732.1 zinc-ribbon domain-containing protein [Deltaproteoba...  42.4    0.085
RZC46974.1 hypothetical protein C5167_039955 [Papaver somniferum]     44.4    0.085
WP_181549701.1 zinc-ribbon domain-containing protein [Desulfosals...  45.1    0.085
NCC21459.1 hypothetical protein [Alphaproteobacteria bacterium]       44.0    0.085
VEN74250.1 hypothetical protein EPICR_30185 [uncultured Desulfoba...  44.4    0.086
WP_089815367.1 hypothetical protein [Halomicrobium zhouii]SFR9434...  44.0    0.088
HEE45818.1 TPA: hypothetical protein [Candidatus Dadabacteria bac...  44.4    0.088
WP_153577741.1 DUF975 family protein, partial [Bacillus thuringie...  41.3    0.089
TXD38075.1 hypothetical protein FRC98_04040 [Bradymonadales bacte...  44.4    0.089
MTI60200.1 hypothetical protein [Firmicutes bacterium]                43.2    0.089
RPH70928.1 hypothetical protein EHM78_09510 [Myxococcaceae bacter...  44.8    0.090
PWA83336.1 hypothetical protein CTI12_AA172870 [Artemisia annua]      44.4    0.090
NPD87431.1 hypothetical protein [Asgard group archaeon]               44.4    0.090
XP_013980498.1 PREDICTED: proline-rich protein 36-like [Salmo salar]  44.8    0.090
OIP66012.1 hypothetical protein AUK29_01630 [Nitrospirae bacteriu...  43.6    0.091
RLB48936.1 hypothetical protein DRJ42_22120 [Deltaproteobacteria ...  44.0    0.093
KFZ27160.1 hypothetical protein KQ78_00619 [Candidatus Izimaplasm...  44.0    0.093
NBU30202.1 hypothetical protein [Actinobacteria bacterium]            44.0    0.093
MBS39610.1 hypothetical protein [Rhodobiaceae bacterium]              40.1    0.093
CAB4072221.1 unnamed protein product [Lactuca saligna]                44.4    0.096
WP_147729643.1 zinc-ribbon domain-containing protein [Methylobact...  42.4    0.097
HAZ62755.1 TPA: hypothetical protein [Armatimonadetes bacterium]      41.3    0.098
HDP81212.1 TPA: zinc ribbon domain-containing protein [Spirochaet...  44.4    0.098
CCD00318.1 protein of unknown function [Azospirillum baldaniorum]     40.9    0.098
WP_069655230.1 DUF975 family protein [Enterococcus plantarum]OEG1...  44.0    0.098
PSP97841.1 hypothetical protein BRC89_10555 [Halobacteriales arch...  44.0    0.099
NQV23783.1 hypothetical protein [Rhodopirellula sp.]                  44.4    0.099
PLY12567.1 hypothetical protein C0624_00835 [Desulfuromonas sp.]      44.0    0.099
KKT11990.1 hypothetical protein UV92_C0036G0006, partial [Parcuba...  44.0    0.100
OPX19969.1 hypothetical protein BZ151_06540 [Desulfobacca sp. 448...  44.0    0.10 
HID76639.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    39.8    0.10 
WP_179943892.1 zinc-ribbon domain-containing protein, partial [Wo...  40.5    0.10 
HIJ19836.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  39.8    0.10 
OPX37867.1 hypothetical protein B1H13_12190 [Desulfobacteraceae b...  42.8    0.10 
HEN14015.1 TPA: hypothetical protein [Schlesneria paludicola]         40.5    0.10 
WP_129791223.1 glycerophosphoryl diester phosphodiesterase membra...  43.6    0.10 
HBL46452.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    44.4    0.11 
XP_037871600.1 uncharacterized protein LOC119629549 [Bombyx mori]     44.4    0.11 
WP_173764179.1 zinc-ribbon domain-containing protein [Azoarcus sp...  41.3    0.11 
HGR91282.1 TPA: hypothetical protein [Deltaproteobacteria bacterium]  44.4    0.11 
OGP84945.1 hypothetical protein A2Z08_12055 [Deltaproteobacteria ...  44.0    0.11 
HAY78460.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    43.6    0.11 
WP_185157027.1 zinc-ribbon domain-containing protein, partial [Me...  40.9    0.11 
NRA74281.1 hypothetical protein [Rickettsiales bacterium]             43.2    0.11 
HCU77251.1 TPA: hypothetical protein [Microbacterium sp.]             43.6    0.11 
WP_154809514.1 hypothetical protein [Methanolobus vulcani]TQD2588...  42.8    0.11 
XP_005110944.1 uncharacterized protein LOC101861007 [Aplysia cali...  44.8    0.11 
PWL49465.1 hypothetical protein DBY36_07800 [Clostridiales bacter...  43.6    0.11 
MBI29240.1 hypothetical protein [Pelagibacteraceae bacterium]PPR4...  41.7    0.11 
NEP27797.1 hypothetical protein [Moorea sp. SIO3I6]                   42.1    0.11 
AJF61271.1 hypothetical protein QT06_C0001G0431 [archaeon GW2011_...  44.0    0.11 
OLQ00702.1 hypothetical protein AK812_SmicGene16598 [Symbiodinium...  44.8    0.11 
MBE0536951.1 hypothetical protein [Phycisphaerae bacterium]           44.0    0.12 
WP_083809629.1 zinc-ribbon domain-containing protein [Candidatus ...  40.9    0.12 
WP_040658730.1 YARHG domain-containing protein [Oscillibacter rum...  44.4    0.12 
HIG12636.1 TPA: hypothetical protein [Planctomycetes bacterium]HI...  44.0    0.12 
WP_191138168.1 glycerophosphoryl diester phosphodiesterase membra...  43.6    0.12 
OFX19659.1 hypothetical protein A2V77_07340 [Anaeromyxobacter sp....  43.2    0.12 
KKS94389.1 hypothetical protein UV70_C0001G0003 [Parcubacteria gr...  43.6    0.12 
BBM83698.1 hypothetical protein UABAM_02051 [Planctomycetes bacte...  44.8    0.12 
WP_148115202.1 zinc-ribbon domain-containing protein [Wolbachia p...  41.3    0.12 
WP_096909167.1 hypothetical protein [Halobacteriovorax marinus]AT...  42.8    0.12 
NLW72924.1 hypothetical protein [Chloroflexi bacterium]               44.0    0.12 
OGV41067.1 hypothetical protein A2X48_11945 [Lentisphaerae bacter...  42.8    0.12 
NLE84788.1 hypothetical protein [Myxococcales bacterium]              44.0    0.12 
MBI2923660.1 hypothetical protein [Planctomycetes bacterium]          43.6    0.12 
NQZ95038.1 zinc-ribbon domain-containing protein [Myxococcales ba...  44.0    0.12 
HCL81743.1 TPA: hypothetical protein [Nitrospiraceae bacterium]       44.4    0.12 
MBI3821877.1 hypothetical protein [Planctomycetes bacterium]          43.2    0.12 
GFZ94090.1 hypothetical protein CYANOKiyG1_04760 [Okeania sp. KiyG1]  42.1    0.12 
QHI68099.1 hypothetical protein GT409_01070 [Kiritimatiellaeota b...  43.6    0.13 
RPI78644.1 HAMP domain-containing protein [Desulfobacteraceae bac...  43.2    0.13 
PMC61510.1 hypothetical protein CJ204_10755 [Corynebacterium xero...  43.6    0.13 
PCJ92756.1 hypothetical protein COA52_07305 [Rhizobiales bacterium]   44.0    0.13 
NIS31609.1 hypothetical protein [Actinobacteria bacterium]NIU6672...  41.3    0.13 
KCW62326.1 hypothetical protein EUGRSUZ_H04969 [Eucalyptus grandis]   43.2    0.13 
MBE6986978.1 DUF975 family protein [Ruminococcaceae bacterium]        43.6    0.13 
RJS68686.1 hypothetical protein CW714_09725 [Methanophagales arch...  44.0    0.13 
WP_174525436.1 zinc-ribbon domain-containing protein, partial [Wo...  39.0    0.13 
OFW70235.1 hypothetical protein A2065_03550 [Alphaproteobacteria ...  43.2    0.13 
RME88612.1 hypothetical protein D6785_00700, partial [Planctomyce...  43.6    0.13 
MBI3841470.1 stage II sporulation protein M [Thaumarchaeota archa...  44.0    0.13 
KKR63361.1 hypothetical protein UU02_C0026G0004 [Candidatus Woese...  40.5    0.14 
MQA06694.1 hypothetical protein [Streptosporangiales bacterium]       44.4    0.14 
HCH35343.1 TPA: hypothetical protein [Dehalococcoidia bacterium]      42.4    0.14 
NIR16342.1 hypothetical protein [Desulfobacterales bacterium]         40.5    0.14 
NPA43632.1 hypothetical protein [Chlorobi bacterium]                  43.6    0.14 
PYT03044.1 hypothetical protein DMF60_19570, partial [Acidobacter...  40.9    0.14 
MQL89493.1 hypothetical protein [Colocasia esculenta]                 44.4    0.14 
MBI5180597.1 zinc-ribbon domain-containing protein [Nitrospirae b...  44.0    0.14 
MBF1189903.1 DUF975 family protein [[Eubacterium] sulci]              41.7    0.14 
RJO71902.1 tetratricopeptide repeat protein [Myxococcales bacterium]  44.4    0.14 
MSQ33989.1 hypothetical protein [Dehalococcoidia bacterium]           42.1    0.14 
HAD04567.1 TPA: hypothetical protein [Desulfuromonas sp.]             43.2    0.14 
MXX00040.1 hypothetical protein [Acidimicrobiia bacterium]            43.2    0.14 
NNJ64170.1 thiol reductase thioredoxin [Xanthomonadales bacterium]    39.4    0.14 
WP_085536141.1 FHA domain-containing protein [Massilibacteroides ...  42.1    0.15 
PYQ07574.1 hypothetical protein DMF82_03555 [Acidobacteria bacter...  40.5    0.15 
RZV37437.1 hypothetical protein EVJ48_08995 [Candidatus Acidulode...  42.4    0.15 
WP_044075409.1 hypothetical protein [Prevotella pectinovora]KIP54...  42.8    0.15 
MBE6716609.1 hypothetical protein [Ruminococcaceae bacterium]         43.6    0.15 
MXX43460.1 hypothetical protein [Acidimicrobiales bacterium]MYI09...  43.2    0.16 
MSR81192.1 hypothetical protein [Gemmataceae bacterium]               43.2    0.16 
NLK08425.1 zinc ribbon domain-containing protein [Firmicutes bact...  43.2    0.16 
NIQ98473.1 hypothetical protein [Desulfuromonadales bacterium]NIR...  40.9    0.16 
MBI1920332.1 zinc-ribbon domain-containing protein [Geobacter sp.]    43.2    0.16 
PIP32948.1 hypothetical protein COX23_02015 [Candidatus Gottesman...  43.2    0.16 
RAI38450.1 hypothetical protein CH341_27855, partial [Rhodoplanes...  38.6    0.16 
NLA43499.1 hypothetical protein [Candidatus Saccharibacteria bact...  43.2    0.16 
MBC8200311.1 zinc-ribbon domain-containing protein [Desulfobacter...  42.4    0.16 
MRR11338.1 hypothetical protein [bacterium]                           42.8    0.16 
MBL92324.1 hypothetical protein [Myxococcales bacterium]              44.0    0.17 
XP_016435567.1 PREDICTED: uncharacterized protein LOC107761799 [N...  44.0    0.17 
WP_015837784.1 zinc-ribbon domain-containing protein [Geobacter s...  43.6    0.17 
XP_014671178.1 PREDICTED: anoctamin-4-like [Priapulus caudatus]       43.6    0.17 
MAR09255.1 hypothetical protein [Blastopirellula sp.]                 43.6    0.17 
NLG07741.1 hypothetical protein [Deinococcales bacterium]             43.2    0.17 
WP_112206957.1 zinc-ribbon domain-containing protein [Lactobacill...  43.2    0.17 
MBE6883006.1 DUF975 family protein [Ruminococcaceae bacterium]        43.6    0.17 
NIA06755.1 hypothetical protein [Actinobacteria bacterium]            42.1    0.17 
MSU80190.1 hypothetical protein [Gemmataceae bacterium]               43.2    0.18 
PJN93938.1 hypothetical protein CNY89_17450, partial [Amaricoccus...  39.0    0.18 
NRB21845.1 hypothetical protein [Candidatus Dependentiae bacterium]   43.2    0.18 
MBE7706338.1 hypothetical protein [Cyanobacteria bacterium SIG30]     42.8    0.18 
MBI3185600.1 adventurous gliding motility protein GltJ [Myxococca...  44.0    0.18 
CRH97083.1 membrane protein [Streptococcus pneumoniae]                42.4    0.19 
XP_010027371.1 PREDICTED: uncharacterized protein LOC104417864 [E...  42.1    0.19 
MBI4691221.1 histone deacetylase [Nitrospirae bacterium]              43.6    0.19 
XP_020899657.1 uncharacterized protein LOC110238333 [Exaiptasia d...  44.0    0.19 
WP_192031542.1 hypothetical protein [Pseudoxanthomonas sp. CAU 15...  43.2    0.19 
HFS54482.1 TPA: hypothetical protein [Planctomycetes bacterium]       43.2    0.19 
KKR97207.1 hypothetical protein UU49_C0034G0010 [Candidatus Magas...  41.7    0.19 
MBC8481828.1 hypothetical protein [Planctomycetes bacterium]          43.2    0.19 
QOV87739.1 hypothetical protein IPV69_15760 [Phycisphaerales bact...  43.6    0.19 
MBI1292689.1 hypothetical protein [bacterium]                         42.8    0.20 
BAJ06951.1 putative uncharacterized protein [uncultured bacterium...  42.8    0.20 
CVH76891.1 hypothetical protein BN3662_01862 [Clostridiales bacte...  42.8    0.21 
KPA17729.1 MscS mechanosensitive ion channel, partial [Candidatus...  40.9    0.21 
MUL56545.1 hypothetical protein [Pseudomonas aeruginosa]              40.9    0.21 
OHB57512.1 hypothetical protein A2Y12_16000 [Planctomycetes bacte...  43.2    0.21 
KAF1876438.1 hypothetical protein Lal_00029786 [Lupinus albus]        43.2    0.21 
KAF9586782.1 hypothetical protein IFM89_040000 [Coptis chinensis]     43.2    0.22 
WP_013709369.1 zinc-ribbon domain-containing protein [Coriobacter...  43.2    0.23 
WP_015284590.1 hypothetical protein [Methanoregula formicica]AGB0...  43.2    0.23 
NQX96170.1 hypothetical protein [Erythrobacter sp.]                   42.8    0.23 
MBK77953.1 hypothetical protein [Flavobacteriaceae bacterium]         42.1    0.23 
HBE68206.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    42.8    0.23 
MBC7765611.1 DUF975 family protein [Hyphomonadaceae bacterium]        42.8    0.24 
MBA3535518.1 hypothetical protein [Tatlockia sp.]                     43.2    0.24 
HBG95021.1 TPA: hypothetical protein [Chromatiaceae bacterium]        42.8    0.24 
BAG04142.1 hypothetical protein MAE_43200 [Microcystis aeruginosa...  41.7    0.24 
MSS74868.1 hypothetical protein [Candidatus Pacearchaeota archaeon]   42.8    0.24 
KPA13535.1 chemotaxis protein CheY [Candidatus Magnetomorum sp. H...  43.6    0.25 
HAE64104.1 TPA: thioredoxin TrxC [Acinetobacter johnsonii]            39.0    0.25 
HEJ47611.1 TPA: hypothetical protein [Gemmata sp.]                    42.1    0.25 
MBI3205911.1 hypothetical protein [Myxococcales bacterium]            42.1    0.25 
WP_124100516.1 DUF975 family protein [Ruminococcus sp. Marseille-...  43.2    0.25 
WP_145200508.1 hypothetical protein [Thalassoglobus polymorphus]Q...  42.8    0.25 
MAJ57073.1 hypothetical protein [Candidatus Pelagibacter sp.]OUW1...  39.0    0.26 
PID34750.1 hypothetical protein CR971_01640 [candidate division S...  42.4    0.26 
NIR13333.1 hypothetical protein [Desulfobacterales bacterium]         39.8    0.26 
NJL30993.1 hypothetical protein [Phycisphaerales bacterium]           41.3    0.26 
WP_051951045.1 hypothetical protein [Streptomyces yeochonensis]       40.5    0.27 
WP_195396648.1 DUF975 family protein [[Ruminococcus] gnavus]          43.2    0.27 
GAA52080.1 ATP-binding cassette subfamily A (ABC1) member 5 [Clon...  43.6    0.27 
MBC8122155.1 glycerophosphoryl diester phosphodiesterase membrane...  41.7    0.28 
WP_051013188.1 glycerophosphoryl diester phosphodiesterase membra...  40.1    0.28 
WP_129487629.1 hypothetical protein [Fusibacter sp. A1]NPE21708.1...  42.8    0.28 
CDE10768.1 putative uncharacterized protein [Clostridium sp. CAG:...  42.8    0.28 
XP_023240341.1 nose resistant to fluoxetine protein 6-like [Centr...  43.2    0.29 
WP_147005555.1 hypothetical protein [Leptotrichia hongkongensis]B...  42.4    0.29 
OGS02707.1 hypothetical protein A2278_05740 [Elusimicrobia bacter...  42.4    0.29 
PSP59313.1 hypothetical protein BRC72_00190 [Halobacteriales arch...  42.4    0.29 
XP_030470570.1 uncharacterized protein LOC115688784 [Syzygium ole...  42.4    0.29 
NOZ24287.1 hypothetical protein [Planctomycetes bacterium]            42.8    0.29 
WP_145370413.1 hypothetical protein [Maioricimonas rarisocia]QDU3...  41.7    0.30 
WP_194891411.1 hypothetical protein [Catenulispora pinisilvae]        43.2    0.30 
NLW73749.1 hypothetical protein [Clostridiales bacterium]             42.4    0.31 
PKK94162.1 hypothetical protein CVV61_00790, partial [Tenericutes...  40.5    0.31 
WP_171815915.1 glycerophosphoryl diester phosphodiesterase membra...  42.4    0.31 
OQY55980.1 hypothetical protein B6245_18975 [Desulfobacteraceae b...  43.2    0.31 
MBI5816578.1 zinc-ribbon domain-containing protein [Nitrospinae b...  43.2    0.32 
MBI2344543.1 hypothetical protein [Candidatus Dependentiae bacter...  42.4    0.32 
PKO16062.1 hypothetical protein CVU37_11960 [candidate division B...  42.1    0.32 
PIN94033.1 hypothetical protein COU54_00680 [Candidatus Pacearcha...  41.7    0.32 
MBI2378740.1 hypothetical protein [Deltaproteobacteria bacterium]     40.1    0.32 
HGI34077.1 TPA: hypothetical protein [Euryarchaeota archaeon]         42.4    0.32 
OHD17722.1 hypothetical protein A2Y37_03925 [Spirochaetes bacteri...  42.4    0.32 
OGS06240.1 hypothetical protein A3J70_03020 [Elusimicrobia bacter...  42.1    0.33 
TXI13440.1 hypothetical protein E6Q66_09590 [Pedobacter sp.]          42.1    0.34 
HAV92694.1 TPA: hypothetical protein [candidate division WOR-3 ba...  40.9    0.34 
NBO93895.1 hypothetical protein [Planctomycetia bacterium]            41.7    0.35 
RKU28812.1 hypothetical protein C6497_08615 [Candidatus Poribacte...  42.4    0.35 
RLG17831.1 hypothetical protein DRN63_02535 [Nanoarchaeota archaeon]  42.1    0.35 
OQC14498.1 hypothetical protein BWX73_01757 [Lentisphaerae bacter...  42.4    0.35 
MBI1275309.1 hypothetical protein [bacterium]                         42.4    0.35 
WP_137178817.1 zinc-ribbon domain-containing protein [Roseomonas ...  40.9    0.35 
CVH77937.1 hypothetical protein BN3662_02518 [Clostridiales bacte...  42.1    0.36 
TPX71151.1 hypothetical protein SpCBS45565_g01187 [Spizellomyces ...  43.2    0.36 
KKP29518.1 hypothetical protein UR12_C0006G0005 [candidate divisi...  42.1    0.36 
NCC86655.1 DUF975 family protein [Clostridia bacterium]               42.1    0.36 
PYX10427.1 hypothetical protein DMG88_02280 [Acidobacteria bacter...  42.4    0.36 
OUW61959.1 hypothetical protein CBD58_02495 [bacterium TMED198]       41.7    0.37 
XP_013329970.1 Signal transduction protein Syg1 [Rasamsonia emers...  42.8    0.37 
MXV82240.1 hypothetical protein [Candidatus Poribacteria bacteriu...  42.1    0.38 
WP_179899663.1 hypothetical protein [Actinomyces bowdenii]MBF0696...  42.1    0.38 
TRY69347.1 hypothetical protein TCAL_06925 [Tigriopus californicus]   42.8    0.38 
MBI5804310.1 hypothetical protein [Candidatus Pacearchaeota archa...  41.7    0.38 
SQB60826.1 Protein of uncharacterised function (DUF975) [Clostrid...  40.9    0.39 
MBA2256010.1 hypothetical protein [Thermoleophilaceae bacterium]      41.7    0.39 
WP_161600755.1 zinc-ribbon domain-containing protein [Roseomonas ...  41.3    0.40 
KAF9588719.1 hypothetical protein IFM89_015156 [Coptis chinensis]     42.8    0.40 
MBD3174297.1 hypothetical protein [Armatimonadia bacterium]           41.7    0.40 
OLD50887.1 hypothetical protein AUI42_01215 [Actinobacteria bacte...  41.7    0.40 
KKP95651.1 hypothetical protein US03_C0003G0051 [candidate divisi...  42.1    0.41 
WP_090816307.1 DUF975 family protein [Oribacterium sp. KHPX15]SEA...  42.4    0.41 
NLE36775.1 zinc ribbon domain-containing protein [Pirellulaceae b...  42.4    0.41 
MBI4764717.1 zinc-ribbon domain-containing protein [Deltaproteoba...  41.3    0.42 
NLH15281.1 hypothetical protein [Phycisphaerae bacterium]             42.4    0.42 
XP_030942390.1 uncharacterized protein LOC115967438 [Quercus lobata]  42.1    0.42 
CAB1348301.1 unnamed protein product [Coregonus sp. 'balchen']        42.8    0.42 
WP_146582994.1 hypothetical protein [Rhodopirellula pilleata]         42.1    0.43 
MBD3166946.1 hypothetical protein [bacterium]                         42.4    0.43 
RMD59649.1 hypothetical protein D6821_00790 [Candidatus Parcubact...  42.1    0.45 
RLG18588.1 hypothetical protein DRN67_04035, partial [Candidatus ...  41.7    0.46 
WP_084417361.1 hypothetical protein [Mariniblastus fucicola]QEG23...  41.7    0.46 
MBC8872297.1 hypothetical protein [Planctomycetes bacterium]          42.4    0.46 
HAN63642.1 TPA: hypothetical protein [Rhizobiales bacterium]          37.8    0.47 
MBI1913873.1 hypothetical protein [Planctomycetes bacterium]          39.8    0.47 
QDU80323.1 hypothetical protein Pla110_20500 [Polystyrenella longa]   42.1    0.47 
PWA73344.1 hypothetical protein CTI12_AA261870 [Artemisia annua]      42.1    0.48 
MBI2836781.1 zinc-ribbon domain-containing protein [Acidobacteria...  42.4    0.48 
BAT06933.1 Os09g0129600, partial [Oryza sativa Japonica Group]        40.5    0.49 
WP_172304196.1 FHA domain-containing protein [Pseudenhygromyxa sp...  42.4    0.49 
HBE07214.1 TPA: hypothetical protein [Firmicutes bacterium]           41.7    0.49 
RPI63797.1 hypothetical protein EHM48_01870, partial [Planctomyce...  42.1    0.49 
RLI90893.1 hypothetical protein DRO65_02145 [Candidatus Altiarcha...  41.3    0.49 
WP_052298606.1 hypothetical protein [Syntrophobotulus glycolicus]     40.9    0.50 
MBC2734268.1 hypothetical protein [Desulfobacteraceae bacterium]M...  42.1    0.50 
OHD13769.1 hypothetical protein A2Y41_03625 [Spirochaetes bacteri...  41.7    0.50 
HAS54469.1 TPA: hypothetical protein [Nitrospiraceae bacterium]       41.3    0.50 
OAD55042.1 putative Dol-P-Man:Man(7)GlcNAc(2)-PP-Dol alpha-1,6-ma...  42.4    0.52 
PKO19890.1 hypothetical protein CVU38_21450 [Chloroflexi bacteriu...  41.3    0.53 
RIO89492.1 zinc ribbon domain-containing protein [Staphylococcus ...  41.7    0.53 
OIW10275.1 hypothetical protein TanjilG_28026 [Lupinus angustifol...  41.3    0.53 
VVA18096.1 PREDICTED: LOC100276777 [Prunus dulcis]                    40.5    0.54 
MBE0634690.1 hypothetical protein [Candidatus Bipolaricaulota bac...  41.3    0.54 
WP_109274543.1 hypothetical protein [Brachybacterium endophyticum...  41.7    0.54 
HIF59940.1 TPA: hypothetical protein [Rhodospirillales bacterium]     39.4    0.55 
WP_125026058.1 hypothetical protein [Chryseobacterium carnis]AZI3...  41.7    0.56 
WP_152622641.1 zinc-ribbon domain-containing protein [Archangium ...  41.7    0.56 
NLP29229.1 hypothetical protein [Clostridia bacterium]HCW05157.1 ...  40.9    0.57 
OJU53307.1 hypothetical protein BGN96_09595 [Bacteroidales bacter...  41.7    0.58 
OQY57355.1 hypothetical protein B6247_00535 [Beggiatoa sp. 4572_84]   40.9    0.58 
NIQ96043.1 NINE protein [Desulfuromonadales bacterium]NIS42136.1 ...  39.4    0.58 
NTU80423.1 hypothetical protein [Chloroflexales bacterium]            41.7    0.59 
WP_169527880.1 hypothetical protein [Flavobacterium sp. SE-s28]NM...  41.3    0.59 
WP_005398774.1 hypothetical protein [Helcococcus kunzii]EHR33215....  40.9    0.60 
AVV84616.1 glycerophosphoryl diester phosphodiesterase [Shewanell...  42.1    0.60 
OGF20315.1 hypothetical protein A2Y83_03000 [Candidatus Falkowbac...  41.3    0.60 
XP_022895190.1 uncharacterized protein LOC111409367 [Olea europae...  41.7    0.61 
AKF17187.1 membrane protein [uncultured bacterium Csd4]               41.3    0.61 
MBI2932895.1 DUF4013 domain-containing protein [Planctomycetes ba...  41.7    0.61 
MBI5143244.1 zinc-ribbon domain-containing protein [Nitrospirae b...  40.1    0.61 
MBI3407269.1 hypothetical protein [Planctomycetes bacterium]          39.4    0.62 
SFB38995.1 hypothetical protein SAMN03159300_10480 [Janthinobacte...  42.1    0.63 
OPX25475.1 hypothetical protein B1H05_03755 [Candidatus Cloacimon...  41.7    0.64 
WP_088109248.1 DUF975 family protein [Tyzzerella sp. An114]OUQ579...  40.9    0.65 
NLH07244.1 hypothetical protein [Chloroflexi bacterium]               41.3    0.65 
MBA4064256.1 hypothetical protein [Isosphaera sp.]                    40.9    0.67 
WP_145428205.1 hypothetical protein [Symmachiella dynata]QDT50945...  41.7    0.68 
OGO90750.1 hypothetical protein A3F10_06675 [Coxiella sp. RIFCSPH...  41.3    0.69 
NEE54877.1 hypothetical protein [Streptomyces sp. SID8455]            40.9    0.70 
KZV19977.1 hypothetical protein F511_27652 [Dorcoceras hygrometri...  41.3    0.71 
MBI4857296.1 glycerophosphoryl diester phosphodiesterase membrane...  41.7    0.72 
HIH51751.1 TPA: hypothetical protein [Nanoarchaeota archaeon]         40.9    0.74 
WP_176066946.1 hypothetical protein [Anaeromyxobacter sp. R267]       40.9    0.75 
RME82318.1 hypothetical protein D6771_07200 [Zetaproteobacteria b...  41.7    0.75 
MBI5180183.1 zinc-ribbon domain-containing protein [Nitrospirae b...  40.5    0.75 
WP_141556510.1 DUF975 family protein, partial [Bacillus pseudomyc...  38.6    0.78 
WP_146323698.1 hypothetical protein [Corynebacterium canis]TWT268...  41.3    0.78 
OLL89795.1 putative integral membrane protein [Pseudonocardia sp....  41.3    0.80 
WP_157949979.1 hypothetical protein [Vallitalea okinawensis]          41.3    0.82 
WP_016148070.1 DUF975 family protein [Butyricicoccus pullicaecoru...  40.9    0.82 
HHS15334.1 TPA: hypothetical protein [Phycisphaerae bacterium]        41.3    0.86 
EKD49365.1 hypothetical protein ACD_63C00169G0001, partial [uncul...  41.7    0.86 
NYZ79902.1 hypothetical protein [Candidatus Micrarchaeota archaeon]   40.9    0.87 
RMD59350.1 hypothetical protein D6821_01435, partial [Candidatus ...  41.7    0.87 
WP_153043259.1 DUF975 family protein, partial [Bacillus cereus]       38.6    0.89 
HBC03416.1 TPA: hypothetical protein [Aequorivita sp.]                40.5    0.89 
NTU71802.1 hypothetical protein [Coriobacteriia bacterium]            41.3    0.90 
WP_145372228.1 hypothetical protein [Maioricimonas rarisocia]QDU4...  41.3    0.90 
PJA37463.1 hypothetical protein CO181_03395, partial [candidate d...  40.9    0.91 
EWC43568.1 hypothetical protein DRE_01455 [Drechslerella stenobro...  41.3    0.91 
CUB59715.1 hypothetical protein BN2127_JRS10_05231 [Bacillus subt...  39.8    0.93 
HCK71050.1 TPA: hypothetical protein [Planctomycetaceae bacterium]    41.3    0.94 
WP_012633310.1 zinc-ribbon domain-containing protein [Anaeromyxob...  41.3    0.94 
PIR48170.1 hypothetical protein COU80_05915 [Candidatus Peregrini...  40.9    0.96 
WP_156137542.1 hypothetical protein [Methyloceanibacter caenitepidi]  40.9    0.96 
WP_107725008.1 hypothetical protein [Desmospora activa]PTM58174.1...  40.9    0.96 
OZA74597.1 hypothetical protein B7X77_08285, partial [Caulobacter...  39.8    0.97 
OQD88128.1 hypothetical protein PENANT_c004G04864 [Penicillium an...  41.3    0.97 
WP_198181099.1 MULTISPECIES: zinc ribbon domain-containing protei...  40.9    0.98 


>WP_072908504.1 hypothetical protein [Malonomonas rubra]SHJ30007.1 MJ0042 family 
finger-like domain-containing protein [Malonomonas rubra 
DSM 5091]
Length=630

 Score = 299 bits (764),  Expect = 4e-90, Method: Composition-based stats.
 Identities = 630/630 (100%), Positives = 630/630 (100%), Gaps = 0/630 (0%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL
Sbjct  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
            QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG
Sbjct  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI
Sbjct  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
            YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA
Sbjct  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
            DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL
Sbjct  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300

Query  301  TPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAE  360
            TPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAE
Sbjct  301  TPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAE  360

Query  361  QLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPV  420
            QLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPV
Sbjct  361  QLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPV  420

Query  421  TLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEH  480
            TLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEH
Sbjct  421  TLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEH  480

Query  481  PAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIG  540
            PAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIG
Sbjct  481  PAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIG  540

Query  541  KTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSL  600
            KTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSL
Sbjct  541  KTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSL  600

Query  601  RQMFDGNIESITVLVAGDSMTQSYPFELTR  630
            RQMFDGNIESITVLVAGDSMTQSYPFELTR
Sbjct  601  RQMFDGNIESITVLVAGDSMTQSYPFELTR  630


>PLY07819.1 hypothetical protein C0624_03270 [Desulfuromonas sp.]
Length=912

 Score = 154 bits (388),  Expect = 1e-36, Method: Composition-based stats.
 Identities = 126/605 (21%), Positives = 216/605 (36%), Gaps = 26/605 (4%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             T+ CPHC   R+    K+PA+  + +C +C Q   FD   +    +     T     L 
Sbjct  94   ITITCPHCKVSRSVNRDKIPARPVTLQCHQCQQRFDFDIRTTAPQPSALPAQTTNPEMLP  153

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
               P+     +               QPE     + + L  I +L + S+ +F RR   L
Sbjct  154  PTPPTQPTPAEK----------PASAQPEAPAETARTHLSDIPELFSRSFAVFKRRILTL  203

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA-YILLGLSWMTGSMFI  180
            +GI LL +VL FA  F       A             +  A      LL L+ + G++ +
Sbjct  204  IGINLLAMVLTFAAYFLLGAAIDALQNMFGQNPVVLTLASAVSIGLSLLMLTALGGAITM  263

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             I   D+G+  ++ LG++   SF  + +LL  ++ GG L   IPG++F VWF F Q++LA
Sbjct  264  AIVDEDLGVRPALGLGIQRFASFLWVFVLLGFIISGGYLAFFIPGVVFTVWFIFAQFILA  323

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
            ++++ G++AL KSR  V GH+W I GR  ++       S L   IP +G    L      
Sbjct  324  EEDLRGMEALLKSRSYVEGHFWGILGRLAVIA----VASGLLTSIPLLGILFALFAG---  376

Query  301  TPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAE  360
             PF+ +YY+ +Y DL A  +   +   +++      A     LI  L  + L    L   
Sbjct  377  -PFTLIYYHEVYCDLVAIKQQVSYTSSRKEKSKWLLAGASGYLIVLLGGILLLGPALLQG  435

Query  361  QLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLS----KQRKTTSEGGLS  416
                   D+                S        +       ++          ++  + 
Sbjct  436  LATFGAHDLNPFEQETSAPGEGWLSSDAYLDLPKTQYVTHEPITVNVVAPANLPADAWVG  495

Query  417  LGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQH  476
            + P  +       +DQ+   +  L      ++             ++ D D         
Sbjct  496  IVPSYVPHGDEVRNDQHNLSYQYLNGEILTSMQFYAPAVPGDYDLRLNDSDNNGRELTST  555

Query  477  SFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTR  536
            +F     +           L     S     G +   + +    L+    + I   Q+  
Sbjct  556  TFTVVLPYDDNSPAATSAQLILARTSF--VPGAKVVVIFTAPEDLQTDAWVGIVPTQIAH  613

Query  537  NDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDL-LNVHASNSHAEPLREIGFTWQKSG  595
             D     Q       L    +  +T           L +H ++S    +  + FT   S 
Sbjct  614  GDEALNDQYDLSYQYLNGQTTGTLTFTAPNSSGQYDLRMHDTDSSGREIASVTFTVGASS  673

Query  596  DAFSL  600
               SL
Sbjct  674  PQQSL  678


>RME32505.1 tetratricopeptide repeat protein, partial [Gammaproteobacteria 
bacterium]
Length=725

 Score = 147 bits (369),  Expect = 2e-34, Method: Composition-based stats.
 Identities = 84/453 (19%), Positives = 152/453 (34%), Gaps = 34/453 (8%)

Query  202  SFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW  261
                             L ++   + F V       V A++ IG   A+ ++  L  G+ 
Sbjct  185  WLQYAGSAASPAFALLLLAVVAASIFFLVALSLTGAVAANEEIGPWDAVRRAWDLARGYR  244

Query  262  WAIFGRFVLLLVISLTLSFLTA-------------------RIPYVGEAANLAFSLLLTP  302
            WAI G  + L++  + L  L                      I  +     +    L   
Sbjct  245  WAIVGNVLFLMLAVMVLYLLLQFAGFLLGLLAGLIFAPLAVVILPLTLILMVLLQFLSMA  304

Query  303  FSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQL  362
                   L Y + +    G + P  +        +      + G  + +           
Sbjct  305  LFHFLITLFYLEARVVKEGWRPPWREALRQDWPRSDAAPPHLGGRGVRAWMELLGVTLLA  364

Query  363  LSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTL  422
             +    +    G   +       +  +   +           +    +   G+       
Sbjct  365  TALLAALMLWTGRSMEAWLRHATTGLDLQMQQGQGTPPAGADQGDGLSITLGVDTF----  420

Query  423  FADRFWADDQNPHLWLKLELSDFPNLSLAQKGS---ARIEIDKVLDDDARDLYDRQHSFE  479
                F  + Q+P LWL L L+ F  L  A+      ARI I++V DD   + Y+    FE
Sbjct  421  ----FTQNRQDPSLWLVLRLNGFRPLWPAELHPQGLARIVIEEVRDDRGGNRYNAGSQFE  476

Query  480  HPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDI  539
               F  V +          GIR+++L  GT+ E V +I G++EL LP  + SL LT    
Sbjct  477  QGVFSRVDLQPGTRQGELRGIRTVHLLPGTREEDVKAISGRVELNLPDEVSSLVLTPAQE  536

Query  540  GKTLQIG-GKQLILQRLGSNAVTLRF--LGDRTDLLNVHASNSHAEPLREIGFTWQKSGD  596
            G+T+    G + +LQ+ GS  + L     G +  LL V   ++    +  +     +  +
Sbjct  537  GETVTTEDGLKFVLQQFGSEEIRLALPDDGPQERLLGVLGVDNGGRLIAPVSRASSRGRN  596

Query  597  AFSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
             + +   F   +  + + + G       PF L 
Sbjct  597  RW-ISYGFSTPVRELRLYLGGPRRKLELPFTLR  628


>PYM54254.1 hypothetical protein DMD77_23985, partial [Candidatus Rokubacteria 
bacterium]
Length=481

 Score = 135 bits (337),  Expect = 4e-31, Method: Composition-based stats.
 Identities = 38/181 (21%), Positives = 73/181 (40%), Gaps = 3/181 (2%)

Query  451  AQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQ  510
            A     R+ ID V     ++L   +              +T  +      +++ L  G  
Sbjct  34   ASDDRVRLSIDSVKSTGGQELLRPEACGRER-NPQPAAFKTWGSHRLKASKTLRLIDGAD  92

Query  511  AEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTD  570
               + S+ G+++L LP   E   L+    G   +  G    + ++   +V+ +  G R  
Sbjct  93   PHALQSVSGQVQLRLPTRTEVASLSHPAAGAVAERHGAVFTVTKVAGGSVSYQIEGARDR  152

Query  571  LLNVHASNSHAEPLR--EIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFEL  628
            +L + A N+  +PL       +    GD  S ++ + G I+ + V+ A +  T  +PF L
Sbjct  153  VLFLRALNAKGQPLASPSSFSSSFMFGDGASGQKNYAGTIDRLEVVFAAEEQTLQFPFTL  212

Query  629  T  629
            T
Sbjct  213  T  213


>NNJ91639.1 hypothetical protein [Gammaproteobacteria bacterium]
Length=873

 Score = 136 bits (340),  Expect = 2e-30, Method: Composition-based stats.
 Identities = 44/264 (17%), Positives = 83/264 (31%), Gaps = 9/264 (3%)

Query  373  LGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQ  432
                 Q           +    ++    L     +   +      GP      +    D 
Sbjct  358  AKDPSQSVGPDKIQTHPKDYAANAVLANLPDITFKDHETPALFYQGPFAADIKQISQTDD  417

Query  433  N---PHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFE----HPAFHW  485
                  +  K+ + +  N            +D + D    +L   +        H     
Sbjct  418  GLFEIWIEAKVAIPESQNWGREYTAETTFIVDSITDQQGNELLRDERCVTGMMIHGKNQE  477

Query  486  VGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQI  545
               N    +D  S  + + L    Q E++H I G +E T P+++    +     G +++ 
Sbjct  478  PSSNTMINSDTSSAWKRVRLIPDAQLEKIHRIEGNIEFTAPVSVSVHDV-PLKAGASVEA  536

Query  546  GGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFD  605
             G +  +     + +T +  GD   +L + A NS  +PL                 Q + 
Sbjct  537  EGVRFYINHFKDSQLTYQLSGDTDRVLEIRALNSAGQPLETSWKMGSVDPTGQ-QNQAYQ  595

Query  606  GNIESITVLVAGDSMTQSYPFELT  629
            G I S+ V +A     QS  F +T
Sbjct  596  GKIASVQVFIARQFNQQSMQFVIT  619


>WP_049721524.1 hypothetical protein [Gilvimarinus polysaccharolyticus]
Length=1031

 Score = 135 bits (337),  Expect = 4e-30, Method: Composition-based stats.
 Identities = 47/326 (14%), Positives = 101/326 (31%), Gaps = 13/326 (4%)

Query  317  ANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQ  376
                  ++ P                 +   +      Q   +  + S    +       
Sbjct  437  WRTDAEKNWPEVAGLYDRLEVTRSDESVGATVFFDRHLQRDISNWVSSIFSGMFSSSDDA  496

Query  377  PQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHL  436
            P +               +         +   +        GP  +  +     ++    
Sbjct  497  PTEERLETDPPVFTNITGAGFKPYASYPELHNSMFTPATEAGPFAIGIESLKVSEEQQLE  556

Query  437  WLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDL  496
                 L+     +  +  +A   +  V D   + L       E        I  +     
Sbjct  557  VALKMLAFNLANAPTRGAAATFTLVDVTDTQGQSLLPALECGEDKNLAPADIGTSMSATH  616

Query  497  F-----------SGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQI  545
            +            G ++I L   T+   +  + G ++     A+E ++L +   G+ ++ 
Sbjct  617  YVDGEPVPYTSTQGSKTIALSPETKVSDIAELKGYIDYRQVTAVEQIKLQQPLAGQVIES  676

Query  546  GGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQK--SGDAFSLRQM  603
             G +L     G +A+  R+ G+  +LL+VHA N + +PL   G  W     G   +    
Sbjct  677  NGLRLRFMDAGPHALQYRYSGEVDNLLHVHALNKNGQPLSSGGAMWGGAMFGSGKTASVN  736

Query  604  FDGNIESITVLVAGDSMTQSYPFELT  629
            ++G I S+ V+VA  + +  YP  + 
Sbjct  737  YNGEIASVAVVVATQASSTRYPLSVK  762


>PYN97354.1 hypothetical protein DMD91_18295 [Candidatus Rokubacteria bacterium]
Length=879

 Score = 134 bits (336),  Expect = 5e-30, Method: Composition-based stats.
 Identities = 41/260 (16%), Positives = 90/260 (35%), Gaps = 5/260 (2%)

Query  374  GTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQN  433
                    +   + P   Q  +S          +    +     GP  L           
Sbjct  353  PAAETARAEHLDTSPLVFQATTSTSTLGAYDPAKTFAEDVDQIQGPFGLRLGEIRLGSSP  412

Query  434  PHLWLKLELSDFPNLSL--AQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQT  491
                  +  S    +     +   AR+ +D V     ++L   +               +
Sbjct  413  DVGLEVVVESFASAIPNVTEESRRARLFVDSVKSSGGQELLRPEACGRER-NDEPSSFTS  471

Query  492  DENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLI  551
              +      +++ L  G     +HS+ G +EL LP   ES+ ++ +  G  ++  G   +
Sbjct  472  SVSQWVQASKAVRLIAGADPRALHSVSGHVELRLPTRTESVMISASQPGTKIERHGATFV  531

Query  552  LQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKS--GDAFSLRQMFDGNIE  609
            +  +   ++  + +G R  +L+  A N+  +PL   G        G+  + ++ + G ++
Sbjct  532  VTSVTGGSLGYQIVGARDRILHFRALNASGQPLASSGGFSADFLLGEGVAGQKEYAGVVD  591

Query  610  SITVLVAGDSMTQSYPFELT  629
             + V+ A +  T   PF L+
Sbjct  592  RVEVVFAAEEQTVRSPFTLS  611


>PYN07539.1 hypothetical protein DME02_11495, partial [Candidatus Rokubacteria 
bacterium]
Length=649

 Score = 131 bits (328),  Expect = 3e-29, Method: Composition-based stats.
 Identities = 44/205 (21%), Positives = 77/205 (38%), Gaps = 4/205 (2%)

Query  427  FWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWV  486
                D  P L ++      PNL+   +  AR+ +D V+   A+++   +           
Sbjct  204  PSVSDSGPELDVEAFAGAVPNLAGGGE-RARLFVDSVMSITAQEVLRPEPCGRQR-NGLA  261

Query  487  GINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIG  546
                          +++ L  G     V +I G +EL LP  +E+ +  R   G TL   
Sbjct  262  STFTDSGGRRLRATKTVRLLAGADPRTVGTIAGHVELRLPTRVETQRAARPTPGATLAAH  321

Query  547  GKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQK--SGDAFSLRQMF  604
            G  + + ++    V  +  G R  +L V A N+  +PL            GD  + R  +
Sbjct  322  GATVTITKVDGGDVQYQITGARDRVLAVRALNAAGQPLASEMKMSSDFLFGDGSTARVQY  381

Query  605  DGNIESITVLVAGDSMTQSYPFELT  629
             G  ++I         +  +PF LT
Sbjct  382  AGIADTIEATFVVSEQSLRWPFRLT  406


>MBE0616275.1 RDD family protein [Burkholderiales bacterium]
Length=662

 Score = 128 bits (320),  Expect = 3e-28, Method: Composition-based stats.
 Identities = 62/360 (17%), Positives = 122/360 (34%), Gaps = 17/360 (5%)

Query  278  LSFLTARIPYVGEAANLA---FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
            LSFL A +  + +  +        ++  F+     L   D+       +  P     +  
Sbjct  98   LSFLRAFLRNLAKIISAIPFDIGFIMAAFTGRKQAL--HDMITKCLVVRTRPSHFGRVLA  155

Query  335  TAAIFGWMLIPGLLLVSLSRQNLSAEQL---LSAGKDIQQRLGTQPQQTPDLNRSLPEEP  391
            T      ++I G +           ++                      P +        
Sbjct  156  TGFGALVLVIGGSIAYFYYVAVPQMQEQTAASMQAAVKHAPPKKAMPARPTVANKPMPAT  215

Query  392  QRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLA  451
               +    +            G +  GP  L  D+F+       +W+K++L    +L LA
Sbjct  216  TAKAEPRMQGAAPALTGFDKPGSVRAGPAILTLDQFFPTS----IWIKVQLPLIKDLELA  271

Query  452  QKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTD-ENDLFSGIRSIYLRQGTQ  510
               +  I +  V++   ++ +D    FE   F    ++Q       FSG+R+++++ G  
Sbjct  272  P--APEITVSDVMNKSGQNFHDAASPFETGIFLRARLSQQPMPVPHFSGLRTLHIKSGLD  329

Query  511  AEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTD  570
             + +  I G+L +++P+    +     DIGK        + L+ L      L + G   +
Sbjct  330  EKALQKIDGELRISIPVDARPVAFEAADIGKEKPAHTSTVALKSLSGAEAVLHYRGTPAN  389

Query  571  LLNVHASNSHAEPLREIGFTWQKSGDAFSL--RQMFDGNIESITVLVAGDSMTQSYPFEL  628
            LL V         +         +G A  +  +  F   +  +  +VA     Q YPF L
Sbjct  390  LLRVRGYAKDGAAVASGWSQSPPAGKAADVDLKFKFKAPVSKVEAVVAAGLNEQKYPFSL  449


>WP_153498920.1 hypothetical protein [Alcanivorax sp. PA15-N-34]MQX52186.1 hypothetical 
protein [Alcanivorax sp. PA15-N-34]
Length=983

 Score = 127 bits (318),  Expect = 9e-28, Method: Composition-based stats.
 Identities = 40/328 (12%), Positives = 99/328 (30%), Gaps = 13/328 (4%)

Query  314  DLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRL  373
            D  A++       +KR  +  +    G  L+              A              
Sbjct  382  DTLAHHEPETVKLLKRVAVVRSKDQLGASLVLDGNAGEELMSLGQAVMGQMFSISSSMAG  441

Query  374  GTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQN  433
                ++   L  +      +   A+                          +       +
Sbjct  442  EENTERQEQLEDNPARFFLQFDPANLAAFDDIPDDHFHAAWKQGPLALRIKEIGVDPKND  501

Query  434  PHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDE  493
               +L L        ++A++    + +  + D     +       ++       +++T +
Sbjct  502  NRSYLLLNGQGRSLTNVAKEQGGSLVVSGIYDQSGNAIMPAPSCGQNRNQDPAMLSKTMD  561

Query  494  NDLFSG-----------IRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKT  542
               ++             + + L  GT  + V  + G++ L +    E+  +     G +
Sbjct  562  GTYYADGEFQHYGKHEVDKKVLLAPGTSLDAVARVEGEMTLQVATRTEARVVQMPVTGAS  621

Query  543  LQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQK--SGDAFSL  600
            +Q+ G ++        +++    GD + +L V A N   +PL     +      G   + 
Sbjct  622  VQLAGARVAFGNSEGQSLSYVTSGDVSRILAVRALNKDGKPLASQSHSSFGPVLGTGKAH  681

Query  601  RQMFDGNIESITVLVAGDSMTQSYPFEL  628
            +  + G   S+ V++A +     YPF +
Sbjct  682  QYDYHGTPVSVEVVIATELKPLRYPFVI  709


>WP_094984345.1 hypothetical protein [Cellvibrio mixtus]OZY86742.1 hypothetical 
protein CBP51_06950 [Cellvibrio mixtus]
Length=991

 Score = 127 bits (316),  Expect = 2e-27, Method: Composition-based stats.
 Identities = 60/342 (18%), Positives = 110/342 (32%), Gaps = 16/342 (5%)

Query  303  FSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQL  362
             +  +  L  +  K   +  Q  P                 +   +       +  +  L
Sbjct  386  INNTHKTLQITLDKTRQQIAQDWPEVSAIYDRLQLNHSAQQLQASIRFDQQINDELSALL  445

Query  363  LSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTL  422
             S          +   Q   ++ +     +                  +    + GP  L
Sbjct  446  SSMLSISAATEQSAVVQQEVIDENPARFAKAHREQLIPFAPQDDF-MDAVYKTAAGPFGL  504

Query  423  FADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYD---------  473
              +     D    + + ++  D PNL   +  SA + +  VLD     L           
Sbjct  505  GVESLALQDGRIQVKVGVKAYDLPNLGN-ENSSAFLVVTDVLDQQGNSLLALAPACGESD  563

Query  474  ---RQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIE  530
                +      A + +   Q       +G ++I L  G    QV  I G ++  LPL +E
Sbjct  564  PRAPEAIQYSYAINRIQDGQFVPGRALTGGKTIALPAGVNLGQVARIKGYIDYHLPLDVE  623

Query  531  SLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFT  590
            +  +     GK + + G ++  +  G++++   + G    LL V+A N+  +PL      
Sbjct  624  TKIVDAPLAGKLVDLHGTRIRFKSTGASSLRFEYSGATEHLLQVNALNAANKPLSSSSAM  683

Query  591  WQK--SGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELTR  630
                  G   S    F G +E   ++VA     Q Y FEL R
Sbjct  684  RSSLFWGSGKSASLDFKGTVEKAEIVVASAIENQRYDFELNR  725


>PCJ44889.1 hypothetical protein COA99_06475, partial [Moraxellaceae bacterium]
Length=651

 Score = 124 bits (308),  Expect = 9e-27, Method: Composition-based stats.
 Identities = 43/270 (16%), Positives = 97/270 (36%), Gaps = 12/270 (4%)

Query  369  IQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFW  428
                   + ++              L+    KL  +K         L  GP  +  D   
Sbjct  344  YAMGGNNKQRKPEAERIENNPVDYALNKHFEKLPDAKDSPFLESPLLKNGPFIIDVDYLK  403

Query  429  ADDQNPHLWLKLELSDFPNLSLAQKGSAR----IEIDKVLDDDARDLYDRQHSFEHPA--  482
             ++        L     PN+             + +  VLD + ++L   +    + +  
Sbjct  404  KNENGQFEMRVLGKVSLPNIETKDGSKPSGRMKLHVKSVLDKNGKELLKNERCDSNKSHF  463

Query  483  ---FHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDI  539
                H   +++    D  S  +++ L +G    +V  ++G++    P+ +   ++    +
Sbjct  464  GDRNHKATVSEYYSGDDISSWKTVRLEEGAIPSEVDRLIGEISFNSPIKVHKFKV-PLKV  522

Query  540  GKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFS  599
            G  +   G +  L ++GS  V  +  G+ + LL V A N+  + L +         D   
Sbjct  523  GSEIDHLGMRFYLSKVGSQQVGYQLSGNTSRLLEVRALNADGKVLSKSWR--FGDADGSK  580

Query  600  LRQMFDGNIESITVLVAGDSMTQSYPFELT  629
            + Q + G+++S+ + +   +     PFE+ 
Sbjct  581  VVQGYKGDVKSLELYLVEQTKELKLPFEVN  610


>WP_198262883.1 hypothetical protein [sulfur-oxidizing endosymbiont of Gigantopelta 
aegis]
Length=1036

 Score = 124 bits (308),  Expect = 2e-26, Method: Composition-based stats.
 Identities = 50/349 (14%), Positives = 107/349 (31%), Gaps = 22/349 (6%)

Query  304  SFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLL  363
                   + + L   +     P  K  W+ +   +   +     L ++       +    
Sbjct  424  MSAGLTNLQNLLNKIHLQSSLPEDKHGWVSMQLKLDNALKTSMQLSMNELVAKFLSTTFD  483

Query  364  SAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLF  423
                D+         +T +     P + Q    A                    GP  + 
Sbjct  484  LNSNDLSSTNTASKNKTTEEIELTPVKFQATYQASQLKAFDDSLDMFFTPAWIDGPFAIA  543

Query  424  ADRFWADDQNPHLWLKLELSDFPN-LSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPA  482
             D    +       +   L      +    +  A+I+I  V D    ++  + H  +  +
Sbjct  544  VDELLLESSGDAEQIIFHLRGKAQNIDNLGQQQAKIKILAVTDQQGNNVLLKNHCNDSNS  603

Query  483  FHWVGINQTDEN-------------------DLFSGIRSIYLRQGTQAEQVHSILGKLEL  523
                   + +                     +       + L+      QV SI G++EL
Sbjct  604  KIAETDAKNETYFSLFGTAKTAFVGNKQVHYNELEVRHKVKLKPDVSFSQVKSIRGEIEL  663

Query  524  TLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEP  583
             L    + ++  +    K + +   Q++ +   S+ ++   LG+   +L+V A N + + 
Sbjct  664  NLATQTQRIEFQKTRQNKNVMVQDSQILFKPSASDTLSYTVLGNEKRVLSVRALNKNKQV  723

Query  584  LR--EIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELTR  630
            L               +S+ Q + G I  I V+ A +     YPFE+ +
Sbjct  724  LHRMSRSSMSNLLMSGYSVSQTYQGEIAFIEVIYATEFSPLKYPFEIKK  772


>RME67327.1 hypothetical protein D6778_03425 [Nitrospirae bacterium]
Length=233

 Score = 115 bits (286),  Expect = 3e-26, Method: Composition-based stats.
 Identities = 57/237 (24%), Positives = 100/237 (42%), Gaps = 6/237 (3%)

Query  393  RLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQ  452
               +A  ++L  K            GP  L +DRFW D  +PHLWLK+ +  FPN+S   
Sbjct  1    MSPAAYDRVLQRKAPAFGEGPYTYAGPAVLKSDRFWDDPNSPHLWLKVRVPSFPNISY--  58

Query  453  KGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAE  512
                 ++ID V+  +  DLYDRQ  FE     +  ++    N    G+R I+L +GT+ E
Sbjct  59   -SVVEVKIDSVMSKNGADLYDRQSMFEKTEV-FHRVHLAANNQFSEGLRDIHLLKGTKKE  116

Query  513  QVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLL  572
             +  I G + L LPL +  L++ +       +     + ++      +TL   G +  + 
Sbjct  117  MIKEIKGSVILHLPLGLSVLEV-QTKKAAVSRTNSATVTVEDATPQRITLLVKG-KAQVS  174

Query  573  NVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
             V   N+    +         +     +   F    + + +++      ++YPF L 
Sbjct  175  AVSGFNATGREITPTMTLTSDTAGGQRIVMTFPEVPKKLKLILKTGEHLKTYPFSLR  231


>PLX80586.1 hypothetical protein C0615_00685 [Desulfuromonas sp.]
Length=630

 Score = 121 bits (302),  Expect = 5e-26, Method: Composition-based stats.
 Identities = 92/632 (15%), Positives = 191/632 (30%), Gaps = 38/632 (6%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  + CP+CG  R  P  K P K   A C  C  T   DPA+ Q       +        
Sbjct  1    MVKISCPYCGLTRTIPKEKAPKKPVKAHCHRCKHTFPVDPAKLQPVDPQRELVESARTAA  60

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
            +    + + +           + +                      +             
Sbjct  61   RASQETQQSQAP---------SSAETEPTTHTGLGGSPESIGHLFTMTWGIFSSRFFTLF  111

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             L + L  ++L    I     +    ++         A  L     +  G+ W  G+   
Sbjct  112  GLYLLLFLLILVPGGILGGGTIALTQFIPGMEWLVVPAGSLLAALAVTYGMFWGYGAFIH  171

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +   + G   +++ G   V SF   L L   ++ GG LL  IPG+LF VWF F  +++A
Sbjct  172  AVIDQECGFKEALRRGREQVWSFLWALSLSSFLIVGGYLLFFIPGILFSVWFLFVPFIIA  231

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
             +   G+ AL KS   + G     F +  +L ++   +      IP  G   +L      
Sbjct  232  VEQERGMDALLKSYEYIKGFSMNSFFKLFILTLVCGAI----GAIPIAGPFLSLL----T  283

Query  301  TPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAE  360
            TPF+ ++++++Y DL+           + +      A     ++  +++ +L    + + 
Sbjct  284  TPFTLVFFFVMYYDLQQIKGNRPIEASRGRKTKWILAGSIGYIVLPIIIFALFGGTIFSG  343

Query  361  QLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPV  420
             + +      +      Q  P       ++    +S     +L      T          
Sbjct  344  LMFAMNSSDFKFTPPVVQSPPRPATPSVKKESPQASIPPAKVLESNATLTISTLKVDPGE  403

Query  421  TLFADRFWADDQNPHLWLKLELSDFPNLSL-AQKGSARIEIDKVLDDDARDLYDRQHSF-  478
             +       D+  P  W+ L   D PN      +     +   V           +    
Sbjct  404  NIVIKFHGIDEPAPKDWIALFPVDAPNQEYGEWEYLGSKKNGAVTFIAPARPGVYEARLF  463

Query  479  ---EHPAFHWVGINQTDENDLFSGIRSI----YLRQGTQAEQVHSILGKLELTLPLAIES  531
                   +H    +        +   ++      +  +  ++     G+   + P +  +
Sbjct  464  LDWPTGGYHAAARSTKIYVGEITPTTTVQTAPPPKPSSPLDKFRPKNGQTADSAPTSFSA  523

Query  532  LQLTRNDI---------GKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAE  582
                   +            +++ G+     + G   ++  + G        +  + +  
Sbjct  524  SVADPAQLYIYVFAINYTGEVRLNGESFYELK-GEQDMSYNYTGQGKLKHGENVFDVNFN  582

Query  583  PLREIGFTWQKSGDAFSLRQMFDGNIESITVL  614
             L      W    +       F+   E+I   
Sbjct  583  TL--PDDPWTTKIEIKVYSYDFNSKKETIVAK  612


>TNE28966.1 hypothetical protein EP349_07150 [Alphaproteobacteria bacterium]
Length=901

 Score = 119 bits (297),  Expect = 3e-25, Method: Composition-based stats.
 Identities = 40/275 (15%), Positives = 86/275 (31%), Gaps = 23/275 (8%)

Query  378  QQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQN----  433
            +Q  D            +     L          +   + GP  +  ++     Q+    
Sbjct  361  KQREDTVPKRELPQFSENYTRDNLPAFSPANVFIKADAAAGPFGIAVEKAALITQHGRTI  420

Query  434  PHLWLKLELSDFPNLSLAQ-------KGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWV  486
              L LK+      N+              A ++I  V D    DL   +           
Sbjct  421  TELTLKVHSESLKNIPADNLNIDDDASAVATVQISGVYDRMGEDLLAAEDCGTDSNNRET  480

Query  487  GI----------NQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTR  536
             +            +  +    G +++ L      + +  I G++ L LP  I+S++   
Sbjct  481  PLAVHSRLEFDNGTSSTHTYLEGRKTLRLIPEAALQDIAKITGRVTLQLPQNIKSVRYAA  540

Query  537  NDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGF--TWQKS  594
                + ++    +L  +    + +     G +  +L VHA N     LR      +    
Sbjct  541  PLDQQEVKTDNARLYFRDNDEHRLKYTVEGRKNHILAVHALNEQGYVLRNRSSLVSSSAF  600

Query  595  GDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
             +  ++ +   G I    ++   ++  Q +PFE+ 
Sbjct  601  KNQRNVSRDIQGRIAGAEIIYTDETQPQEFPFEIN  635


>NOY85388.1 hypothetical protein [Nitrospirae bacterium]
Length=277

 Score = 112 bits (279),  Expect = 7e-25, Method: Composition-based stats.
 Identities = 55/263 (21%), Positives = 109/263 (41%), Gaps = 13/263 (5%)

Query  372  RLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADD  431
                  Q+   + ++              +    + + ++  G  LGPV++  D F    
Sbjct  20   NAQLSFQEGRKIWKNYSASQYDERLRAAAVNFPDRFRVSTNRGADLGPVSIRLDDFVD-G  78

Query  432  QNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQT  491
             +P+    + ++  PNL L  + S ++ I  V D +  DLY+R+        + +G    
Sbjct  79   DHPYFEYWVRVAALPNLELDNQVSLKLIIRHVWDLEKNDLYNREAVSRQSLQNIIG----  134

Query  492  DENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLI  551
              +D  + I  +YL+QG+    +  I G++   LPL +  L   ++D+ K  Q+ G    
Sbjct  135  --DDALTFISKLYLKQGSDVRVIQEISGEIVFILPLELTWLTFEKSDLKKKKQVPGIVST  192

Query  552  LQ------RLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFD  605
            +       +  S+ V +R+ G     L   A +   + L++ GF      + +     F 
Sbjct  193  VSLSNLRFKDESSWVEIRYQGKEEAYLTTLAYDKTGKQLKKKGFAKNSVSEGWLYAYAFA  252

Query  606  GNIESITVLVAGDSMTQSYPFEL  628
            G +E I  L +   + + YPF++
Sbjct  253  GQVEKIQTLFSSHVIERKYPFKI  275


>WP_113955051.1 hypothetical protein [Arenicella xantha]RBP49185.1 hypothetical 
protein DFR28_104113 [Arenicella xantha]
Length=1042

 Score = 116 bits (289),  Expect = 3e-24, Method: Composition-based stats.
 Identities = 41/299 (14%), Positives = 92/299 (31%), Gaps = 15/299 (5%)

Query  336  AAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLS  395
                   ++ G   +++   +  +           Q+   +               ++  
Sbjct  473  LFSELGSMVFGGGALTMGSPSEESLMEAVWDFSQNQQFVEENPFQLGSFDKYLPATKQGG  532

Query  396  SADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGS  455
             A +   +  +   +         + L  +            +      F          
Sbjct  533  QAVFVGGVGLKPINSFASDAWAEDLALQLELVAKRMG----PVLPAGQTFEGDVSDSGME  588

Query  456  ARIEIDKVLDDDARDLYDRQHSFEHPA----FHWVGINQTDENDLFSGIRSIYLRQGTQA  511
              +++  VL+   + +   +H     A     H +  +   +  +F   +S+ L  GTQ 
Sbjct  589  YSLQLKDVLNKQGQSILRDEHCLRGAAFGGRNHELAESHQYQAGVFEVRKSVRLTSGTQI  648

Query  512  EQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDL  571
              +  I G+L   +P  ++S  L+        +  G    L  +   +V     GD + +
Sbjct  649  ADIGQIQGQLSFRVPTEVQSTVLSMRGER-MFEYEGTAFRLSDIEGGSVQYSIDGDESKI  707

Query  572  LNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVA--GDSMTQSYPFEL  628
            + + A N+    LR  G          S  Q F G ++S+ V  +   D   Q + F+ 
Sbjct  708  VALRALNADGNVLRSQGKMTF----GDSTAQSFAGAVDSLQVFYSVKTDEQLQDFDFDF  762


>WP_041551596.1 hypothetical protein [Cellvibrio japonicus]ACE85850.1 hypothetical 
protein CJA_3128 [Cellvibrio japonicus Ueda107]QEI13415.1 
hypothetical protein FY117_15075 [Cellvibrio japonicus]QEI16989.1 
hypothetical protein FY116_15080 [Cellvibrio japonicus]QEI20567.1 
hypothetical protein FY115_15075 [Cellvibrio 
japonicus]
Length=1121

 Score = 116 bits (289),  Expect = 4e-24, Method: Composition-based stats.
 Identities = 57/359 (16%), Positives = 106/359 (30%), Gaps = 15/359 (4%)

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWML  343
              P   E ++   S        +   L  S  +A +      P         +       
Sbjct  498  LFPPAVEFSSALASSDAQAIQAIRQQLSNSLQQARHTLAVDWPETLPLYERLSVGGDESH  557

Query  344  IPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSAD-YKLL  402
            +    L     Q    + + S        LG   +   +     P     + +       
Sbjct  558  VAINALFDEQVQQEIQQWISSLLGKAFSPLGLTMEVAQERVDESPLLFADVPATPLADFA  617

Query  403  LSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDK  462
              K    +    +  GP  L        D    L L +   + PNL   +  S  + I  
Sbjct  618  SVKHFNQSFVPQVETGPFGLAISSLEQTDNGALLNLDVAAFNLPNLGR-ETDSVSLRITD  676

Query  463  VLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLF-----------SGIRSIYLRQGTQA  511
            + D     L                IN   + ++F            G + I L      
Sbjct  677  IADAYGNSLLATTDCATQSLRQATPINLIYQGNVFDQGQPVGFVGLQGGKKILLPPTLDL  736

Query  512  EQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDL  571
             +   I G +E  LP  IE L L +   G+ +   G  +     G + +  +  G+   L
Sbjct  737  TRAGLIKGVIEHRLPTRIERLSLAQPLAGQKIDAQGVSVRFLSAGPSQLHFQVAGNTAAL  796

Query  572  LNVHASNSHAEPLREIG--FTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFEL  628
            L+V+A N   + L                ++   F G + ++ +++  + + Q++ F L
Sbjct  797  LHVNALNEAGKVLASNSALRGSNLLDQGKTVTIDFQGKVAAVELILVREQVLQTHEFTL  855


>PPR15797.1 hypothetical protein CFH43_00866, partial [Proteobacteria bacterium]
Length=389

 Score = 112 bits (278),  Expect = 7e-24, Method: Composition-based stats.
 Identities = 33/233 (14%), Positives = 84/233 (36%), Gaps = 20/233 (9%)

Query  410  TSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLS-----LAQKGSARIEIDKVL  464
                  S+ P   F  +    D    +++++E    PN++      ++    ++ I KV 
Sbjct  154  PFMISPSVEPFFSFGQKKEPTDDLERIYVQIESGIIPNMNISMHGDSENPRVQLFIQKVA  213

Query  465  DDDARDLYDRQHSFEHPAFHWVGINQTDENDL----------FSGIRSIYLRQGTQAEQV  514
            D   +++ + +    +     V +                   SG R + L +G   +Q 
Sbjct  214  DHSGQNILNIETCGPNRTAKAVELTAMPRISYEGNEQKRHTTVSGKRELKLLKGHTQKQA  273

Query  515  HSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNV  574
             ++ G ++L +P  I+++ +     G+T    G    + +     +     G +  +L+V
Sbjct  274  KTVEGYIQLRIPSMIKTVDI-PAKEGETYSFKGTNFTITKSSYGELGYTIEGQKDLVLDV  332

Query  575  HASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFE  627
               N   + +     +        ++ Q   G      +++A     +++ F+
Sbjct  333  MPLNKDKKVIEANSKSSS----GNNITQYPSGTPVFARLILASQEQERTFSFK  381


>PIR46206.1 hypothetical protein COV07_04540 [Candidatus Vogelbacteria bacterium 
CG10_big_fil_rev_8_21_14_0_10_45_14]
Length=530

 Score = 112 bits (278),  Expect = 3e-23, Method: Composition-based stats.
 Identities = 75/517 (15%), Positives = 154/517 (30%), Gaps = 28/517 (5%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
                   W ++ +  L   L      + L       +   N       +L +    +L  
Sbjct  36   MRMLAERWRVILMVSLSASLVGLLATALLSSYLLPLVGDSNTPGLLFFMLVSGVITILLS  95

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
            ++  G M+I    +D  + R++ L    +   + +++L  L++GG  L L+IPG++  V+
Sbjct  96   AFWQGGMYISANNSDAKVERALALSWNMLLPLSFVILLSALIIGGAILALVIPGIIAMVF  155

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL---TARIPYV  288
              F    +      G  A+  S  +V+G W  +FGR ++L V+      +      +   
Sbjct  156  LTFTVPSMVIGGKRGFDAIMYSFEIVNGRWLEVFGRVIVLGVLMTLAGLIPDLIGDVAIA  215

Query  289  GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLL  348
             +  +  F ++L  +S  Y  ++Y  L  +          R       AI   +L  G L
Sbjct  216  YDFVSFVFGIVLGAYSVCYMLVLYKSLATSAVKIGGDKRNRVKALFVGAIVLLLLSGGAL  275

Query  349  LVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQR-LSSADYKLLLSKQR  407
              S                         P+Q  D+N    E+P             +   
Sbjct  276  FASTLVYGTKVLTDFGFLFGGVFPDLQVPRQIEDINVDNREQPLELEMGIPEAEAATYDE  335

Query  408  KTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDD  467
               +    +L    +             L+ +  +  + +               +L  D
Sbjct  336  SIATFTKENLDNGQVRLRLNRFLGDMSLLFSRGSVFRYKDEENDSVNFEISSGFAILSRD  395

Query  468  ARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPL  527
                       E      + +       +  G  +  +    + ++   +     L+   
Sbjct  396  G-------SKGESANIKRLSLYTPHTVVMLQGTTTSEVLVSVRDDKHTDL---FVLSGKG  445

Query  528  AIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRF-LGDRTDLLNVHASNSHAEPLRE  586
            +I  L       G  L   G    +    S      F  G+      V       E    
Sbjct  446  SIRGLTSDPLATGIGLWA-GFMTTIDDTSSATEPREFDEGELPQTDTVEDFTKGKE----  500

Query  587  IGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQS  623
                    G +  +  +F G +  + V++      + 
Sbjct  501  --------GKSGKVSYLFYGAVGLLAVMLGAYFGKKR  529


>PLX78103.1 hypothetical protein C0616_15510 [Desulfuromonas sp.]
Length=553

 Score = 111 bits (276),  Expect = 6e-23, Method: Composition-based stats.
 Identities = 89/476 (19%), Positives = 176/476 (37%), Gaps = 27/476 (6%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              + CP C    N P  K+P +  +  C +C     F   +  +     +    P   + 
Sbjct  87   VAITCPACNFSSNVPREKIPPRTVNLTCRKCQNRFKFSGDQLDQPPKKQDAPPPPPKPVH  146

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
              +       +S   +                      LR I +LL DSW++F  R   L
Sbjct  147  PLLQDRTGPEESPKRS---------------------TLRKIPELLGDSWQIFKERILVL  185

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA-YILLGLSWMTGSMFI  180
            +GI LLG++L     F                     ++      +  L ++W++G+   
Sbjct  186  IGINLLGMLLIGVGAFILSSGFGNLTELFGENMITGLLVALLALTFSSLAVTWLSGATIC  245

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             I    +G+  ++  GL+ +  F  +  LL L++ G SL+  IPGLL    F F Q+++A
Sbjct  246  AITDDALGVRVALGAGLKLLLPFLWVFSLLGLIIFGASLVFFIPGLLLATLFMFAQFIVA  305

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
             +   G++AL KSR  V G +W + GR ++L ++   +  +   IP++G   +L      
Sbjct  306  TEESRGMEALLKSREYVKGFFWPVLGRVLVLAILMGFICGILGLIPFIGPLISLIL----  361

Query  301  TPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAE  360
             P++ + Y+ IYSDL+           +   +   A      L+  +++V +        
Sbjct  362  WPYTLIVYHEIYSDLRNIKSDMIFTCQRNDKVKWLAIGCAGYLVIPIIVVLMISTGAINN  421

Query  361  QLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADY-KLLLSKQRKTTSEGGLSLGP  419
             L    +     + + P Q           P    S  Y   +        ++  +    
Sbjct  422  SLTGQFQTGSSNVPSTPTQFNPQPAVNEPAPTMSDSMVYIYAVNYTGTVLLNDETIYTIE  481

Query  420  VTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQ  475
                 +     + +        +  +  +  A   S +++I ++  D   ++  ++
Sbjct  482  GEQDVNYNTTQEISLKPGPNELVVTYRQVPTASFFSLKLKIFRIDWDSGEEISLKE  537


>WP_144392134.1 hypothetical protein [Pleionea sediminis]
Length=891

 Score = 112 bits (278),  Expect = 7e-23, Method: Composition-based stats.
 Identities = 38/263 (14%), Positives = 89/263 (34%), Gaps = 10/263 (4%)

Query  371  QRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWAD  430
             +   +  +  + N        +L + D      +   +    G     ++        +
Sbjct  358  NQTQQEKAERIEENPWDYSGNDQLLNIDNLAPKEELISSAFSQGPFSVSLSEAQYNEELE  417

Query  431  DQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEH-----PAFHW  485
                 L  ++EL +  +     K +    +  +LD+   ++   +   +           
Sbjct  418  LTTIELRARMELPEVDSSWYKSKATLSFSVSSILDELDNEILRDERCIKDLPRFVGKNRA  477

Query  486  VGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQI  545
               N           + + L +G   E +H I GKL    P+ ++S  +    + +T   
Sbjct  478  PADNVAFNQGFALITKKLRLSEGKSYEDIHKISGKLNFEAPVDVKSYTI-PFKVNETAGN  536

Query  546  GGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFD  605
                + +     +  + + +GD   L+ V A N++ + L+         G+       F+
Sbjct  537  EHTHIKVLATQPSVASYKVIGDDKHLIEVRALNANGQVLQSSFRM----GNNNRQNTTFE  592

Query  606  GNIESITVLVAGDSMTQSYPFEL  628
            G I+ + V VA   + ++  F L
Sbjct  593  GKIDRLQVWVASRFIAKTVDFTL  615


>MAF31333.1 hypothetical protein [Magnetococcales bacterium]
Length=647

 Score = 111 bits (276),  Expect = 8e-23, Method: Composition-based stats.
 Identities = 39/333 (12%), Positives = 101/333 (30%), Gaps = 20/333 (6%)

Query  311  IYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQ  370
             Y D+  +             + ++      M      + +L +     +    +  +  
Sbjct  312  FYKDILESMEFDASGNKLTFNMAISGDQISQMSDIQNQIATLIQSQFKFQSTTESESEST  371

Query  371  QRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWAD  430
            +           +                    S+           L      +      
Sbjct  372  EPKEDVLADPRQIPYYQNNADIDNIDTSNIDQFSQDLGEKFGPFRVLVGAEAQSFSNRRK  431

Query  431  DQNPHLWLKLELSDFPNLS-----LAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFH-  484
              +  L++ ++    PN+        +   A++ I +V +    ++   +   +      
Sbjct  432  QNDSTLFVHVKSGIIPNIPASMHSNDENARAKVTIQRVENSQGNNILKLETCGKDRNNKP  491

Query  485  ---WVGINQTDEND------LFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLT  535
                   + T + D         G +++ L  G   +   ++ G LEL +   I+   + 
Sbjct  492  QALSASSHMTYQGDSALYQHTVKGEKALRLADGYTTDDATTVHGILELRIANVIQEDVI-  550

Query  536  RNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSG  595
                 K  ++G   + +++   N +T +       +LNV   +   + L+    T     
Sbjct  551  PATTDKPHKVGKALVHIEKAEYNNLTYKLEDPEGQILNVLPLSGKKQVLKANMKTKS---  607

Query  596  DAFSLRQMFDGNIESITVLVAGDSMTQSYPFEL  628
               S+ Q   G  + + V+ +    +++YPF+L
Sbjct  608  -GDSITQYASGTPQFVKVVYSTQEQSRTYPFKL  639


>EKD24616.1 hypothetical protein ACD_80C00181G0002 [uncultured bacterium 
(gcode 4)]
Length=439

 Score = 109 bits (269),  Expect = 2e-22, Method: Composition-based stats.
 Identities = 41/273 (15%), Positives = 82/273 (30%), Gaps = 6/273 (2%)

Query  361  QLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPV  420
               +   +              L      +P   +  D  L               LGP+
Sbjct  42   WYCAYSGEQVVPTTGAQNILVRLLPYEELKPMTATDYDRYLWQKINGFNQGGISADLGPI  101

Query  421  TLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEH  480
             L   +             + + +          +A+I+I  V D D +++Y  +   + 
Sbjct  102  LLKVVKPEYYGDFSIEGHMVPIPNIIAADGRFPENAQIQITSVSDKDNQEIYVDEDGIQP  161

Query  481  PAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIG  540
             +   +         + +      L  G     + ++ G + L LP  +  + +T  D G
Sbjct  162  LS---LKYQGYPSPQILASRGGYSLISGKTFADITTVKGIITLKLPTNVSKMIITPKDQG  218

Query  541  KTLQIGGKQLILQRLGSNAVTLRFLGDRTD---LLNVHASNSHAEPLREIGFTWQKSGDA  597
                IG   + +       + L   G  +      +++      E             + 
Sbjct  219  SKFNIGTWTVEINTFDPLQIKLIISGGNSSFDVQDHINFFTPEGEEASPSNTILSYINND  278

Query  598  FSLRQMFDGNIESITVLVAGDSMTQSYPFELTR  630
              L   FD  I  I ++ +  S  +SYPF LT+
Sbjct  279  GVLTYDFDTGINYIKIIYSDTSTIKSYPFVLTQ  311


>NIS09268.1 hypothetical protein [Candidatus Dadabacteria bacterium]NIU88862.1 
hypothetical protein [Nitrosopumilaceae archaeon]NIX15814.1 
hypothetical protein [Candidatus Dadabacteria bacterium]
Length=915

 Score = 111 bits (275),  Expect = 2e-22, Method: Composition-based stats.
 Identities = 42/266 (16%), Positives = 96/266 (36%), Gaps = 19/266 (7%)

Query  378  QQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLW  437
            +   +   +        +    ++      K      ++ GP  +  D     +++  L 
Sbjct  369  ENESEERINENPWDYANNKGFEQMSDFTPEKNWPIPSITDGPFGIVIDSVSLGEKSNLLE  428

Query  438  LKLELSD-FPNLSLAQKGS-------ARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGIN  489
            L ++     P        S         + ID V  +D ++L   ++  E    ++   N
Sbjct  429  LDIKSQIRLPKKDEDNFMSWFGSGAELTLVIDSVKSEDGQELRRDEYCMEKLEDNFAKKN  488

Query  490  QTDE------NDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTL  543
               E      N+     ++I L   T  + +  ++GK+  T+P  ++ + +T    G  +
Sbjct  489  SEPESGFAYSNNTAYVSKTIRLITETSLKDIDKVIGKVSFTVPSKVKKIPVT-LKKGTVV  547

Query  544  QIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQM  603
            +    +  L  +   +++ +  GD+  LL V A N   + L+    +     +     + 
Sbjct  548  EHNDMRFYLSSIKQQSISYQISGDKDKLLEVRALNDKGQTLQSSFGSSSSYRN----VKS  603

Query  604  FDGNIESITVLVAGDSMTQSYPFELT  629
            F GN++ + + V  +       FEL 
Sbjct  604  FRGNVQGLEIYVLDNKTEFERNFELK  629


>NRB77098.1 hypothetical protein [Saccharospirillaceae bacterium]
Length=900

 Score = 109 bits (271),  Expect = 5e-22, Method: Composition-based stats.
 Identities = 42/345 (12%), Positives = 100/345 (29%), Gaps = 16/345 (5%)

Query  296  FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQ  355
             +  ++                        P   + L   +       +   L+++    
Sbjct  273  INANVSWVENALIATQEYLQTLQKNIDSTSPTLAKLLGNFSIEKQQSSLITQLMLNEQDI  332

Query  356  NLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGL  415
                         +   +    +           E     S + K +  +  K T    +
Sbjct  333  GELPNVFKEVMSSVFSSMFGLNKNEKAQEEESIMESTWDYSKNEKWMNLQDFKITDTSDI  392

Query  416  S---LGPVTLFADRFWADDQNPHLWLKLEL----SDFPNLSLAQKGSARIEIDKVLDDDA  468
            S    GP+ L  +    + +   + +++                K +  + ID V +   
Sbjct  393  SRIVNGPLALHMESVKLNKELNLIEIRVHSSINEPKTDGSWSDSKANFTLSIDSVNNAAG  452

Query  469  RDLYDRQHSFEH----PAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELT  524
             +L   +   +      A H          D+    + + L   +  + V +I G++  +
Sbjct  453  ENLLRDERCVKDFSFGTANHQPATGYDSSFDIGYTDKRVRLTPNSTFKDVANISGEMAFS  512

Query  525  LPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPL  584
            +P A+E + L       + +  G +  +      +VT +  G   ++L +   N+  + L
Sbjct  513  IPTAVEYIDLDLAKE-ASYEKHGVRFYINDFAQQSVTYQVSGKTDNILEIRGLNAKGQIL  571

Query  585  REIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
                      G          G++  I ++VA         F L+
Sbjct  572  NTNFT----RGSGDKKTLYLKGDVAKIQLVVATQHSGHKASFNLS  612


>WP_008042065.1 hypothetical protein [Reinekea blandensis]EAR10730.1 hypothetical 
protein MED297_11960 [Reinekea sp. MED297] [Reinekea blandensis 
MED297]
Length=980

 Score = 109 bits (270),  Expect = 7e-22, Method: Composition-based stats.
 Identities = 46/344 (13%), Positives = 94/344 (27%), Gaps = 15/344 (4%)

Query  295  AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSR  354
              S        +                   P+    +   A       +     ++ + 
Sbjct  373  LGSNNAGWNQQVATTATTWLNSTRNNAQITSPVAADLMAAIAIQHDAQSVRLDWPITANL  432

Query  355  QNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGG  414
                 + + +                 +     P    +       L L     +T    
Sbjct  433  VTQLRQSIENTIASAFGGRLNGGDSNEEQIVDDP-ADYQARFELANLPLLSLNSSTVAPA  491

Query  415  LSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSA----RIEIDKVLDDDARD  470
               GP+ +  D     D +    +    +  P         A     + +  V   D   
Sbjct  492  FHEGPIAIDVDSLSMTDDDVMELIIDAEAAMPEGINDSTLRALVGQSLMVTDVKSRDGES  551

Query  471  LYDRQHSFEH-----PAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTL  525
            L   +H  +         H      +      S  + + L   T    +H I     L+ 
Sbjct  552  LLREEHCLDVQNVFGGKNHEAATQGSVFFQKASVNKRVRLIPDTHVADIHEIDVTYTLSR  611

Query  526  PLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLR  585
            P  I++  +     G+ +   G  + L  +G+ ++  R  G    LL V A N+  + L+
Sbjct  612  PTQIQTFAV-PLQAGEQVSYAGVSIDLVSIGNRSIRYRQSGQIDRLLEVRARNAAGQVLQ  670

Query  586  EIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
                +           Q + G I  + V++AG +  ++    LT
Sbjct  671  TSWSSSF----GRVKDQHYRGQIHQLEVIIAGSTDQRTATASLT  710


>MBI5694516.1 zinc-ribbon domain-containing protein [Nitrospirae bacterium]
Length=1084

 Score = 106 bits (263),  Expect = 6e-21, Method: Composition-based stats.
 Identities = 91/482 (19%), Positives = 159/482 (33%), Gaps = 51/482 (11%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              V   L G  W+   +   I    +G+  ++ +G   +     ++ L   +V GG L L
Sbjct  260  LGVVAGLAGAFWVFAGVISAIMDDGLGIKDALDVGRFKLWPMFWVMALSAFMVIGGYLFL  319

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIPG++F VWFF   ++L D +  G+ AL  SR  V G+ W +FGR +LL      LS  
Sbjct  320  IIPGIVFSVWFFLAIFLLMDQDGRGVSALLASREYVRGNGWGVFGRLLLLWG----LSTF  375

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
               IP+VG   +        P++ +  YLIYSDLKA+    +      +   L A     
Sbjct  376  AGMIPFVGPLLSFLV----WPYTIICGYLIYSDLKASKGEAEFTFSTGEKAKLIATGAFG  431

Query  342  MLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKL  401
             ++  LL+ +      S    L         L      +   + S          AD + 
Sbjct  432  YVLIPLLITAFI---FSYAAKLGVSPRDVLALSGISLPSLSGSLSGLTGGLFEKDADVRQ  488

Query  402  LLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEID  461
              ++ RK  +EG        +  D+         L   +       L      +  I   
Sbjct  489  AYAEYRKALAEGDADTLKGHIAKDKAKD------LEGPMAEVGLAMLKAFVPDNVTITDS  542

Query  462  KVLDDDARDLYDRQHSF-------EHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQV  514
             V  D A+     +                       +  ++  G  S+ L+ G  A  V
Sbjct  543  SVDGDVAKLTLSAETEGGVMKGTATMVREDGDWKLSEENWEMNVGPPSMKLKGGRGAPDV  602

Query  515  HSILGKLELTLPLAIES---LQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDL  571
              +     L  P    +    Q  ++      ++    + L         L F    +  
Sbjct  603  EPLA-DNNLAAPSMYVTGLAQQFIKDKANPPREL----ITLTGHEGEVTGLEF--LPSGR  655

Query  572  LNVHASN---------SHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQ  622
            L   + +         +    L       + +G             +  TV+++      
Sbjct  656  LVSASYDDYTVRMWDVATGAELASYKMENRPTG--------MAATPDGNTVILSDAYQNL  707

Query  623  SY  624
            ++
Sbjct  708  TF  709


>WP_146504520.1 hypothetical protein [Rubinisphaera italica]TWT62690.1 hypothetical 
protein Pan54_34350 [Rubinisphaera italica]
Length=704

 Score = 105 bits (259),  Expect = 1e-20, Method: Composition-based stats.
 Identities = 27/237 (11%), Positives = 68/237 (29%), Gaps = 10/237 (4%)

Query  398  DYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSAR  457
            +  +  +       E      P      +           +   L +  + + ++  +  
Sbjct  348  ETPVATNGPFLMLVESLEEYPPYAAGDLKLQLIAAGIPSAVVSSLQEVTDQADSENKTKA  407

Query  458  IEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSI  517
              I   L          + S        V +      D       +        +Q+ SI
Sbjct  408  RWIIDQLTGTGNLNLQIESSGYQMLMPQVSLKHAIIEDSIPLGNLVQ-----TVQQIESI  462

Query  518  LGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRL-----GSNAVTLRFLGDRTDLL  572
              +   ++P  I +    + + G+T++     L L+++       +  T+ + G     +
Sbjct  463  DTQFHYSIPTEIITQTFDKLEKGETIETENFTLTLKQVRINPDAGSNFTVEYTGLSNGQI  522

Query  573  NVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
             +   ++   P    G     +GD  +    FD     + V +  +         L+
Sbjct  523  TILGLDAEGNPFESSGGGGFWTGDKGTQNAYFDKAPAKLQVQIVKEFQEFEESLSLS  579


>MBE9582320.1 HEAT repeat domain-containing protein [Proteobacteria bacterium]
Length=658

 Score = 104 bits (257),  Expect = 2e-20, Method: Composition-based stats.
 Identities = 84/522 (16%), Positives = 165/522 (32%), Gaps = 48/522 (9%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA--YIL  168
            W ++ +R W L+G+ L+ ++L    +     L    W    +      ++   +A    +
Sbjct  20   WTIYRKRMWTLIGLGLVTVLLTILSLVPPFGLGFLLWQYMPDYKNVIMLVSILLATGSAV  79

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
               +W   +  + +     G+  + K       +   L +L  L++ G  +LLIIPG++F
Sbjct  80   WVANWGMSAFLLAVVDERCGIKEAFKKAKPKTLAHMWLGLLTGLILTGAHILLIIPGIIF  139

Query  229  CVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
             +WFFF    +  + +  G+ AL KS+  V G W+ +F R   + +    LS L A IP 
Sbjct  140  AIWFFFA-PFVFIEDDARGMNALLKSKAYVQGRWFGVFWRLAAIWL----LSVLVAAIPI  194

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGL  347
             G+   L F     PFSF+Y +L+Y DLKA        P K++   + A      ++P +
Sbjct  195  AGQVLALFF----VPFSFVYTFLVYEDLKALRGSFLFKPSKKEKAGIIATGAFGYVMPVV  250

Query  348  LLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQR  407
            L+ +           +   K   +       Q                +       +   
Sbjct  251  LVFAFMGSMCLMPFSVLTAKVTGESPFPTAMQELKNQGPGMGTTGLTVTHGAIRTNANVN  310

Query  408  KTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDD  467
            K            T  +   +   Q        + +  P +                   
Sbjct  311  KQIQLLEDKGKEWTKRSQAAFKLGQTKD-----KKAIEPLIEALSNDEHWTV------RQ  359

Query  468  ARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPL  527
                            H +   ++D+N       +  L +      V ++          
Sbjct  360  NAARSLTNLGARQAVPHLIRALESDKNVFVRSSAAKALGKLGDKSSVAALT---------  410

Query  528  AIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREI  587
                         K L+  G     +      V           L +  + +  E +   
Sbjct  411  -------------KALKDEGIVTTFKDGKGVEVK-EVANAAQQALKLLGAQAGEESIGTK  456

Query  588  GFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
            G      G     ++  +  + +     + +  T+ Y   + 
Sbjct  457  GLASAAPGPKEEKKKALEPGVSTAKN--SPEEKTKEYRKTIK  496


>MBE9544953.1 zinc-ribbon domain-containing protein [Proteobacteria bacterium]
Length=626

 Score = 104 bits (256),  Expect = 2e-20, Method: Composition-based stats.
 Identities = 55/300 (18%), Positives = 106/300 (35%), Gaps = 20/300 (7%)

Query  330  QWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPE  389
                + A I   ++I     +   +     +    + K                    P 
Sbjct  192  GLYAVAAGIILAVMIGTFTGLKFYKDRQRPQITAVSDKQKAASSALLDAGPFLELTPTPG  251

Query  390  EPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLS  449
                  + D  L  +   KT       +   +LF   +    +NP + L L+  + PN +
Sbjct  252  PVAAEYADDPHLFTTVDAKTLKPKIPRIVKQSLFPGHYLNFGENPQMTLDLDPVNIPNAA  311

Query  450  LAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGT  509
            LA+      E+  +   D +D+     +   P  +            F G  S+ ++ GT
Sbjct  312  LAE---LTYEVRSIQSPDGKDILRAAETKFKPPIN--------PGSAFPGSLSLSVKNGT  360

Query  510  QAEQVHSILGKLELTLPLAIESLQL-TRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDR  568
              E + +   +  L+LP A++  +  +  + G   + GG Q+ L RL  +   + + G R
Sbjct  361  PPESLGTARIQFMLSLPDALQIFEFKSGEEKGSLKESGGIQVTLGRLEKDVADVTYSGTR  420

Query  569  TDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFEL  628
            +  + + A +     L         S    S+   F G I+++ V+ A       YPFE+
Sbjct  421  S--VRLIAYDKTGNALASRETINSTS----SVSTRFQGMIDTLKVVSAASM--LEYPFEI  472


>PYM69976.1 hypothetical protein DME10_21885 [Candidatus Rokubacteria bacterium]
Length=605

 Score = 103 bits (255),  Expect = 2e-20, Method: Composition-based stats.
 Identities = 34/212 (16%), Positives = 67/212 (32%), Gaps = 4/212 (2%)

Query  376  QPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPH  435
                               + A   L    +R   +E    +                P 
Sbjct  355  PAAAPAAERIDTEPVAFVPTVAPASLPGYDRRAQFAEEVDQIQGPFGLRPSEIRLSAEPG  414

Query  436  LWLKLELSDFPNLSL---AQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTD  492
            + L+L +  F        A     R+ ID V     ++L   +              +T 
Sbjct  415  VGLELVVEGFAAEIPNIGASDDRVRLSIDSVKSTGGQELLRPEACGRER-NPQPAAFKTW  473

Query  493  ENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLIL  552
             +      +++ L  G     + S+ G+++L LP   E   L+    G   +  G    +
Sbjct  474  GSHRLKASKTLRLIDGADPHALQSVSGQVQLRLPTRTEVASLSHPAAGAVAERHGAVFTV  533

Query  553  QRLGSNAVTLRFLGDRTDLLNVHASNSHAEPL  584
             ++   +V+ +  G R  +L + A N+  +PL
Sbjct  534  TKVAGGSVSYQIEGARDRVLFLRALNAKGQPL  565


>NIM06119.1 hypothetical protein [Armatimonadetes bacterium]NIO97600.1 hypothetical 
protein [Armatimonadetes bacterium]
Length=606

 Score = 102 bits (251),  Expect = 8e-20, Method: Composition-based stats.
 Identities = 96/588 (16%), Positives = 181/588 (31%), Gaps = 43/588 (7%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             ++CP C   +     K        +                +          P      
Sbjct  47   QIKCPWCKTTQPLGE-KCVECGLDFKEYARKVKSEKAAGAPAQE----RPVLSPDEKKPP  101

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                D                +    P     +S   L S + L+     L     + L 
Sbjct  102  GEKPDAKPEVKSAFAKDDRTLAIKAIPRPPELSSIGELMSNAWLIYKGRLLTLLGLYLLS  161

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             + LL         F  ++L      + +    Q  I +  +   +L   W  G      
Sbjct  162  VVILLVP---IGASFGTVMLTTVGSPSAEGFVTQTIIFMFGLLIGMLAWLWGFGGFISAA  218

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
            C        ++  G R++ S   + +++   + GG +L I+PGL++ V FF  Q+++ ++
Sbjct  219  CDEMQNFKEALGSGWRNMWSLLWVYLIVGCTIQGGLILFILPGLVWTVSFFAAQFIIFEE  278

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            +  G+ A+ KSR  V GHWWA+FGR +L+++I +  S +                +LLTP
Sbjct  279  HQRGMDAMLKSRAYVKGHWWAVFGRLLLIVIILILASVVP------------LGGILLTP  326

Query  303  FSFLYYYLIYSDLKANYRGPQHPPIKRQWL-PLTAAIFGWMLIPGLLLVSLSRQNLSAEQ  361
            F  +Y  LIY +L+                           L+P  L+ +       A  
Sbjct  327  FMMIYMVLIYKELRTIKGKNVEYARSGGARLGWITIAAVGCLVPLALMTAGGVSIYKARP  386

Query  362  LLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVT  421
             L     +  +   + +        + E+      A  +  L   +     G        
Sbjct  387  NLMELPSVIMKEVQKFRIPGQAPPPIEEQASAGPEASLEPQLKLFKAGFFAGEKITVRFQ  446

Query  422  LFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHP  481
              +          + W+ L  S  P+ S A+     I    +    + DL     +   P
Sbjct  447  APSTY------AANAWVGLIPSTVPHGSEAENDQNDISKQFLGGITSGDLEF--TAPTTP  498

Query  482  AFHWVGINQTDENDLFSGIRSIYLRQGTQAEQ----VHSILGKLELTLPLAIE-------  530
              + + +  TD+        +  +  G+  +     V ++        P+ IE       
Sbjct  499  GIYDLRMFDTDDGGAEVVSITFAVEDGSSGQAQGVSVRTVKRTYAAGEPVDIEFSGLPGN  558

Query  531  -SLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHAS  577
                +T   +     + G+      + S      F G       V   
Sbjct  559  SQDWITVVPVTTPENVQGEFFTTSGVKSGI--YTFGGLEPGSYEVRVY  604


>NNK01369.1 hypothetical protein [Desulfatitalea sp.]
Length=315

 Score = 98.7 bits (242),  Expect = 1e-19, Method: Composition-based stats.
 Identities = 47/305 (15%), Positives = 95/305 (31%), Gaps = 26/305 (9%)

Query  347  LLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQ  406
             L   +    L     +      Q           D +  L +        D        
Sbjct  11   ALDTLVYELQLDTPLPMQGEGFSQAVPPHASYTVQDWDEVLDDAKAFQPKTDQDEYF--D  68

Query  407  RKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDD  466
                +  G +   + +       D       L++ +     L       A + +  +   
Sbjct  69   PVARTTAGPARLSLFMPKVSRRFDGTVFSSRLQMRMPYVAALDNRVN-RAVLVVRALGFR  127

Query  467  DARDL-YDRQHSFEHPAFHWVGINQTDENDLFSGIR-------SIYLRQGT---QAEQVH  515
            D + L ++R+  F+  +          E  L   +        +I +       + +Q+ 
Sbjct  128  DGQRLTHERRTPFQLGSSGGFKDVSRTEAQLRRSLHLRGDIPLTIRVPDDAGIHEHQQLQ  187

Query  516  SILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVH  575
             + G +E  LP +I S+ +    +GK +     +L +  + +  +TL   GD  DL++V 
Sbjct  188  EMSGAVEFHLPQSIISIPMADLTLGKRVHASDFELQVIEIAAGHLTLSVDGDYADLVDVL  247

Query  576  ASNSHAEPLREI------------GFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQS  623
              N   E +                   + S     +   F+G    + ++V+  S   S
Sbjct  248  ILNPSGETIAAGPAFDREKNRLGVSDDEKASRPRAVVNMRFNGTPAELRLVVSEKSRMLS  307

Query  624  YPFEL  628
            YPF L
Sbjct  308  YPFVL  312


>MBI5624999.1 hypothetical protein [Elusimicrobia bacterium]
Length=557

 Score = 100 bits (246),  Expect = 3e-19, Method: Composition-based stats.
 Identities = 89/448 (20%), Positives = 150/448 (33%), Gaps = 28/448 (6%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
              +  W+ F  R   LL + ++ +VL                          A L    A
Sbjct  21   HFSRGWDYFKSRLATLLFVSIIALVLIILAAVIPPACGFLLGRLAPGLKGAAASLGVLAA  80

Query  166  YILLGLS--WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
             I +  S  W T +    +   D G           V +   + +L  L+V GG +LLI+
Sbjct  81   VIAVLWSLSWATAATCAAVVYKDAGAKECYSRSRPRVFALLWVHLLAGLLVMGGYVLLIV  140

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            PG+L  VWFFF  +V   +++GG  AL KSR  V G W+++  R +   ++ + +     
Sbjct  141  PGILLTVWFFFAPFVCIAEDLGGFDALLKSREYVRGRWFSVAWRLLTAWLVIVLI----Q  196

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWML  343
             IP VG+  +        P  FL  +L+Y DLKA        P  +  L + A     ++
Sbjct  197  VIPVVGQLLSFIL----WPLPFLMSFLLYQDLKALRGDLPFQPTFKAKLGIAAPGLAGLV  252

Query  344  IPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYK---  400
               +LLV++    +   +         ++      +      S P         D     
Sbjct  253  AVPILLVTMMGGTVLMMRQKMLQGMGARKPFASGSKAGAAAVSSPAPNAVRRPVDPAVLD  312

Query  401  --LLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARI  458
              ++  +       G       T          Q    +   E   FP L   +     +
Sbjct  313  GLVIPDQPAAGQIRGADFRVERTEAMGSIIHLSQGKEFFADAEFIIFPFLGDGET----L  368

Query  459  EIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSIL  518
                +   D   +                     + ++F    ++ L  G   E+   I 
Sbjct  369  AGRTIRYPDPGRMSGPH----IHVARKPAPESMPKTEMFMSDYAMLLVFG--PEKAGKIA  422

Query  519  GKLELTLPLAIESL---QLTRNDIGKTL  543
            GK+ L +P A +S           GK L
Sbjct  423  GKIFLRVPDASQSEVKGTFELASDGKEL  450


>MBI5641685.1 hypothetical protein [Nitrospirae bacterium]
Length=541

 Score = 99.8 bits (245),  Expect = 4e-19, Method: Composition-based stats.
 Identities = 79/385 (21%), Positives = 140/385 (36%), Gaps = 18/385 (5%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAIL--LATV  164
              +SWE++ RR   L+ +YL       A I     +     L  +     +     +   
Sbjct  70   FRNSWEVYKRRIGPLIALYLFSFFCFIAAIAVFAGIGFLFSLPFEGSKGAFIAAGAVIGT  129

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
               ++ L W   ++   +    +G+  S+  G   + +F     +   ++ GG LL   P
Sbjct  130  LAGMIILFWGLAAVVFAVVDEGLGIRESLARGWHRIWAFMWFFSIAGFIITGGFLLFFFP  189

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            GL+F VWF F Q++LA ++  G+ AL KSR  V G+W+ +F R  L+ +     S     
Sbjct  190  GLIFLVWFAFGQFILASEDERGMNALLKSREYVKGYWFDVFLRLFLVWL----ASLAVGI  245

Query  285  IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLI  344
            IP++G   +L F     PF  ++ +L+Y DLK+      +     + L    A     L+
Sbjct  246  IPFIGPIFSLLF----MPFVMIFTFLLYEDLKSRKGAVTYTSTAGEKLKWLGAGTLGYLL  301

Query  345  PGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLS  404
              LLL++    +L+   LL  G               D    +P    +         L 
Sbjct  302  IPLLLLAFLGASLTIPFLLLKGMLSSTGREMILSP--DKWPQVPGMVPQTPGGQELYTLP  359

Query  405  KQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVL  464
            +Q +  + G    G      D+         L L      +         +      +  
Sbjct  360  QQGQAVAPGQGEPGSFPQMEDKKPDAAGEATLLLDGISETY------ILKTGFFSDTRFK  413

Query  465  DDDARDLYDRQHSFEHPAFHWVGIN  489
            D     +  +    EH     + + 
Sbjct  414  DPTHATIQFQAPGAEHSNARRIELT  438


>WP_150074101.1 hypothetical protein [Rhodopirellula sp. JC645]KAA5546994.1 hypothetical 
protein FYK55_00825 [Rhodopirellula sp. JC645]
Length=559

 Score = 99.1 bits (243),  Expect = 8e-19, Method: Composition-based stats.
 Identities = 48/329 (15%), Positives = 96/329 (29%), Gaps = 30/329 (9%)

Query  315  LKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLG  374
               +        + R        +        +                SAG  +   L 
Sbjct  111  WTVHRTPETANALTRPISIELRTVSRIQAFETICRELGLTPRYPDPMRSSAGGALVAGLI  170

Query  375  TQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNP  434
               +Q     ++ P + Q   +A      +++      G   +    L  +   A     
Sbjct  171  AAGEQLDPALKTRPADNQAADNAVTFETGARKLPVAFAGPCFIEVTKLVENPPHATG---  227

Query  435  HLWLKLELSDFPN--LSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTD  492
             L L +     P   L++ Q   A  +I  V  +   DL    ++     F W+      
Sbjct  228  RLDLTVHAPGLPPSVLAIDQPSLAFGKIRSVQSNRETDLLTDINTGAMVNFPWL------  281

Query  493  ENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLIL  552
                    R I L+   Q     S+ G  EL +P  + +L+  + + G +   G  +  L
Sbjct  282  -------SRQIPLKGLLQNVDSFSVSGSCELWVPTEVVTLEFDQLEEGASQTAGELKAEL  334

Query  553  QRLGSNA------------VTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSL  600
              +   +            + L++ G       +   +     L   G +   +G +   
Sbjct  335  IAVRPMSKRANGEERKGFRIQLKYTGSDAGKPKLLGVDDAGSLLSADGKSVFWAGRSGHA  394

Query  601  RQMFDGNIESITVLVAGDSMTQSYPFELT  629
                 G+   + V     + T  YPF++ 
Sbjct  395  SFTMWGDPARLVVKFPRQTETLRYPFQIN  423


>WP_198264656.1 hypothetical protein [sulfur-oxidizing endosymbiont of Gigantopelta 
aegis]
Length=303

 Score = 95.6 bits (234),  Expect = 8e-19, Method: Composition-based stats.
 Identities = 44/277 (16%), Positives = 87/277 (31%), Gaps = 13/277 (5%)

Query  360  EQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGP  419
                   +            +        +      +          +K   +       
Sbjct  31   AFKPKTEEPSLVSYNPGMLDSEPNISKSYKFKPLNKAIQNDNPAWLGKKWPGDKTNWSND  90

Query  420  VTLFADRFWADDQNPHLWLKLELSDF---PNLSLAQKGSARIEIDKVLDDDARDLYDRQH  476
               + + F  D        +   + F    NL+     +       +L+ +   L     
Sbjct  91   FPFYINVFIQDANQKQSSSENRDAVFSIKTNLTPFITQNITAIKLSLLERNKSQL-----  145

Query  477  SFEHPAFHWVGINQTDEND---LFSGIRSIYLRQGTQAEQVHS-ILGKLELTLPLAIESL  532
              +  AFH   ++   E D     S  ++  L +  QA +  S + G+L L LP   +S 
Sbjct  146  -DQFIAFHQQAMDAKTEGDDKFYLSVNKAFKLSKDRQANKHSSHLHGELTLYLPKTFKSH  204

Query  533  QLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQ  592
                  +G+T+ +G   +   RL    +T    G+   L+ +   N   E + E+     
Sbjct  205  LEPYTSLGQTIDLGELSIKSIRLDGKQITFEIRGELQKLVQLKLYNKADELISEVFELQH  264

Query  593  KSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
               +   L   + GNI+   V++A     + YPF  +
Sbjct  265  IENNKAHLSLAYQGNIDKFKVVMAQSLAEKKYPFSFS  301


>CAB1070897.1 hypothetical protein D1AOALGA4SA_1060 [Olavius algarvensis Delta 
1 endosymbiont]
Length=1201

 Score = 99.5 bits (244),  Expect = 1e-18, Method: Composition-based stats.
 Identities = 54/276 (20%), Positives = 96/276 (35%), Gaps = 20/276 (7%)

Query  354  RQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEG  413
              + S  +  +    +      Q           P       +A+  ++ +         
Sbjct  543  EFSWSKTEDEAFLTALTAATIGQLFAASMELTPTPGAVTTRYAAEPDVVAAVDSDQLRAE  602

Query  414  GLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYD  473
              +L   +LF   +W     P + L L+  D PN  LA+      E+  V   D RD+  
Sbjct  603  IPALIKNSLFPGYYWNQGDQPQMTLDLDTIDIPNGMLAEM---TYEVKSVESPDGRDIRR  659

Query  474  RQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQ  533
               +   P              L  G  S+ ++  T  E +   +    LT+P+A+E L+
Sbjct  660  VDENKFKPRIQ--------PGSLIPGNISLSVKAETPPEDLAKAVMNFHLTVPVALEVLE  711

Query  534  LTRND-IGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQ  592
             +  D  G      G ++ L RL  +   +   G ++  L   A +   + L        
Sbjct  712  FSEADQPGSVKVADGVRVTLGRLEKDVAQVSSSGGKSMRL--IAYDQTGKALASRESMST  769

Query  593  KSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFEL  628
            +S    S+   FDG I  + V+V        YPF++
Sbjct  770  QS----SIATRFDGIITRLKVVVTRKM--LDYPFDI  799


>MBC8870065.1 hypothetical protein [Planctomycetes bacterium]
Length=352

 Score = 95.6 bits (234),  Expect = 2e-18, Method: Composition-based stats.
 Identities = 24/185 (13%), Positives = 63/185 (34%), Gaps = 13/185 (7%)

Query  449  SLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQG  508
            +  +  +    + +V     +++ D              +       L   +R++ L+  
Sbjct  52   NDEEDDTLTFRLLEVTGTGGQNVVDETSFG---------MVTQATQKLIMLVRTVELKNL  102

Query  509  -TQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSN---AVTLRF  564
                 ++  + G+L   LP  + +L+ +      T    G  + +  +       + + F
Sbjct  103  IRSVAEIEQLNGELSFALPTKMTTLKFSDVTKDATASAEGISMKIGSVNPGEMSTIEVEF  162

Query  565  LGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSY  624
             G  +D + +   ++  + +   G      G   +     +G   S+   V  ++     
Sbjct  163  EGIDSDHVTIVPLDADGQTMEVNGGGSSDFGGKGTRSFFVEGTPASLEARVIRETERVRV  222

Query  625  PFELT  629
            PFEL+
Sbjct  223  PFELS  227


>HAD58437.1 hypothetical protein [Planctomycetaceae bacterium]
Length=713

 Score = 95.2 bits (233),  Expect = 2e-17, Method: Composition-based stats.
 Identities = 35/311 (11%), Positives = 93/311 (30%), Gaps = 14/311 (5%)

Query  328  KRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSL  387
            +     ++      + + G +      +      L  A +               + R+ 
Sbjct  257  QEWRQSISLQDTPAIEVLGQIASRAGLKIHDQPHLEEALRAKITIEAEDQSAVQLIERAA  316

Query  388  PEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPN  447
             +           +   +  +         GP  +  D        P    ++EL  F  
Sbjct  317  AQIGLYPKYKLKTVAFYQGPRPW--PAAFAGPFIIVVDNLDVR--VPWPVARMELQFFAA  372

Query  448  LSLAQKGSARIEIDKVLDDDARDLYDRQHS------FEHPAFHWVGINQTDENDLFSGIR  501
                   S  ++++ + ++DA + +           F+    + +G  +           
Sbjct  373  GLPEPMISQVLQLNSLSEEDAPNTFTAVTDEIQGGSFQLHEVNTIGYGRRASFKTLMFSH  432

Query  502  SIYLRQG-TQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQR---LGS  557
              +L       +Q   + G++    P+   +++    + G T +     L +       +
Sbjct  433  VYHLHNLLRSVQQSKPVTGRISWPFPVESRNIKFGGVNKGDTAESDELSLTISDSILAET  492

Query  558  NAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAG  617
            + +++   G   D + +   +++  PL     +   +     L    +G   S+  +   
Sbjct  493  SRLSVDINGASIDDITLIGRDANDLPLANNFSSGLTTEKRTVLHVTIEGEPTSLEAVQTT  552

Query  618  DSMTQSYPFEL  628
             S   +YPF L
Sbjct  553  KSDRVTYPFLL  563


>MBI2670271.1 hypothetical protein [Candidatus Yanofskybacteria bacterium]
Length=670

 Score = 94.1 bits (230),  Expect = 4e-17, Method: Composition-based stats.
 Identities = 62/323 (19%), Positives = 115/323 (36%), Gaps = 17/323 (5%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATW-----LNPQNQNWQWAILLATVA  165
            W L+ +R W L+ I L+  +     +  + L+    +                 L A V 
Sbjct  82   WALYKQRFWTLIAIALIPTLAIIPAVIGSTLIYAFIFSKIIAGGSLLLILIMFALAALVM  141

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
            +++           I   K ++G   S + G   + S   + IL+  V  GG LL IIPG
Sbjct  142  FLVQFWGQAALMFAIVNNKEEIGFIESYRKGWHKILSLWWVTILMAFVTLGGYLLAIIPG  201

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            ++F VWF     VL  +N  G+ AL KSR  V G WW + GR + + +    +  ++  +
Sbjct  202  IIFTVWFSLSALVLIAENSRGMDALLKSREYVRGRWWTVLGRNIFIGLFYFVVFLISTLL  261

Query  286  -------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAI  338
                   P V +  N    + L P + +Y +L+Y +L++        P K+         
Sbjct  262  FLFVFKVPLVSQVVNFIIIITLAPLALIYMFLMYQNLRSLRGEVDLAPTKKSRTTFI---  318

Query  339  FGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSAD  398
               + + G++ +      +      +  +  +               +      +    +
Sbjct  319  --IIAVLGIVAIFALLFWVVFPTFSTLRQLSKTLTNVGDYNPTVSPGTTEMADWQTYRNE  376

Query  399  YKLLLSKQRKTTSEGGLSLGPVT  421
                  K         L+ G   
Sbjct  377  EYGFKVKYPSGWITDDLTPGKYY  399


>MBC8871457.1 hypothetical protein [Planctomycetes bacterium]
Length=680

 Score = 92.1 bits (225),  Expect = 2e-16, Method: Composition-based stats.
 Identities = 31/244 (13%), Positives = 77/244 (32%), Gaps = 5/244 (2%)

Query  385  RSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSD  444
                 +P+    +       + + TT        P  +        +           + 
Sbjct  135  YPSYGKPEPSEDSFDFGFDEEPQVTTVTLKPGDRPYPVAFAGPLMVEVQEVERFPPRATG  194

Query  445  FPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIY  504
               L++       +  D + D   +D+   + +       W     +  +     +    
Sbjct  195  TLKLAIHAFALPPVVKDLLADQGEKDVTVEEVTGPDGHDLWDDERFSGTSWSLPLVPLKN  254

Query  505  LRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNA---VT  561
            L +G     + ++ G L++++P     LQL +   G + + G  Q+ L  +       ++
Sbjct  255  LFRGL--GTIGTVRGSLQISMPTNTAELQLAKLQPGTSEKSGDVQVTLASMDEQESCDLS  312

Query  562  LRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMT  621
            +  L    D +     ++  +PL   G +    G+   L    DG    +   +A  + +
Sbjct  313  IYVLNLGADEITWEPQDAQGQPLDIGGTSSFSLGNRAELELSVDGRPAVLVAKIADGAKS  372

Query  622  QSYP  625
              + 
Sbjct  373  VRFD  376


>NIY14984.1 hypothetical protein [Nitrospinaceae bacterium]
Length=334

 Score = 89.4 bits (218),  Expect = 2e-16, Method: Composition-based stats.
 Identities = 40/302 (13%), Positives = 89/302 (29%), Gaps = 13/302 (4%)

Query  332  LPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEP  391
                       LI    +   +          S   +        P  T     S     
Sbjct  43   QNFKGQPEAAELIIAGEIEEETFPFDFKFSRPSPPPNQGLWEVNVPTMTLAAYESQRLPK  102

Query  392  QRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLA  451
              L     K                L        +   D    H+ L +     P L   
Sbjct  103  PLLKYCKQKKKPHAITGNVGLCLSDLEIKYRHFGKNQGDYVQTHISLDVPHPHPPALDW-  161

Query  452  QKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQA  511
                A++ I K+   + +    R    +   F  +     + +     ++          
Sbjct  162  NLSGAKVVITKINFKNEKTGKTRSQPVQIEEFINLNSTGVNLDSEIMEMKD---------  212

Query  512  EQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIG-GKQLILQRLGSNAVTLRFLGDRTD  570
            E+   I G L++ LP  +  L L  +++G   +   G ++ L    S+       G R  
Sbjct  213  EKATGIEGYLQINLPAKLSKLTLDLSELGNQAENDAGLKVKLTGFSSDGAKFDVTGPREK  272

Query  571  LLNVHASNSHAEPLREIGFT--WQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFEL  628
            ++     +   + L++   +  W K  +++  +     + +++ ++ A     + YPF++
Sbjct  273  VVQYQPRDRQGKALKQSFPSIRWSKKKESWWGKVAIPAHAKTLDIVYATRQEQKRYPFKI  332

Query  629  TR  630
             +
Sbjct  333  RK  334


>WP_167192080.1 hypothetical protein [Aestuariicella hydrocarbonica]
Length=465

 Score = 91.0 bits (222),  Expect = 2e-16, Method: Composition-based stats.
 Identities = 44/310 (14%), Positives = 95/310 (31%), Gaps = 14/310 (5%)

Query  331  WLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEE  390
                    F +     L       Q     Q              +  +    N +   E
Sbjct  154  KQQPLEYRFRFNQSLPLQKSQSYAQQPQLPQPYHLDDWSNVLELAKSLEPEQKNSAWLGE  213

Query  391  PQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSL  450
            P      +  LL     + T   G       +  D  +    + +    +  ++  NL  
Sbjct  214  PLVRQRTEVGLLSLFPSEVTRGFGGYALRGYVQLDMPYVQALDGYQGRAVFSANQLNLVN  273

Query  451  AQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQ  510
                    E + V++      +   +   +     +      + +    I      +  +
Sbjct  274  HDPVKVSYEQNLVMNFKG---FGNNYKNSNKTEAELRQELYLQGNQQMSIPLGEAGESLK  330

Query  511  AEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTD  570
              Q+HSI G+L+L LP  I +       +G  L   G  L +  L    + L   GD + 
Sbjct  331  GAQLHSIEGELQLKLPKNIAAQMFDAVTLGHRLSSEGLTLKVIELAVGGIKLSAEGDTSK  390

Query  571  LLNVHASNSHAEPLREIGFTWQKSGDAFSLR-----------QMFDGNIESITVLVAGDS  619
            ++++   +   + +       ++     +               F+G + ++ V++  D+
Sbjct  391  VMDILVLDKEGKQIGHGASFSEEMRSGMNSSGDQQPPRLVSDLRFNGELAALQVVMVTDT  450

Query  620  MTQSYPFELT  629
            +TQ YP  L+
Sbjct  451  VTQRYPVTLS  460


>MBI3648016.1 hypothetical protein [Actinobacteria bacterium]
Length=257

 Score = 87.5 bits (213),  Expect = 3e-16, Method: Composition-based stats.
 Identities = 38/215 (18%), Positives = 78/215 (36%), Gaps = 5/215 (2%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            + L+ +   GL+ I  +  V      F  + +   T    Q         L TV      
Sbjct  28   FTLYKQHWQGLVKIVAIVAVPLTFLQFLLVDIAFRTGGFGQAAVGGAIAGLFTVVIYSAF  87

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
               +T +    +    + +  S + G+  +G    + +L  L V  G  +  +  ++  +
Sbjct  88   AGAITRAAAGSVVGMPISVEESYRYGMARLGPIIWVAVLTGLAVFAGFFVFFVGAIIAAI  147

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP----  286
                   VL  +   G  AL +S  L +GH+W + G  ++  +IS  +  +         
Sbjct  148  KLAVGVPVLVVEGKRGAAALSRSWELTTGHFWHVLGTVIVAGLISSIVGQVLQAPFSGAG  207

Query  287  -YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
             +          ++  PF+ +   L+Y D++A   
Sbjct  208  WFGLAIGASIAQIVTAPFTAIVSILVYLDVRARKE  242


>NJD55990.1 hypothetical protein [Nitrospirae bacterium]
Length=585

 Score = 91.0 bits (222),  Expect = 3e-16, Method: Composition-based stats.
 Identities = 74/379 (20%), Positives = 135/379 (36%), Gaps = 12/379 (3%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQN--QNWQWAILLATV  164
             + SWE++ RR   L+ +YL+ I+L   P+   L    A  +              L   
Sbjct  60   FSVSWEIYRRRFGPLVALYLISILLFILPLAILLGGGFALGMLLPAAKYVLIGIGGLLGF  119

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
                + + W T  + + I    +G+  +++ G   V SF  L  +L  VV GG LL  IP
Sbjct  120  FISCITVFWGTAGLTLAITDESLGVSSALRRGWAKVWSFIWLFSILGFVVTGGFLLFFIP  179

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            G+LF VWF F Q+VLA+D+  G+ A+ KSR  V G+   +F R  L+ +I+         
Sbjct  180  GMLFLVWFSFAQFVLAEDDERGMNAMLKSREYVRGYGGDVFLRMFLIWIIAAVA----GM  235

Query  285  IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLI  344
            +P++G       S+   P+  +Y Y IY+DLK             +            ++
Sbjct  236  VPFLGPIL----SIAFFPYVLIYLYRIYADLKGMKTMHAFSASTGEKAKWILVSLLGYIV  291

Query  345  PGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSA--DYKLL  402
              +++++L         L          + +           +              + +
Sbjct  292  VPVMIIALVGAACLGPLLALRAFQAAPGILSPFLGRQQNPDVVRPANFSDLLGTWTGREI  351

Query  403  LSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDK  462
              +   T          V   +  ++    + H  L +          A      +    
Sbjct  352  NREAGWTFLFSEGHNVQVVSPSGEWYRGKASIHFDLGIADGAIRVPPGAGILDIDVTESS  411

Query  463  VLDDDARDLYDRQHSFEHP  481
              D   +        +++ 
Sbjct  412  SGDYAGKTSLGAYSIYDNA  430


>MBI5187560.1 pilus assembly protein PilP [Nitrospirae bacterium]
Length=439

 Score = 90.2 bits (220),  Expect = 3e-16, Method: Composition-based stats.
 Identities = 69/353 (20%), Positives = 128/353 (36%), Gaps = 24/353 (7%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W ++  R    LGI L+  ++    I + +++                 ++ +   +   
Sbjct  21   WRIYKSRLGTFLGIMLISNIVPVLLIGALIVVGLFLGFKLPPPGSLNIGIILSFLIVFSF  80

Query  171  ------LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
                     +     I      +G+  S + G   + SF  +  LL   +  G  +LI+P
Sbjct  81   IMIVYTWGVIALIYAIKDSDEGIGIKESYRKGWHKILSFWWVSFLLAF-ILIGGFILIVP  139

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL------  278
            G++F +WF    +VL  ++I G+ AL KSR  V G W ++F RF+ +++IS+ +      
Sbjct  140  GIIFAIWFSLSVFVLIAEDIKGMDALLKSREYVKGRWGSVFWRFLFIVLISMIIMIILSV  199

Query  279  -SFLTAR--IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLT  335
             + +     IP+VG+  N    + LTP   +Y +L+Y DLK+        P  R      
Sbjct  200  PAIIFGLLRIPFVGDIINSIIPIFLTPLVMMYSFLVYRDLKSLKGEFPFTPSGRTKAGFI  259

Query  336  AAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLS  395
                  + I   + + L+       +L    +           + P   +  P E ++  
Sbjct  260  FTGIMGLFIIITVPLFLAYTGYIQTRLGKKEQ--------AAAKEPVAEKVQPAETKKQI  311

Query  396  SADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNL  448
             A  +    +Q     E      P                    +E  D   L
Sbjct  312  KAPEETKKVEQEVYAYEAKGKRDPFLSLVIVSREKPIKKKGVSPVENYDVEEL  364


>RLC34648.1 hypothetical protein DRZ76_02340 [Candidatus Nealsonbacteria 
bacterium]
Length=312

 Score = 88.3 bits (215),  Expect = 3e-16, Method: Composition-based stats.
 Identities = 57/222 (26%), Positives = 106/222 (48%), Gaps = 3/222 (1%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
              D +            +  +G+++ +  IF+ ++ K +      +  +   + L     
Sbjct  66   YKDRFWTLVGIMLPPFLLGWVGMIVWWLLIFAGMVTKISLENI-VSLLFLVLLGLVFFVI  124

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
             +    W   ++   I + D+G+  + + G   + S+  + +L  L+V G  LL  IPG+
Sbjct  125  FIAAGLWSQVALLCAIKEKDIGIKEAFRKGWHKIISYWWISVLSTLLVLGAFLLFFIPGI  184

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            +  +WF    YV   ++  G+QAL +S+ LV+G WW +F R VL+L+I +  + +   IP
Sbjct  185  ILAIWFSLVFYVFIMEDKKGMQALLRSKQLVAGKWWTVFWRKVLILLIIMGANLIIGLIP  244

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
            +VG   +    LL TPF  ++ +L+Y DLK   +     P K
Sbjct  245  FVGR--SGVAGLLTTPFLHVFGFLLYQDLKRFKKDSYFEPPK  284


>MBI2062359.1 hypothetical protein [Candidatus Yanofskybacteria bacterium]
Length=484

 Score = 90.2 bits (220),  Expect = 3e-16, Method: Composition-based stats.
 Identities = 59/363 (16%), Positives = 118/363 (33%), Gaps = 8/363 (2%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            + +F  R      I LL ++ A       +        +         + +      ++ 
Sbjct  70   FVVFNARRKLAFKIVLLPVIAAVLLSLIVVYAASFLAGSIAGAIIIVLLTVILAVTSVIL  129

Query  171  LSWMTGSMFIYICKTDV-GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             +    S+       +V      +KLGL+    +  ++ L   +V GG +LLI+PG++F 
Sbjct  130  SAIAFISLAYLFNGKEVFTFSEYIKLGLKKFWPYAWIIFLTSFLVLGGYMLLIVPGIMFS  189

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY--  287
            +WF    Y L  ++  G+ AL +S+ LVSG +W    RF+ + +++L +      I Y  
Sbjct  190  IWFGLAFYALIIEDRKGMGALLRSKHLVSGKFWKTAWRFLFIWLVALLILAPFWVIDYLI  249

Query  288  ----VGEAANLAFSLLLTPFSFLYYYLIYSDLKANY-RGPQHPPIKRQWLPLTAAIFGWM  342
                +     L   ++  P    YY + + +L         +     +   +   +   +
Sbjct  250  KSVDLSWLTRLVEYVVTLPLGVAYYTIFFKNLVQVKTEPFVYDKKSARNFIIVGFVGVLL  309

Query  343  LIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLL  402
            ++  L              LL           T            P        ++    
Sbjct  310  MLALLAFGGYYYFTKLRPLLLVPATTSTADWKTYTNPQYGFEFKYPNYLTTKEFSEVIDN  369

Query  403  LSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDK  462
                R  T +G  S   + L  D       +P + +   +         +      +I  
Sbjct  370  QQSGRGVTLDGLSSNTRMGLSIDNTVYVGGSPEMTISSTMIQGGAFEWHKTKGYAYQILP  429

Query  463  VLD  465
              D
Sbjct  430  PRD  432


>KKT50197.1 hypothetical protein UW40_C0008G0005 [Parcubacteria group bacterium 
GW2011_GWF2_44_17]OGY72264.1 hypothetical protein A3H61_01480 
[Candidatus Jacksonbacteria bacterium RIFCSPLOWO2_02_FULL_44_20]OGY73786.1 
hypothetical protein A3H07_02575 [Candidatus 
Jacksonbacteria bacterium RIFCSPLOWO2_12_FULL_44_15b]HCE86843.1 
hypothetical protein [Candidatus Jacksonbacteria 
bacterium]
Length=322

 Score = 88.3 bits (215),  Expect = 4e-16, Method: Composition-based stats.
 Identities = 48/282 (17%), Positives = 97/282 (34%), Gaps = 4/282 (1%)

Query  98   SGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQW  157
              +            L       +  +  + IV     + +A     A            
Sbjct  41   WNVYKGQWKSYAGLVLLILASMLVPLMAFIFIVFGGLFLKNAAPNFSAIMGGLSAVTVTI  100

Query  158  AILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG  217
             +LL     I + L   TG +          +    K       S   + IL   ++  G
Sbjct  101  GVLLFIPFIIWILLLAYTGVLLPGFADERARIGTVFKFTWSKFRSLLWVAILYSFIIAVG  160

Query  218  SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
             LL IIPG+++ +++    Y  A ++  G+ AL +S+ LV G+ W I  R+    +    
Sbjct  161  ILLFIIPGIIWSIYYCMAFYACAFEDKRGMDALRRSKELVKGYVWTIMRRW-GFWMYFAL  219

Query  278  LSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG---PQHPPIKRQWLPL  334
            LS +   IP  G   ++    ++TPF ++YY+ +Y  ++              +++   +
Sbjct  220  LSGIVGTIPLFGWIWSIIALFIITPFQYVYYFSVYLTIRERKNKGFATDVATGRQKMNTI  279

Query  335  TAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQ  376
                    L   L+++ ++ +           +D        
Sbjct  280  LLLFGPVFLFTILVIIGIASKTSDNSNAFEEFEDKNITGEEA  321


>MBI2677778.1 hypothetical protein [Candidatus Koribacter versatilis]
Length=267

 Score = 86.7 bits (211),  Expect = 5e-16, Method: Composition-based stats.
 Identities = 31/241 (13%), Positives = 66/241 (27%), Gaps = 18/241 (7%)

Query  402  LLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEID  461
                                   D            L+  +   P  S A  G    E+ 
Sbjct  13   FCVTSFAQLPTRAQLTAKPKFTEDVPHTFKDFTEAELREAIKIVPARSYAMFGYNTPEVR  72

Query  462  KVLDDDARDLYDRQHSFEHPAFHWVGIN-------QTDENDLFSGIRSIYLRQGTQ-AEQ  513
              L   +  +Y      +       G             +  FS      +   +    Q
Sbjct  73   LQLPKVSNSVYTTIDFPDPVVLDDAGKPVAVEIERGGYNDARFSDEIRFRVPDNSDAIVQ  132

Query  514  VHSILGKLELTLPLAIESLQLTRNDIGKT---LQIGGKQLILQRLGSNAVTLRFLGDRTD  570
                 G +++  PL++ +        G     ++I G  + L     +   L+     + 
Sbjct  133  FAHATGTVKVKYPLSVTTQTFAPLKPGPKEFAVKINGPFVNL-----DQEKLQVPDVSSK  187

Query  571  LLNVHASNSHAEPLREIGFTWQKSGDAFSL--RQMFDGNIESITVLVAGDSMTQSYPFEL  628
            L  + A ++    L    ++     DA +   +  F GN+  + V    + +    P+++
Sbjct  188  LHPIRAYDASGHQLESASYSETGYDDAGAYVTKMAFYGNVAKLEVDSVAEWVELELPYDV  247

Query  629  T  629
             
Sbjct  248  K  248


>MAG94652.1 hypothetical protein [Planctomycetaceae bacterium]
Length=314

 Score = 87.5 bits (213),  Expect = 5e-16, Method: Composition-based stats.
 Identities = 54/325 (17%), Positives = 101/325 (31%), Gaps = 22/325 (7%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                CPHC     TP  K      SA+CP C + +    +                  + 
Sbjct  3    IEFNCPHCDKLLRTPDDK---AGQSAKCPGCGEPITVP-SAPTLPADDFGDVESGLPPIS  58

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRR----  117
                     +  + +         C +              I  ++ D+W +F       
Sbjct  59   GGAAEKPCPMCGEQIKAAAIRCRHCGEDLGGAPELRPTTIEIGVVIGDAWRVFGANLGIA  118

Query  118  ---GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
                + ++ + L+ +V  FA      + +         Q     +      +    L   
Sbjct  119  VGATFLVVILTLVALVPYFALAIMMEMDQQQGAAPDPVQVAAMVVCYLFGIFFQYFLYLG  178

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
               +F+ IC+        +  G R+         L IL V  G  L I+PG+   + F  
Sbjct  179  QHRLFLNICQGTSPSIGDLFSGGRYFLRMLGNGFLFILAVYAGLALCIVPGVYVALMFSP  238

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
              +VL D ++ GL  L +++ L  G+   +F   ++   +++            G  A  
Sbjct  239  FLWVLVDTDVRGLGPLARAKSLTEGNRGTLFVLALVQFGVAIA-----------GALACY  287

Query  295  AFSLLLTPFSFLYYYLIYSDLKANY  319
               L   PF+ L   + Y  +    
Sbjct  288  VGLLFAIPFAMLIQAVAYMQMSGQR  312


>OGD63017.1 hypothetical protein A2160_05215 [Candidatus Beckwithbacteria 
bacterium RBG_13_42_9]
Length=378

 Score = 88.3 bits (215),  Expect = 6e-16, Method: Composition-based stats.
 Identities = 47/279 (17%), Positives = 101/279 (36%), Gaps = 13/279 (5%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
                  +     ++   S +   M + +    +G     K     +G + L  +L++L++
Sbjct  72   ILIVAPIVVAFIVISIWSGVALIMSLKLRSQSLGWKGCYKAAWPLLGKYFLNSLLVMLII  131

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
             G S+LLIIPG+++ VW     YV+  +   G+ A+++S+ LVSG+WWAI GR ++L ++
Sbjct  132  WGASILLIIPGIIWGVWLSMAIYVVLVEGKTGMAAMKRSKYLVSGYWWAIVGRSIVLGLM  191

Query  275  SLTLSFLTARIP-------------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
            +L +  + A +               + +  N+ F      F   + YLIY +L      
Sbjct  192  ALGILIVLAIVNAILKGILGSDIAAVLYQLINIVFQTGFGLFFLTFAYLIYENLVRIKGD  251

Query  322  PQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTP  381
             +       ++     I   +    + ++ ++             K              
Sbjct  252  GEAKSSASYYILAILGIVLLVGSIIVAVLLVALNPAKQMSKAQDAKTRADLTSIATALEI  311

Query  382  DLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPV  420
                +              L  +   + + +       +
Sbjct  312  YKAENQAYPATLKDLEPNYLPKAPTTEFSYQATGETYRL  350


>PYJ43452.1 hypothetical protein DME80_08840 [Verrucomicrobia bacterium]
Length=243

 Score = 85.2 bits (207),  Expect = 1e-15, Method: Composition-based stats.
 Identities = 45/213 (21%), Positives = 81/213 (38%), Gaps = 11/213 (5%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
             I L +      + L          +         +V   L+    +T ++         
Sbjct  27   VIFLTYFLPVFPVNLCVTEAQIAGAKGLYLLGTFVSVFVSLIACGAITIAISDMCLGNKP  86

Query  188  GLFRSMKLGL-RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
             + RS      + +G   +  +L  L V  G +LLI+PG++  VW      V+  + +GG
Sbjct  87   SVARSYGKIFGKMLGKLFVANLLQTLFVLIGLILLIVPGVIAAVWLLLTPSVVILEGLGG  146

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI----------PYVGEAANLAF  296
            + AL++S+ L  G +W  FG F+L++VI + L  +   I           + G    +  
Sbjct  147  MNALKRSKALGQGFYWRNFGVFLLVMVICVVLGGILGAIFGAIFGAMLGNFGGRLVYVLA  206

Query  297  SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
              L  P S +   L+Y DL+          +  
Sbjct  207  QTLSAPLSLVMVVLLYYDLRVRKEAYDAAALAE  239


>MBI4273046.1 hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=794

 Score = 89.4 bits (218),  Expect = 1e-15, Method: Composition-based stats.
 Identities = 67/457 (15%), Positives = 140/457 (31%), Gaps = 36/457 (8%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQ----NWQWAILLA  162
              ++W LFC R   + G+  +  +L    I + +    AT                 L +
Sbjct  86   FIETWRLFCDRFKSIAGLSFICFLLCLFGIVTIVGWLIATLYFSFPFLILLMISLCALTS  145

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             +   ++       +      +   G     +   + +     +L L  L+  G S+   
Sbjct  146  LMWSSIVSRMLTALTTLFIAKEPIPGAVHCYRQNAKKIIFIWWILFLSGLITLGASIAYW  205

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL----LLVISLTL  278
            +P L+  ++F F  + +  D+  G+ AL  S  LV   WW +FGR +L     L I    
Sbjct  206  VPALILSIFFQFLAFTVLIDDKRGIDALYASYQLVHNAWWKVFGRLLLLSFCFLGIFFGA  265

Query  279  SFLTARIPYVGEAA-------------------NLAFSLLLTPFSFLYYYLIYSDLKANY  319
              +   + ++                       +    LL+  F F +  L    L    
Sbjct  266  GIVLGLLTWLFATIVSPLVGSKYNSLLLSLYISSGVVFLLIGSFLFSFATLFSYKLYLAL  325

Query  320  RGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQ  379
            +  ++  +  +   +   +  +  +  +  VSL         +         ++      
Sbjct  326  KEIKNTALLTRSSLVFRVMCIFFAVASIAYVSLFYFQSQEILISRGINTFDGKIILDRVI  385

Query  380  TPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLK  439
               +     +E   LS  + K         ++      G      D F+   ++      
Sbjct  386  REFIVPEQKQESNFLSPTENKKYQRNPPLDSARDVDLEGYTGKPEDAFFTIYKSTGRDFS  445

Query  440  LELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSG  499
            ++L      +  Q+G + +     LD+  + L  R          +         D    
Sbjct  446  VKLLTPRKWNFQQEGPSYLFKR--LDEGVQFLVARDEQERQSTRDFAR-------DALES  496

Query  500  IRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTR  536
            I+S Y         V      +  T P ++ S+ +  
Sbjct  497  IKSTYPADSETLNSVEIQDAFVGFTQPASLFSVTVKP  533


>WP_054977454.1 hypothetical protein [Paenibacillus sp. A3]
Length=452

 Score = 87.9 bits (214),  Expect = 2e-15, Method: Composition-based stats.
 Identities = 45/325 (14%), Positives = 97/325 (30%), Gaps = 14/325 (4%)

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
             ++ W           IL+G+     +  ++       +   +  G+  +GS      ++
Sbjct  130  FSRYWPMVGNTLLFGLILVGVYTAISTAVVFGVIIVAAVSFGIGAGISELGSDPFGSSIV  189

Query  211  ILVVGGGS-----LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
            ++V+  G      LL++     F + F F   V+A +  G   +L +S  L  G++W IF
Sbjct  190  LVVLLVGMYLLVGLLVMAGIGFFAIRFGFYLPVVALE--GARSSLSRSWKLTKGNFWRIF  247

Query  266  GRFVLLLVISLTLS-------FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
            G + +L +I                ++  +G+   +  +LL+TP   + Y + Y DL+  
Sbjct  248  GIYFVLSIIYSVFMMGTYALLIAVFKLSLLGQLIYIVLTLLITPIYMIPYAVTYFDLRVR  307

Query  319  YRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQ  378
              G     +        +              + +  +  A    +       +  +  Q
Sbjct  308  NEGADLEQMLAAGQSYGSYSGQPGEAGYAAGSAQAGGSAYAATAGAGHAVEPVQTDSFAQ  367

Query  379  QTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWL  438
                       +       D     +     +      +G                    
Sbjct  368  TPGAEGPDTGAQAAEAVKTDEPAQEAAAEHASGAAQDRVGSDGEAVRVPGMITPAGQEDG  427

Query  439  KLELSDFPNLSLAQKGSARIEIDKV  463
                    + S+   GS   E    
Sbjct  428  SPVNPSPESESIDANGSKEPEKKDE  452


>NNK00149.1 hypothetical protein [Desulfatitalea sp.]
Length=690

 Score = 89.1 bits (217),  Expect = 2e-15, Method: Composition-based stats.
 Identities = 39/254 (15%), Positives = 75/254 (30%), Gaps = 1/254 (0%)

Query  340  GWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADY  399
               L   L L        +       G  +    G+   + P+        P        
Sbjct  437  PGQLTAALTLDQGLVAAANEISNEGWGAFMSVDSGSPGSRPPEEMVDTDAGPYPAEVDLA  496

Query  400  KLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSL-AQKGSARI  458
             L   + R        S               +   + L++     PN+      GS  +
Sbjct  497  DLPAYESRHGERYAWRSGPLAATVHAVRLTPGELVEIELRITGRQIPNVPDADIGGSGTV  556

Query  459  EIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSIL  518
             I  V D   + L   ++           + ++D        ++I L+ G     V +I 
Sbjct  557  AITAVTDRQGKALLRDEYCGREINSEPAVLKKSDGPAEIQAFKTIRLKSGVPLAAVAAIK  616

Query  519  GKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASN  578
             ++ L +    ++  L        ++  G  L  +++    V     GDR+ LL++   N
Sbjct  617  ARIALDIATRTQTRVLAIPLQDPVIKTDGFYLRFKQVRGGTVDYVVSGDRSRLLSIRGLN  676

Query  579  SHAEPLREIGFTWQ  592
            +    LR  G    
Sbjct  677  AARRYLRSTGAASF  690


>KKT47946.1 hypothetical protein UW39_C0004G0005 [Parcubacteria group bacterium 
GW2011_GWC2_44_17]
Length=256

 Score = 84.4 bits (205),  Expect = 3e-15, Method: Composition-based stats.
 Identities = 47/256 (18%), Positives = 94/256 (37%), Gaps = 4/256 (2%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            +  + IV     + +A     A             +LL     I + L   TG +     
Sbjct  1    MAFIFIVFGGLFLKNAAPNFSAIMGGLSAVTVTIGVLLFIPFIIWILLLAYTGVLLPGFA  60

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
                 +    K       S   + IL   ++  G LL IIPG+++ +++    Y  A ++
Sbjct  61   DERARIGTVFKFTWSKFRSLLWVAILYSFIIAVGILLFIIPGIIWSIYYCMAFYACAFED  120

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
              G+ AL +S+ LV G+ W I  R+    +    LS +   IP  G   ++    ++TPF
Sbjct  121  KRGMDALRRSKELVKGYVWTIMRRW-GFWMYFALLSGIVGTIPLFGWIWSIIALFIITPF  179

Query  304  SFLYYYLIYSDLKANYRG---PQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAE  360
             ++YY+ +Y  ++              +++   +        L   L+++ ++ +     
Sbjct  180  QYVYYFSVYLTIRERKNKGFATDVATGRQKMNTILLLFGPVFLFTILVIIGIASKTSDNS  239

Query  361  QLLSAGKDIQQRLGTQ  376
                  +D        
Sbjct  240  NAFEEFEDKNITGEEA  255


>OGZ50532.1 hypothetical protein A3C83_02435 [Candidatus Ryanbacteria bacterium 
RIFCSPHIGHO2_02_FULL_47_25]
Length=677

 Score = 87.9 bits (214),  Expect = 3e-15, Method: Composition-based stats.
 Identities = 53/344 (15%), Positives = 108/344 (31%), Gaps = 26/344 (8%)

Query  141  LLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHV  200
             +           +  + I +      +    ++       +           +  +   
Sbjct  41   FINLVVAPYKSTSDPLYMIGIIAGVVAVFSELFIFPVAVFSLAGGRA-----YRDSVNSF  95

Query  201  GSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGH  260
              + L+ I   +V  GG +LL+IPG++F  WF+F  YV   +   GL AL +SR LV  +
Sbjct  96   VPYFLIFIFGSIVTIGGFVLLVIPGIIFLTWFWFLNYVNLLERKNGLSALHRSRELVRDN  155

Query  261  WWAIFGRFVLLLVISLTLSFLTARIP------------------YVGEAANLAFSLLLTP  302
            +W +  RF    +    ++     +                      E     F  L+ P
Sbjct  156  FWKVLLRFAAAFLAIFIVAVFFIFVVKYITAHLLAKSLASFYQRVFTEVVTRLFGFLIAP  215

Query  303  FSFLYYYLIYSDLKANY-RGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQ  361
            F   Y YL+Y DL A     P+  P +++ +  +        I GLLLV  +   ++ + 
Sbjct  216  FFVSYGYLVYLDLVAIKAAVPETQPTRKEKISYSLVALLGAPILGLLLVLNTLYLIARDA  275

Query  362  LLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVT  421
                  D+  +    P+             +       K    +  +             
Sbjct  276  PPPNDSDVVLQKIEVPENENAYFSLQKIIEKLPQEQKEKYGHWQ--EMVDGKAWYDDEAR  333

Query  422  LFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLD  465
              ++      +      +     +P L+     +  + +  +  
Sbjct  334  ALSEGNQKFFEYFTEAAEKSQYIYPPLADPANITPALVLPSLNS  377


>KKR18156.1 hypothetical protein UT47_C0003G0168 [candidate division CPR2 
bacterium GW2011_GWC2_39_35]KKR27203.1 hypothetical protein 
UT59_C0066G0004 [candidate division CPR2 bacterium GW2011_GWD1_39_7]KKR27431.1 
hypothetical protein UT60_C0049G0004 [candidate 
division CPR2 bacterium GW2011_GWD2_39_7]KKS09107.1 
hypothetical protein UU65_C0003G0162 [candidate division CPR2 
bacterium GW2011_GWC1_41_48]OGB57384.1 hypothetical protein 
A2Y27_03700 [candidate division CPR2 bacterium GWD1_39_7]OGB71082.1 
hypothetical protein A2Y26_05500 [candidate division 
CPR2 bacterium GWD2_39_7]HBG81820.1 hypothetical protein 
[candidate division CPR2 bacterium]
Length=238

 Score = 83.7 bits (203),  Expect = 3e-15, Method: Composition-based stats.
 Identities = 42/194 (22%), Positives = 74/194 (38%), Gaps = 3/194 (2%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W LF      LL +  +  ++        L           +  +   +++      +  
Sbjct  26   WGLFKTNFKPLLILIAITGMINLIASLGGLFFDDTNGQELISDLFVLVLVIFLSILSIYP  85

Query  171  LSWMTGSMFIYICKTDVG---LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
            L     S+   I   ++    L    K        F  + IL  L V  G +LLIIPG +
Sbjct  86   LLMYLQSLDKIISGKNLIKGQLSGIFKETKGKFWGFLFVTILYGLKVLLGFILLIIPGFI  145

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
            F V +F   Y+   +   GL+AL +S+ + SG+   IF   V+L +  + +S +   +P 
Sbjct  146  FMVMYFMAPYIYVSEGKRGLEALRESKAITSGYKGKIFVTLVVLYLPIIVVSIILTSLPI  205

Query  288  VGEAANLAFSLLLT  301
            +        S +L 
Sbjct  206  ISSILVTFLSFILI  219


>RMG35661.1 hypothetical protein D6725_12135 [Planctomycetes bacterium]
Length=318

 Score = 84.8 bits (206),  Expect = 5e-15, Method: Composition-based stats.
 Identities = 47/316 (15%), Positives = 93/316 (29%), Gaps = 16/316 (5%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPE-----CCQTLIFDPAESQRTQTTDNIATCP  56
                CP CGA    P +    K    RC                +     ++ ++  +  
Sbjct  3    IEFDCPQCGARLRVPDAAAGHKSRCPRCGAIVDVPQTDIHETVGSPIGPAESVESAPSAD  62

Query  57   HCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCR  116
                     +          +  + +     +P      + S         A+    F  
Sbjct  63   ATDTGWPNSATETGGWDAEPSVTQPSPYATSEPMAAAGDAASSAAQAIPTDAEGIARFAW  122

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                     L+G VL    I + L         P       A        I  GL+    
Sbjct  123  AVCKENLGLLIGAVLIVGLIDTVLQALVERLSVPGPDRGAAAAAALVAWVIEAGLAIGYW  182

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             M + I + +      +  G R +    +  +L   V+  G ++ I+PG++  +  +   
Sbjct  183  RMVLKIVRGEAADLNDLLSGFRRIPVVFVGSLLFGTVIVAGFVVFIVPGVILSLRLWPYH  242

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
             +L +   G  +A   +  + +GHW            + + LS +      +G  A    
Sbjct  243  LLLIEGRAGITEAFSVAWEMTAGHW-----------ALPILLSVIGFLAVILGLLACGVG  291

Query  297  SLLLTPFSFLYYYLIY  312
             L+  P+  + +   Y
Sbjct  292  LLVAIPYITVMWASAY  307


>PCJ63660.1 hypothetical protein COA73_05130 [Candidatus Hydrogenedentes 
bacterium]
Length=698

 Score = 87.1 bits (212),  Expect = 5e-15, Method: Composition-based stats.
 Identities = 33/240 (14%), Positives = 71/240 (30%), Gaps = 17/240 (7%)

Query  399  YKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFP----NLSLAQKG  454
             +  +  +          +GP  +       +  NP   +++    F      L++    
Sbjct  334  TEGQILLKSGKRPYPIAFVGPYLIEVLAVEENAPNPTGEVRVAARTFGLDAGVLTVQDLL  393

Query  455  SARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQV  514
               I+I  V D   R L        + +   V      +      I  I      +   +
Sbjct  394  YETIKIQSVSDSQGRSL--ANTDIRYLSNPEVKGTAAYDTVGXDLINMIR-----EVTHI  446

Query  515  HSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDR----TD  570
              I G  ++ +P+ +  L+    + G     G   ++L+            G        
Sbjct  447  DLIAGGYDVLVPVQVYELEFASLEDGTQQSAGDLTVLLKS-SGETSMFEISGPEALTENL  505

Query  571  LLNVHASNSHAEPLREIGFTWQKSGDAFSLRQ-MFDGNIESITVLVAGDSMTQSYPFELT  629
             +     +   +P+     + Q      S  Q         I + +   ++ +SYPF+L 
Sbjct  506  KIMYLPIDEAGDPMAVQYDSTQFWMPGQSQSQLNTHEAPVKIRMKIIAGAVIKSYPFKLN  565


>OGZ79235.1 hypothetical protein A2358_03170 [Candidatus Staskawiczbacteria 
bacterium RIFOXYB1_FULL_37_44]
Length=398

 Score = 85.2 bits (207),  Expect = 8e-15, Method: Composition-based stats.
 Identities = 34/181 (19%), Positives = 70/181 (39%), Gaps = 6/181 (3%)

Query  451  AQKGSARIEIDKVLDDDARDLYDRQHSFEHPAF--HWVGINQTDENDLFSGIRSIYLRQG  508
              K    +    VLD + +++ D++ SFE   F    +     D  + +   RSI + + 
Sbjct  95   NAKKEIDVRFLSVLDSNGQEVLDKESSFETKPFWTKKMLDKSNDPVEHYKADRSISVLEN  154

Query  509  TQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDR  568
               E   +I G + +      ++   +++ +         ++ +     N +T+   GD 
Sbjct  155  FSGE-FSAITGTIYIDSIQGQKTYSFSKDQLSSLSSSSDSKIKIGEFEENYLTVIVSGDL  213

Query  569  TDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFEL  628
              L++V A +S    L   G       D           I+ + V+ A + + ++YPF L
Sbjct  214  NKLVSVTAYDSSNNELEREGK---GDADGGLNVYYETAKIDKLVVIYANNIVRKAYPFTL  270

Query  629  T  629
             
Sbjct  271  K  271


>HBD04937.1 hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=312

 Score = 83.7 bits (203),  Expect = 1e-14, Method: Composition-based stats.
 Identities = 50/229 (22%), Positives = 96/229 (42%), Gaps = 6/229 (3%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                 + +          +   K    +  S+ +G++H      L IL  L+V G + L 
Sbjct  68   FGAFVLFVSTWINGSIYSLISAKQKQEIGESLSIGIKHFWPMLWLTILNGLIVLGFTGLF  127

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I+PG+   V F F  YV   +   G  AL +S+ L+ G+WW +FGRF+LL+V    ++ L
Sbjct  128  IVPGIYIGVAFMFVYYVHFAEGARGFGALMRSKDLIKGNWWNVFGRFLLLIVGIYAVAIL  187

Query  282  TARI------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLT  335
               +      P +    +  FS+L + ++  +   +Y +L A+    +   + RQW    
Sbjct  188  LMLLGELFNAPLLFSVVSQLFSILASVYAMAFSVEMYHELAASKGVHKPEVVGRQWKYAV  247

Query  336  AAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLN  384
             +I G+++   +   + S        + +      ++     +Q    N
Sbjct  248  LSIAGYLMFGLIAYSAGSMFVKLMSSIPTDETPDAEQWQEFLKQIESEN  296


>KKU46575.1 hypothetical protein UX65_C0002G0049 [Parcubacteria group bacterium 
GW2011_GWB1_46_8]KKU48021.1 hypothetical protein UX66_C0001G0040 
[Parcubacteria group bacterium GW2011_GWF2_46_8]
Length=423

 Score = 84.8 bits (206),  Expect = 1e-14, Method: Composition-based stats.
 Identities = 68/334 (20%), Positives = 123/334 (37%), Gaps = 14/334 (4%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPA--------TWLNPQNQNWQWA  158
               +W ++ +R    LG+  + +++    +                             A
Sbjct  71   FGQAWTIYKQRLGTFLGVMAIPMLIMVVLLAVLAGGGLLGISLLSSKFAAGGIGLLILLA  130

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            IL   + +I            I   +  +G+  S + G   + S+  + +L+  +  GG 
Sbjct  131  ILFFVIVFISQAWGQTALLFAIKDSQERIGVIESYRRGWHKLFSYWWVALLVGFITMGGF  190

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            LLLI+PG++F  WF    ++L  +++ G+ AL KS+  V G W  +F RF  +  ISL +
Sbjct  191  LLLIVPGIIFATWFSLAVFILIAEDLKGMNALLKSKEYVKGKWGGVFWRFFFIGAISLII  250

Query  279  SFL------TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWL  332
            S +        +IP+  E +     L LTP    Y +L+YS+LKA        P   +  
Sbjct  251  SLVPVLIFSLLKIPFGSEISRFVIGLFLTPLVMTYSFLVYSNLKALKGEIAFAPTGGKKA  310

Query  333  PLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQ  392
                A    +L+   +L S    +L + +  +     Q  +               + P 
Sbjct  311  AFIFAGILGILLIPAILFSTVFLSLGSAREKARDARRQADIRQIQMGLEIFYNEQNKYPF  370

Query  393  RLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADR  426
             L+    K L S     ++             D 
Sbjct  371  SLNELSPKYLPSAPVDPSTNQPYQYQLQPNGTDY  404


>MBD8978789.1 DUF975 family protein [Clostridiales bacterium]
Length=562

 Score = 85.2 bits (207),  Expect = 2e-14, Method: Composition-based stats.
 Identities = 51/396 (13%), Positives = 102/396 (26%), Gaps = 32/396 (8%)

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGG  246
            G+  + K    + G       L  L+V   S+L IIPG++F    +F   ++ +   +  
Sbjct  151  GIKVTYKEAFDNYGKKLGAAFLQGLLVDLLSVLFIIPGIIFYYSSYFTYQLICEYPELSP  210

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFL  306
            +QA++ S+ +V GH   +F   +             + IP+      +   + + P+   
Sbjct  211  MQAIKLSKKIVKGHRSELFALDL-------------SFIPWGLLCIAVFPLIYVIPYVST  257

Query  307  YYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAG  366
               L Y + KA                                 S    N    Q   + 
Sbjct  258  TQALYYENFKARAIQLGVVTEDDFLSDAQK----------AAKYSGQYGNPQGGQYYGSP  307

Query  367  KDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADR  426
               QQ        T               +   +   ++Q+  +     +    T+    
Sbjct  308  YGTQQYAQPVQNPTQPQQNPAQPGAYYGYAQPPQDFGAQQQPQSGAYTQNQPQNTVNYAP  367

Query  427  FWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWV  486
                   P          + N         +           +D    +  +   A   V
Sbjct  368  NAQGYYPPQQPANPYAPTYFNPPQQIVQQPQPAYFTPDIPQPQDPEKPKDIYAPLANDTV  427

Query  487  GINQTDENDLFSG-IRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQI  545
                  E         +I   +  Q      +   +E   P           +  +   I
Sbjct  428  NPPDMPEEPKQEAPQITITEPEEPQTPLFSEMEEPVE---PTEAFIEPTEPTEPSEPKDI  484

Query  546  GGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHA  581
              +      +   A+     G    +    A N+  
Sbjct  485  SSQTAQAADIPEAAMQFMQAG----VFGTGAFNTPG  516


>NIQ93538.1 hypothetical protein [Desulfuromonadales bacterium]NIR33730.1 
hypothetical protein [Desulfuromonadales bacterium]NIS39881.1 
hypothetical protein [Desulfuromonadales bacterium]
Length=313

 Score = 83.3 bits (202),  Expect = 2e-14, Method: Composition-based stats.
 Identities = 69/365 (19%), Positives = 130/365 (36%), Gaps = 53/365 (15%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            MP + CPHCG  +       P++                                     
Sbjct  1    MPQLTCPHCGLTKEVSGDNFPSRPVKVT--------------------------------  28

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                              R  +         E  AS   L  I  L A +WE++ +R   
Sbjct  29   ----------------CPRCRDSFVFPAEGPETAASPGELTDIGDLFAAAWEIYRQRLGT  72

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI-LLGLSWMTGSMF  179
            L+ +  + +++   P+   +       +    +         T A +  +   W   ++ 
Sbjct  73   LIPLSFISLLVILVPVAIFVAGGYFVSVFVGYREAFSIAGAVTGALVGAIAFIWAMAALT  132

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                   +G+  SM      +G+F  +  LL +++ GG  L +IPGL+F V F F QY+L
Sbjct  133  FAAVDESLGIKASMSCAWTRLGAFVWVFSLLPIIILGGYFLFLIPGLIFSVLFIFAQYIL  192

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
            A +++ G+ AL KSR  V G+ W +F R +++ + +  ++ L    P +G  A       
Sbjct  193  AAEDVRGMNALLKSREYVRGYGWPVFLRLIVIWLATGLVTSLLNMAPVIGSLATYFV---  249

Query  300  LTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSA  359
             TP+ F+Y  L+Y DL+       +                  ++    + ++       
Sbjct  250  -TPYVFIYTVLVYRDLRRIKGDVAYDSSPGAKFLWLFLGALGFIVFFGAIAAMVISGTFT  308

Query  360  EQLLS  364
            +  ++
Sbjct  309  QMQIN  313


>MBC8087786.1 hypothetical protein [Phycisphaerae bacterium]
Length=199

 Score = 80.6 bits (195),  Expect = 2e-14, Method: Composition-based stats.
 Identities = 26/195 (13%), Positives = 69/195 (35%), Gaps = 6/195 (3%)

Query  153  QNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL  212
                   +L +V    L  ++++          +  +  +++     V +    ++L  +
Sbjct  1    MWRYLPGMLTSVLSYSLVSAFVSRMGSEVYLGGEPDVGATLRGVAPRVPTIIATMLLSSI  60

Query  213  VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
            ++   ++  ++P +   ++ F     +  +  G   ++ +S  LV G    +FG   LL 
Sbjct  61   LMFLAAIFFLVPAVFVFIFLFGTVPAIVLEGKGVFSSMSRSSRLVKGRKGHVFGTLALLF  120

Query  273  VISLTLSFLTARI------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPP  326
             I    +   +          +    +  FS++ +P   L   ++Y DL+    G     
Sbjct  121  GIYFVFAIGFSIAAAATGNNMITIITSSLFSVVASPLLILGVMVLYYDLRIRAEGFDVEH  180

Query  327  IKRQWLPLTAAIFGW  341
            + +      +A    
Sbjct  181  MAQNLGAPQSAPSMG  195


>MBI3653834.1 hypothetical protein [Acidobacteria bacterium]
Length=394

 Score = 84.1 bits (204),  Expect = 2e-14, Method: Composition-based stats.
 Identities = 42/362 (12%), Positives = 78/362 (22%), Gaps = 12/362 (3%)

Query  90   EREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLN  149
                       +  + +L     +       L  I      L      +           
Sbjct  14   WISEGWKMFTEQWKAWVLNTFIYVLICGTPMLAVIIGFYGYLFTQLFRNPHGAPAIGPEV  73

Query  150  PQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLIL  209
                      +L    ++            +   +      R +  G        +  IL
Sbjct  74   FITFYLALFAVLFLTLFVAAFFIGGMHRAALKQLRGGTVELRDLFSGGSTYFPILIATIL  133

Query  210  LILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV  269
              ++   GS L IIPG +     FF   ++ D  +G + A++ S  LV  +         
Sbjct  134  TTVLTMIGSALCIIPGFIVGGCLFFTLPLIVDRRLGAIDAMKASYELVKQN---------  184

Query  270  LLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
              L++    +F+   I   G  A     L   P  F    + Y D               
Sbjct  185  --LLMFTLFAFVVQLIASAGSYACYVGLLATIPLLFTISAVAYRDTFGVEGARYFTTNAP  242

Query  330  QWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQ-QRLGTQPQQTPDLNRSLP  388
                  A        P       +                          Q P       
Sbjct  243  PAAGGYAPPLEQRYQPPAPDFGQAAYPPYPNASYGQAGYPPSPNAEYGQAQPPYATPPDR  302

Query  389  EEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNL  448
                           +++ +            T        +   P     + L++ P  
Sbjct  303  RPITEKLHNPAPPQPTEKLRAPEVAAPLDTLETAAPQTEPVEPLQPRFDATMPLANAPLQ  362

Query  449  SL  450
              
Sbjct  363  MP  364


>PIS41547.1 hypothetical protein COT25_02505 [Candidatus Kerfeldbacteria 
bacterium CG08_land_8_20_14_0_20_42_7]
Length=465

 Score = 84.4 bits (205),  Expect = 2e-14, Method: Composition-based stats.
 Identities = 46/324 (14%), Positives = 101/324 (31%), Gaps = 23/324 (7%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
              F       L    +G++ A   +            +  +          T+  ++   
Sbjct  73   RAFQATKQAFLQYLAIGVLGAIFAVVVGYACVALLIGSGASVVAYVVGFFVTILVLVFVF  132

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
              +  S  I I      + +++ L ++ V  ++L  +L   V+ GG+ L  IPG+L  + 
Sbjct  133  LVIQYSAVIIISYKTTSIGQAISLTMKKVIPYSLAGVLAYFVIAGGTYLFFIPGVLMALA  192

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS------------  279
            F    +V+A +N+ G  A+++   +  GH   I    ++L ++ L +             
Sbjct  193  FTLLPFVVAYENLSGFAAMKRCYAIAKGHRGQILWALLVLFLVYLGVFVLLFVLLSAITA  252

Query  280  -----------FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
                       F++     +       F +    F   Y Y++Y+D+ A  +        
Sbjct  253  GAGAVQSDAGNFISGVWLILTLVVLGLFYIFWPLFQTSYIYVLYTDIAACQQIADIQTEH  312

Query  329  RQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLP  388
            R    +   I  + L         S    +        + +     T        + +  
Sbjct  313  RGKKLMIGYIVVFFLFIIFSFGLTSFLPSNLSSTTDDLQRMSAVSTTYYDILEYTSATNG  372

Query  389  EEPQRLSSADYKLLLSKQRKTTSE  412
              P  L             +  ++
Sbjct  373  LYPASLDELLQDPNTYSITQEDAD  396


>NNL75238.1 hypothetical protein [Desulfobacterales bacterium]
Length=380

 Score = 83.7 bits (203),  Expect = 2e-14, Method: Composition-based stats.
 Identities = 28/209 (13%), Positives = 71/209 (34%), Gaps = 14/209 (7%)

Query  428  WADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVG  487
              +      ++     D  NL++  + +        L           H+     F    
Sbjct  1    RFNPNPFPTYVDFNDPDLNNLTVEWQEADENGWTDNLSVQLPKGPFSGHANWEVHFFGKK  60

Query  488  INQTDENDLFSGIRSIYLRQGT-QAEQVHSILGKLELTLPLAIESLQL-----TRNDIGK  541
              Q  + +  +GI+ +  +    Q +   +  G+++L L   I+ L       ++ D  K
Sbjct  61   KPQYLKGNAVTGIQDVSFKLDKGQLKNSAAAFGRVQLNLQTDIKRLTFVNKDTSQPDARK  120

Query  542  TLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLR  601
                    +   +       + +   + D++ + A ++  + L++  +T  K G      
Sbjct  121  LPSGHTVTVSFNKNE-----VTYSTGKADVIQISAYDAWGKRLKQDDYTRTKKGKRQ---  172

Query  602  QMFDGNIESITVLVAGDSMTQSYPFELTR  630
              F G      + VA  ++ +   F++ +
Sbjct  173  IYFWGLPAKFDMDVATKNLEKQVVFDIKQ  201


>ANZ76760.1 BA75_03591T0 [Komagataella pastoris]
Length=508

 Score = 84.4 bits (205),  Expect = 2e-14, Method: Composition-based stats.
 Identities = 43/415 (10%), Positives = 95/415 (23%), Gaps = 29/415 (7%)

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY--  287
            V  FF         +  +  +++S       +W +F    ++  ++  L  + + + +  
Sbjct  102  VAAFFTLVTALVWGLSLISGIKRS-----PVFWKVFCFLQIIAFMATVLVIVVSYLLFSP  156

Query  288  ----VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWML  343
                V      A          +   ++ +   +  R  +    + + + L   +     
Sbjct  157  HTTWVMAVMCAAGGCS---LVSVILVILCASWVSRARKYEDEEDENEKMNLNDDLTNMDP  213

Query  344  IPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLL  403
            I    +      +  A+                      +     +        D K   
Sbjct  214  INTNNIGLTDFYSDRADMKNDELIHQHTVDTGDSGSRYGVISVNTQTKMNDFVNDSKKSR  273

Query  404  SKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQ-KGSARIEIDK  462
                                      D + P L    +    P L       +       
Sbjct  274  YVMDAVPKIPLPQDPNDQNSVSSLMNDARQPMLPEISDPYSSPTLDPIILGNTVDERDQN  333

Query  463  VLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQG-------TQAEQVH  515
            +            + FE  +       +      + G  +  +  G         A+Q+ 
Sbjct  334  LNYKADSKDQSSDYMFETGSNFTSVSQKGINPQYYPGAGNYPVLPGRMDRQQEAPAQQMF  393

Query  516  SILGKLELTLPL---AIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLL  572
                    T P         Q    + G         +  Q   S    L          
Sbjct  394  PASANYYSTSPQMLNNTPQFQGPPPNQGPYNNQYQTPVTYQNQPSTTDLLLSKNPD---F  450

Query  573  NVHASNSHAEPLREIGFTWQKSGDAFSLR-QMFDGNIESITVLVAGDSMTQSYPF  626
             +     +  P R++G          S   Q F G        +   SM++  P+
Sbjct  451  FIGGHQKNGRPRRKVGIPQNYVPQGNSQAPQQFGGPKNRNKNNIPAASMSRDSPY  505


>WP_111889993.1 DUF975 family protein [Aerococcus urinae]RAV93935.1 hypothetical 
protein DBT53_07745 [Aerococcus urinae]
Length=659

 Score = 84.8 bits (206),  Expect = 3e-14, Method: Composition-based stats.
 Identities = 44/435 (10%), Positives = 103/435 (24%), Gaps = 23/435 (5%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
             + +          +  +                     +         +   +     L
Sbjct  101  FFMVQVWVYGTPWSLLEMVDGGDHHIGRVWSAFTHRPIRHYIANFLAAIVRWFSAFVFYL  160

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             L+ +    ++ +  T       +   + HV    + L+LL LV    S L      +  
Sbjct  161  VLATILFFYYVVVLSTARAAIGQVTGIVVHVLYLLIALLLLFLVALLTSWLYYGFNFIMF  220

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
                       ++  G  ++L  S  L+ G+ W +F   +  + + L +    +      
Sbjct  221  -------PTYDNEETGVFRSLRMSWQLMRGNKWRLFKMGLGYVFLPLIIGGAVS------  267

Query  290  EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLL  349
                               Y  ++ L                  +  A+F         L
Sbjct  268  -----LLMAYFRTAFPQADYYFWAQLGLGALVVLFLFGNLIKFMVVQAVFYREQTKQYAL  322

Query  350  VSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKT  409
                       +      D+             +     E      SA        +   
Sbjct  323  YLNDHFPSFGAEASDNIADLYHTEDRPQFTADTMAIDASELNALDDSASSSDDYLSEAAF  382

Query  410  TSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDAR  469
                G       +  +  ++  Q+ +   +   SD  +L   +     + +      D  
Sbjct  383  RETYGPDQDDFKVGGEGTYSQPQSTYPENENSDSDPASLDAEENDDEIMVMAPADSQDEP  442

Query  470  DLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAI  529
               D Q   +         +  D   +    +   +   +    +       + + P A 
Sbjct  443  ARSDSQRDNKSDQPDETAYDYADPGYVLDDRQQEEVLPASDIFDLAE-----DYSQPEAS  497

Query  530  ESLQLTRNDIGKTLQ  544
               + T  D  +T Q
Sbjct  498  PINEETLPDQAETPQ  512


>PLX75151.1 hypothetical protein C0614_11220 [Desulfuromonas sp.]
Length=709

 Score = 84.1 bits (204),  Expect = 5e-14, Method: Composition-based stats.
 Identities = 95/596 (16%), Positives = 188/596 (32%), Gaps = 44/596 (7%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              + CPHCG  R  P++K+P+++    C  C QT            +T   +        
Sbjct  94   VEICCPHCGNCRELPAAKVPSRQVKVTCHACGQTFTLHGDRLLAQLSTQAPSRQSDSQTG  153

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                                     L P  E       L  I  L   SW +F RR   L
Sbjct  154  S------------------------LVPPPESTPESLKLAKIGDLFEASWTVFKRRIMTL  189

Query  122  LGI-YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            LGI      +LA   +     L         +  +   ++L      ++  +    +M  
Sbjct  190  LGINLSGFALLAVGYMLFGWGLTVLPGAFGDSLLFVVPVMLVGALITVVVFAIFGAAMTY  249

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +   D+G+ +S+  G+++   F  +++L+  +VGGG LLL++PGLLF  WF F Q++ A
Sbjct  250  ALVDEDLGVRQSVAYGIQNFAGFLWVMLLVGFIVGGGYLLLLLPGLLFMTWFVFAQFIFA  309

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
             +++ G++AL KSR  V G    + GR +L+ +    +S +    P +G        +  
Sbjct  310  GEDVRGMEALLKSRAYVRGQELPVCGRILLIALFGSAISSIPLVGPLLGLLVVPFVLIFY  369

Query  301  TPFS----FLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQN  356
                     +   + Y   ++                L   +  +       L   +  +
Sbjct  370  HELYLDLRRIRGSISYPVTRSEKAKWLAAGGAGYLAVLLLVVVMFGSFLLQGLAIFTEVS  429

Query  357  LSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLS  416
             S  Q       + +              S         +   +   + + +  +     
Sbjct  430  PSLPQPAPQNLSLGEPNQQSTPVRTVQQPSKAGLFLANDNLAPEESFTVRFEAPAGLSTG  489

Query  417  LGPVTLFADRFWADDQN----PHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLY  472
                 + AD    +  +       +  LE + F                 +  +D+ D  
Sbjct  490  ALLAMVPADLAVQNGPSVLTIALDYSYLEGAMFGEFRFTAPEMPGNYSLGIYQNDSVDQK  549

Query  473  DRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPL-----  527
                +F       V +           I         +A ++  I     +++P      
Sbjct  550  IASVNFTVEKIVTVKVGGPLPVAADPSITYQSPIANEEAPRISGIQSPAYVSIPTKGESR  609

Query  528  -AIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRF-----LGDRTDLLNVHAS  577
              +       N  G+    G + ++       + +         G  +  ++ +A 
Sbjct  610  QQVMIFIGALNYQGEIRVNGDELMVFSGEPGVSDSYTSGVWLESGSNSLEISYNAL  665


>PYM54499.1 hypothetical protein DMD79_24525, partial [Candidatus Rokubacteria 
bacterium]
Length=122

 Score = 76.7 bits (185),  Expect = 6e-14, Method: Composition-based stats.
 Identities = 19/118 (16%), Positives = 45/118 (38%), Gaps = 1/118 (1%)

Query  513  QVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLL  572
            ++ ++ G L +  P  +++L+L     G+  ++    + +       +TL    D   ++
Sbjct  3    ELKALRGVLTIQFPKTLQTLRLDDLSPGQHAELADLSVRVVGRTRKGLTLGINKDGNRVV  62

Query  573  NVHASNSHAEPLREIGFTWQKSGDA-FSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
                 N+  + +   G    +S D  +              V++AG+   + Y F L 
Sbjct  63   YARLLNTEGQAIAFFGPQITESPDGAWRFELPPFSPPARAEVILAGELERKGYAFTLK  120


>OGY17246.1 hypothetical protein A2784_02080 [Candidatus Chisholmbacteria 
bacterium RIFCSPHIGHO2_01_FULL_48_12]
Length=241

 Score = 80.2 bits (194),  Expect = 6e-14, Method: Composition-based stats.
 Identities = 50/206 (24%), Positives = 91/206 (44%), Gaps = 5/206 (2%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
             + R  W L+ + ++  +L    +    +L          Q             +++ +S
Sbjct  16   FYRRHFWPLVKLLVIPSLLTILLVAIMAMLLILVTNLSSWQWVILVPTFIIGLTVIIIVS  75

Query  173  WMTGSMFIYICKTD---VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             +  +  IY         G+    +     +G   L  +L+ L+   G +LLIIPG++F 
Sbjct  76   LLGYAAIIYFLGNSGQYPGILSLFRAVWPLIGKLWLTQLLVSLITTLGFILLIIPGIIFL  135

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI--PY  287
              + F  +V+  D   G +AL+ S  LV G +WAIFGR +L+ +    LSFL  +I  P 
Sbjct  136  TRYVFFPFVVVLDRKFGREALKFSTSLVKGRFWAIFGRGLLISLFPWVLSFLITQIDQPI  195

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYS  313
            +     + F + + P + +Y+Y +Y 
Sbjct  196  IVGLLQIIFYVTIGPLTTIYFYHLYQ  221


>PIZ00505.1 hypothetical protein COY62_02255 [bacterium (Candidatus Howlettbacteria) 
CG_4_10_14_0_8_um_filter_40_9]
Length=315

 Score = 81.4 bits (197),  Expect = 8e-14, Method: Composition-based stats.
 Identities = 40/173 (23%), Positives = 82/173 (47%), Gaps = 3/173 (2%)

Query  143  KPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGS  202
            +   +          +++L T+   +LG+  +  ++        + + ++ K  +  V  
Sbjct  129  QFRIFFTTFLNPLNVSLVLLTIVLAVLGVVALYRAIVDLDQGESLDIQKAYKNAMPFVLP  188

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
            +    IL  L V  G +LLI+PG++F  WF    + +  +  G +++L +S+ L+ G+WW
Sbjct  189  YLGASILYGLAVIFGMILLIVPGIIFLGWFMLFGFTVVYEKKGAVESLSRSKELIKGNWW  248

Query  263  AIFGRFVLLLVISLTLSF---LTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             I GR++L  +I   ++    L + +P+ G     A   + + F  +Y Y IY
Sbjct  249  GIVGRYLLGSIILGVVAALPNLLSYVPFFGPIFQGAIQSIGSVFMTVYLYNIY  301


>KKW28605.1 hypothetical protein UY73_C0041G0005 [Parcubacteria group bacterium 
GW2011_GWA2_52_8]
Length=478

 Score = 82.9 bits (201),  Expect = 8e-14, Method: Composition-based stats.
 Identities = 63/403 (16%), Positives = 125/403 (31%), Gaps = 13/403 (3%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPA---TWLNPQNQNWQWAILLATVAYI  167
            W L+  R    LG+  + +      I    L +                   +L  +   
Sbjct  30   WRLYRSRLKTFLGVMAVPVAAVLIFILVPALSQVFSKSVGAGAALIILFLLFVLLMIVIY  89

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
            L  ++ +  ++        +            V S   +  +  ++VGG  +L I+PG++
Sbjct  90   LWAVAALFKAVVESEQGIGIKDAYLSAWREGRVSSLFWVNFVNGIMVGGAFMLFIVPGVI  149

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-  286
            F VWF    +V+  D   GL AL KSR  V G WW I  R + +LV+    SF+   +  
Sbjct  150  FSVWFSQGPFVVMTDRERGLNALLKSREYVRGRWWGILWRNLCMLVLVYLASFILTLLAK  209

Query  287  -----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
                       +   ++  TPF   Y YL++ +L              + LP        
Sbjct  210  ELGGQLAANIFSWILTIFTTPFVVCYDYLLFKNL---RDLHTGEITLPKKLPYVLTAIAG  266

Query  342  MLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKL  401
             ++  +++ +L    +++         + + +   P  + +   S         +++Y  
Sbjct  267  WIVMPIVVTTLMIGLITSFMKSVKNSPVAEIVTDIPSSSQNTLPSDTLTTATYRNSEYGF  326

Query  402  LLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSA-RIEI  460
                      +G       T           +    L   L  + N  L           
Sbjct  327  GFRYPSDWVIQGEEQDPIGTGGDGYGLGLSLSSSEALVAHLVKYGNTDLFMVSVIVDNSD  386

Query  461  DKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSI  503
                D   +   D    ++     +        N  ++    +
Sbjct  387  RTPRDYYKQSNLDFSEEYDLVINGYPAYYVKHSNPSYTDHTYV  429


>NLV54496.1 hypothetical protein [Acidimicrobiales bacterium]
Length=346

 Score = 81.7 bits (198),  Expect = 8e-14, Method: Composition-based stats.
 Identities = 37/237 (16%), Positives = 71/237 (30%), Gaps = 14/237 (6%)

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
            L L+  T            G   S++   R +   T +L++  + +  G +L I P L  
Sbjct  104  LALAGTTRLSLGAYLGDRPGWGASLRFAWRRIVPLTAVLVVTTVGMLAGLVLCIAPALWL  163

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
               +     VL  ++ G + +L +S+ LV G +W + G  +   +++  L  +      +
Sbjct  164  QGIWAVAVPVLLTEDRGAIDSLRRSQELVRGRFWPVLGTILAGGLLASVLQGVLVGPTLI  223

Query  289  GE--------------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
                             A +    L TPF      ++Y DL+    G     + R     
Sbjct  224  LTFVGASFLVTSILTGLAQIVGVALTTPFVAALTAVVYVDLRVRKEGFDLELLARGVGVD  283

Query  335  TAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEP  391
                       G +    +    +                      P    + P   
Sbjct  284  PGVETAPSGPTGPVTALANPPVPARPDAGPVAPPGGWGAPAPSPSGPRAVPATPTAE  340


>MBI3120608.1 hypothetical protein [Candidatus Kerfeldbacteria bacterium]
Length=232

 Score = 79.0 bits (191),  Expect = 1e-13, Method: Composition-based stats.
 Identities = 43/206 (21%), Positives = 86/206 (42%), Gaps = 1/206 (0%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
               L + W+L      G+  I LL   +  +   +   L  +      +      I++  
Sbjct  16   WHELVEHWKLLMTVSLGIALIQLLVSYVLPSSWTNFNQLFVSGAKPGLDLTPTIVIIVVL  75

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
               + + L  +     +   K  + L  +   GL        ++IL  ++   G +L II
Sbjct  76   AFVVNVILQSVLYYSVVNRTKEGLTLAHAFNGGLSVSLKILAVVILKTILTLIGFVLFII  135

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            PG+L  VW  F ++    +N+G +++L++S+ L  G + AI GR +L+ ++    + + +
Sbjct  136  PGILAWVWLSFAEFASIKENLGVIESLKRSKELTRGFFAAILGRLLLMALVVAVPTMILS  195

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYY  309
             IP  G       +  L+   F+  Y
Sbjct  196  LIP-AGALIVTVIATPLSILYFVNLY  220


>QFU93549.1 hypothetical protein YIM_42065 [Amycolatopsis sp. YIM 10]
Length=452

 Score = 82.1 bits (199),  Expect = 1e-13, Method: Composition-based stats.
 Identities = 41/321 (13%), Positives = 81/321 (25%), Gaps = 27/321 (8%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             T         ++T  +   I         ++K     +     + +L  L++  G LL 
Sbjct  131  VTALGSSFLDGFLTVVVGKAILGRKPTFGEALKEATPRLLPLLAMTLLYTLMIFVGLLLC  190

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I+PG+   V F      L  +  G   A ++S LLV   +W + G  +L +VIS  ++ +
Sbjct  191  IVPGVYLAVVFGLATPALVLEKAGVGTAFKRSSLLVKNAFWRVLGVLLLAIVISWVINQV  250

Query  282  TARIP-------------------------YVGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
             +                             +     +  +    PFS     L+Y D +
Sbjct  251  VSLPFSLAGDFSGFTTMFSGEVPEYTAWGLALATLGTVIAATFTAPFSAGVRALVYIDQR  310

Query  317  ANYRGPQHP--PIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLG  374
                G                A         G+ +   +            G        
Sbjct  311  MRKEGMDLQLARAAAAPAAPPAGSTDPTPPSGIPMGQATGMPTGPTGFAEPGAPTGPTAP  370

Query  375  TQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNP  434
                       +        ++   +   S +     E   + G +              
Sbjct  371  GGTGVPTGSGGAAGAAAPGEAAKPAEGTESGEAAKGDEAPEATGAIEAAERAKAEGATEA  430

Query  435  HLWLKLELSDFPNLSLAQKGS  455
                +    + P         
Sbjct  431  AKPAEASEEEKPKQDPEPPRP  451


>OGS26675.1 hypothetical protein A2297_04140 [Elusimicrobia bacterium RIFOXYB2_FULL_48_7]
Length=684

 Score = 82.5 bits (200),  Expect = 1e-13, Method: Composition-based stats.
 Identities = 67/496 (14%), Positives = 152/496 (31%), Gaps = 21/496 (4%)

Query  21   PAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRR  80
            P + +   C  C      +       +             + +      +   +     +
Sbjct  10   PDRPAEKYCAICGSDFCKECLVQMENEMVCV------ECAKTKKAPGWSKFSKEWKQLPK  63

Query  81   CNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSAL  140
                +    +   +   +   + +   A  + +     +   G  LL  ++A A      
Sbjct  64   FRLLYSTMFKLLEKGFWTLFGTYALGTAIQFIIIFGVFFIAGGGELLQKMIASANTPDDK  123

Query  141  LLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHV  200
            L+         + +            +   ++    ++       +      +  G   +
Sbjct  124  LVNEMIQFFKNSLSIFLTGGFVYFLSVYWTMATTLIAIDSVDKDKNTPFGSILLQGTIKM  183

Query  201  GSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGH  260
                ++ IL I +V  G ++LIIPG+LF VW+ F    +  +N   ++A ++SR L  G 
Sbjct  184  VPAMIIGILYIALVCLGFVMLIIPGILFGVWYLFVFSSIILENTHFVEAFKRSRKLTKGF  243

Query  261  WWAIFGRFVLLLVISLTLSFLTAR---IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKA  317
            +WAI GR+   +V++            IP VG   + A ++L+     ++YYLIY  L  
Sbjct  244  FWAINGRYGWFVVVAWLWMIAIGLIGKIPIVGPIVSQAGNILVGLLGLIFYYLIYQKLCD  303

Query  318  NYRGPQHPPIKRQWLPLTAAIFGW-MLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQ  376
                P     +   +P          +  G  ++ ++    +             +L   
Sbjct  304  INGAPGERKPEDDAMPKIETSILTKTMFKGGAVILIAAIISATAYFTVTFGWSYIKLAQL  363

Query  377  PQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHL  436
              ++ D+  +  +  +       K  L        +  +                 +P L
Sbjct  364  SSRSKDMAAATEQFSKVYDITLLKRALGSGNIYVKKAAIERLSEI----------GDPKL  413

Query  437  WLKLELSDFPNLSLAQKGSARIEIDKVLDDDA-RDLYDRQHSFEHPAFHWVGINQTDEND  495
                         +  K S  I + +V D  +   + +                  +++D
Sbjct  414  APIFVKMFDREKDIQMKSSIIIALCRVGDSTSVPKIREALKHPNPVIRISAAQALGEKSD  473

Query  496  LFSGIRSIYLRQGTQA  511
            L S    + +   +  
Sbjct  474  LASLEYIVQMLNDSDE  489


>HDN01387.1 hypothetical protein [Candidatus Bathyarchaeota archaeon]
Length=300

 Score = 79.8 bits (193),  Expect = 2e-13, Method: Composition-based stats.
 Identities = 29/197 (15%), Positives = 68/197 (35%), Gaps = 3/197 (2%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L+ I +LG   A  P       +          +              +   W    +  
Sbjct  97   LISIAILGSTFAGWPRPWRAPPQYGLGGLSFIGSAIILYAFLVFLIYTVLEGWTVAMVSQ  156

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +    + L   +   L  +    +  I++ + V  G +L +IPG++  V        + 
Sbjct  157  GLQGRPIDLSLGLSEALGKIVPLVIAGIVVSVAVTIGLILCVIPGIILFVLLALTTQAIM  216

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA---RIPYVGEAANLAFS  297
             D+   ++AL  S  +   +++ IF   ++  +    ++F+ +    +P V    +    
Sbjct  217  IDDHDAIEALSVSFNIGKNNFFTIFVALLIPGIAYFIVAFIFSEIIVVPVVSTIVSSLIG  276

Query  298  LLLTPFSFLYYYLIYSD  314
             L   +  +   +IY +
Sbjct  277  ALFMTYFAVVLTMIYYE  293


>SFB51261.1 hypothetical protein SAMN05216266_11559 [Amycolatopsis marina]
Length=286

 Score = 79.4 bits (192),  Expect = 2e-13, Method: Composition-based stats.
 Identities = 35/176 (20%), Positives = 61/176 (35%), Gaps = 21/176 (12%)

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
                ++T  +   +    V            +     L ++  L+V  G +LLI+PG+  
Sbjct  101  FLSGYLTVVIGKAVLGRPVRAGEVWTEIRPLLLPLLGLTVIYTLIVVVGLILLIVPGIWL  160

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP--  286
             V F      L  +      +L +SRLLV G WW  FG  +L  +I+  +S + +     
Sbjct  161  YVLFALATPALVLERGRIFTSLGRSRLLVRGSWWRTFGILLLAAIIAGVISMIISLPFEL  220

Query  287  -------------------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                               ++     +    +  PFS     L+Y D +    G  
Sbjct  221  LGDGASAFGNSTTLSTTGIWLSALGGVVAGAITYPFSAGVTALLYIDQRMRKEGMD  276


>NLM70207.1 hypothetical protein [Firmicutes bacterium]
Length=304

 Score = 79.4 bits (192),  Expect = 3e-13, Method: Composition-based stats.
 Identities = 36/298 (12%), Positives = 85/298 (29%), Gaps = 8/298 (3%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                CP+C        S L       +C        +   +       +   T       
Sbjct  5    IGTYCPYCQKILTEQDSVL----VCQKCHTPHHHQCWVENQGCAVVGCEGSLTHAGAVES  60

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                S +     + +         C    ++ ++  +G       L D+ +    R    
Sbjct  61   EAPRSKQCPNCGEAIPAAAVFCVHCKTMLQDLKSDSNGSLPAFATLIDAVKFGWNRTIQN  120

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            LG  +L  +   A       +   T                 +      ++     +F+ 
Sbjct  121  LGFLILMQLGLVAGGLVLAFVSSLTIYFIPAGVLF----SIGIFLFSSLVTVGVQRVFLK  176

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            I       +  +      +  F  + +L+      G    +IPGL+F  +      ++ D
Sbjct  177  IADNQPVSWADIFSASDRLLPFIGVGLLVGFGTAVGFFFFLIPGLIFAFFTMLAPIIVVD  236

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
              +G ++A++ S  LV  +    F  ++ + V+ +  +   +         +    + 
Sbjct  237  QPLGAVEAIKTSMALVLDNILLTFLLWLAVTVLGMLGALFFSLGLLFTAPISALTLIY  294


>WP_166080418.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Erysipelothrix sp. HDW6A]QIK56700.1 hypothetical 
protein G7059_01970 [Erysipelothrix sp. HDW6A]
Length=586

 Score = 81.4 bits (197),  Expect = 3e-13, Method: Composition-based stats.
 Identities = 40/470 (9%), Positives = 118/470 (25%), Gaps = 24/470 (5%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L    +L + + F  I  A+L++       + +     I       + + +  +   +F+
Sbjct  74   LTLFAVLCVTVLFVIIQFAILIRIVGDSFHEQKFNFKEIPRFIKRLVSVDILLIIPYVFL  133

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             I   +  +  +      H+  F +  I     +    L      L   +   F   +  
Sbjct  134  VIPNMNFAMSITYVSQF-HLPEFIVSSIFQNFALTLVYLAGNAIFLYLNIRLVFTPIIYV  192

Query  241  DD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF-LTARIPYVGEAA------  292
             +     + A+++S  +    +  +     +  +I++  S  +   IP +          
Sbjct  193  MNPEKRFIDAIKESWKITRKSFIKLGLLIFIGWLITMIFSVAILYGIPAISAILAMQFPS  252

Query  293  -----------NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
                       +    L++    ++   +    ++           ++ +     A    
Sbjct  253  IPRVILIAIDTSAIVLLIIAISMWMILSIQMIVVRYYEITESSEYHRKSYKRSGLAKILV  312

Query  342  MLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKL  401
            +       +            +    +               N             DY  
Sbjct  313  LTGITGFYILSFVYIYFDMGRMDEKSETLIISHRAESPGTIENTIEALTKVNEFKPDYVE  372

Query  402  LLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEID  461
            +  +  K                 +     Q+    L+        +           I 
Sbjct  373  IDVQITKDDEIVVYHDFNTKRLGSKNVKIAQSTLEELQSIELSKDGIKSKIPTFEEFVIK  432

Query  462  KVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYL----RQGTQAEQVHSI  517
                + +  +  +  +F +P       +  + +D+F       L        +A+  ++ 
Sbjct  433  AKELNQSLLIEFKAETFNNPLLTEKVTDILNRHDMFKMSTFQSLDKKTITEYEAKYPNAE  492

Query  518  LGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGD  567
             G +       +E L +    +  +         +       +      D
Sbjct  493  TGFILGINFGELEILNVDFYSLEDSSITDDVVQSIISNDKGLLVWTINDD  542


>PIW91097.1 hypothetical protein COZ91_02280, partial [Candidatus Nealsonbacteria 
bacterium CG_4_8_14_3_um_filter_39_7]
Length=181

 Score = 76.3 bits (184),  Expect = 3e-13, Method: Composition-based stats.
 Identities = 53/181 (29%), Positives = 84/181 (46%), Gaps = 3/181 (2%)

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
            +    +     I     ++G+  S +   R + S+  +  L   +V GGS L ++PG++F
Sbjct  1    MIWGQLALICAIKDSGENIGVIESYRRAWRKILSYWWVTFLTTSIVLGGSFLFVVPGIIF  60

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP--  286
             VWF    Y+L  ++I G+ AL KSR  V G W ++F RF+ L ++++      + I   
Sbjct  61   SVWFSMAVYILVAEDIKGMDALMKSREYVRGRWLSVFWRFLFLSLLAMIFLLPLSLISEF  120

Query  287  -YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIP  345
               G    L FSL   P S  Y +LIYS+LK+        P K +  P        +LI 
Sbjct  121  IPFGSFVELVFSLFFAPLSATYLFLIYSNLKSVKGDFVFEPAKIKKWPFILTAIIGLLIV  180

Query  346  G  346
             
Sbjct  181  P  181


>WP_152053729.1 hypothetical protein [Aquisphaera sp. JC650]
Length=725

 Score = 81.4 bits (197),  Expect = 4e-13, Method: Composition-based stats.
 Identities = 21/242 (9%), Positives = 58/242 (24%), Gaps = 13/242 (5%)

Query  397  ADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSA  456
            A   +   +       G   +    +             +           +  A     
Sbjct  352  AVTPVAGPRPWPVAFAGPFLIEVTGVEQQPSGTGTLGLRVLGVGLPPAITAMMAADPFHE  411

Query  457  RIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHS  516
             I I  +     ++L              +     D       +  +          +  
Sbjct  412  PIAIRSISGPGNQELLPDDRFPVTLGASELPEGVVDVTLDVPLVGLLR-----SVSAITE  466

Query  517  ILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRT-------  569
            ++G++E+ +P  I  +     + G   +  G ++ L+    +  TL              
Sbjct  467  VVGQVEIPVPTEIVRMSFDPIEPGAVREEEGLRVSLRGEMPHQATLELERLDESDPDMKW  526

Query  570  -DLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFEL  628
               + +   +   +P+             +S      G   ++ +  A +         L
Sbjct  527  HRYVFLLGRDRGGDPVEMEIGGGGGDEKEYSAFLSIRGEPVAMDLKFAKEVEPLVTDVRL  586

Query  629  TR  630
             +
Sbjct  587  EQ  588


>MBI4063975.1 hypothetical protein [Elusimicrobia bacterium]
Length=253

 Score = 77.9 bits (188),  Expect = 4e-13, Method: Composition-based stats.
 Identities = 38/169 (22%), Positives = 68/169 (40%), Gaps = 7/169 (4%)

Query  156  QWAILLATVAYILLGLSWMTGSMFIY----ICKTDVGLFRSMKLGLRHVGSFTLLLILLI  211
             W  +   +A  L G   +  +++ +           L       +  +   TL+ I   
Sbjct  86   FWLSVFLNLALFLWGQGAVYLALYPHPNNPSAGEPRALPAIALGAIGLILPLTLVNIFYG  145

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            ++V  G++LL+IPG+   V + F   +L  ++     +  +S  LV G WW +F R VL+
Sbjct  146  VLVLIGTVLLVIPGIWLAVKYSFAPLLLVTEDASAFSSFGRSSDLVKGCWWGVFIRLVLV  205

Query  272  LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
             + +L  S L   IP +G            PF      +I  +L+    
Sbjct  206  GLATLIASVLVGLIPVIGRVIAGVLLA---PFLNSCQLVILENLQEIKG  251


>PWL61158.1 hypothetical protein DBY37_07050 [Desulfovibrionaceae bacterium]
Length=327

 Score = 79.0 bits (191),  Expect = 5e-13, Method: Composition-based stats.
 Identities = 35/213 (16%), Positives = 78/213 (37%), Gaps = 15/213 (7%)

Query  134  APIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM  193
              I    L+         ++       + ++   L+    +    F  +    VG+   +
Sbjct  114  FVIGLVALIINIGLNLVSSRMGSLLGSIVSMVLNLMAGGAIAHVAFQELRGVRVGMKEGL  173

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
               +  +    L+ +L  + +G G +LL++PGL+    +         + +G L  + +S
Sbjct  174  SYAMGRLAPLFLVALLAGIGIGVGLILLVVPGLILACAWVVVIPACVVEKLGPLDCIRRS  233

Query  254  RLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP---------------YVGEAANLAFSL  298
              L  GH   I G  V++ +  L L F++  +                +      L  +L
Sbjct  234  MELTKGHRLIILGAVVIIGLAILALGFVSGLLSMGLLSAFSSAPYAALFFIALLYLVLTL  293

Query  299  LLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
            + +    L   +IY+DL++   G     + + +
Sbjct  294  IPSTMYSLLTAVIYADLRSMKEGVSRESLAKVF  326


>TDJ55075.1 hypothetical protein E2O47_04950 [Gemmatimonadetes bacterium]
Length=196

 Score = 76.0 bits (183),  Expect = 6e-13, Method: Composition-based stats.
 Identities = 31/180 (17%), Positives = 61/180 (34%), Gaps = 18/180 (10%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +    +G +     +        V  + ++   + ++    +L IL  LVVG G +  
Sbjct  6    LNIVLSAIGTAAAVFIVSDSYLGRPVDPWDALSRAVPYIARIVVLSILTTLVVGLGFIFF  65

Query  222  IIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            ++PG++F          L  + N   ++A+ +S  L  G  W +    V+  +I    S 
Sbjct  66   LVPGVIFLSALVISTQALVLEENRSPIEAMGRSWQLTKGFRWKVLALVVVTAIIVFIPSI  125

Query  281  L-----------------TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                               +    +         LLL P  +    ++Y DL+    G  
Sbjct  126  ALVSVASFLATEPAVLTDLSIGWSLALVLGAVVQLLLYPLMYSVLTVLYYDLRVRKEGFD  185


>PIT91442.1 hypothetical protein COU17_00420 [Candidatus Kaiserbacteria bacterium 
CG10_big_fil_rev_8_21_14_0_10_49_17]
Length=290

 Score = 77.9 bits (188),  Expect = 6e-13, Method: Composition-based stats.
 Identities = 44/193 (23%), Positives = 84/193 (44%), Gaps = 4/193 (2%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            ++ L  ++A     S ++   +  L           +L   A   L +      +     
Sbjct  89   LWTLIGIIAIPAFLSLVIGFGSVTLPSGKGTITLFAVLGVFALAFLQILAGIAIIRAVGD  148

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
                 +    +       SF  + IL+  +   G +LLIIPG++  V+     +++  ++
Sbjct  149  SEKREITTYFREAFPLFLSFLWVSILVGFLEVVGFILLIIPGIIISVYLSLAVFIVLFES  208

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI----PYVGEAANLAFSLL  299
              G+ AL+KS   V G WWA+FGR +++++IS  L  + A        +G+  ++   +L
Sbjct  209  ERGVDALKKSHSYVKGRWWAVFGRMLVIILISGILGGIGAFFSSKDAALGQVISVVIQIL  268

Query  300  LTPFSFLYYYLIY  312
            + PFS  Y Y +Y
Sbjct  269  VVPFSVAYMYFLY  281


>WP_114370738.1 hypothetical protein [Blastopirellula cremea]RCS43979.1 hypothetical 
protein DTL42_18520 [Blastopirellula cremea]
Length=563

 Score = 80.2 bits (194),  Expect = 8e-13, Method: Composition-based stats.
 Identities = 30/297 (10%), Positives = 77/297 (26%), Gaps = 17/297 (6%)

Query  346  GLLLVSLSRQNLSAEQLLSAGKDIQQRLGT--QPQQTPDLNRSLPEEPQRLSSADYKLLL  403
                 S  R +     + +    +  +            LN         ++ +  K  +
Sbjct  123  AGYHPSYYRYDPDNPSMTNWLNRVNNKPLEKRTDFDRQVLNHIDKNLDATIALSPGKYPV  182

Query  404  SKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKV  463
                          GP+ +       +  N    + +     P      +   +   +  
Sbjct  183  VDGLAHEYPSVTFAGPLRVQLTDVLQNVPNATGKVSVSFLRLPLPGKVVRHDGKDFYELP  242

Query  464  LDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGT--QAEQVHSILGKL  521
                   +  R +              +    +        L +G    A ++  + G+ 
Sbjct  243  W----ELVTLRGNPGRSVKESSRTSGWSPTKSILLSESRNLLLKGLLQNANELEELTGEF  298

Query  522  ELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRT---------DLL  572
             L LP A + + +    +GK+++ G  ++ +  L +  +       +             
Sbjct  299  TLLLPQAWKPIWIENPTVGKSIREGDYEVQIIELSTERLGFYIRCKKPPTPPFHDFCRQY  358

Query  573  NVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
             V A ++  E +      +Q     F       G    + ++   +     Y    T
Sbjct  359  RVFAFDAQGELISPDDAIYQGFFSGFGNFIPLHGKPALVAIVPITEVQEIKYQHRFT  415


>MBI4087172.1 hypothetical protein [Candidatus Kaiserbacteria bacterium]
Length=245

 Score = 76.3 bits (184),  Expect = 1e-12, Method: Composition-based stats.
 Identities = 45/179 (25%), Positives = 74/179 (41%), Gaps = 8/179 (4%)

Query  144  PATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSF  203
              T L          +    +  I+  ++ +    F            +      +   +
Sbjct  54   FFTELVALGSTLWILLSGLILGGIVSIIACIALIFFTADPSRYPSAASAYSHAKTYFFPY  113

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
              + +L  L + GG +LLI+PG++  VWF F QY    +   G +AL  SR LV G W A
Sbjct  114  LFVGLLTGLAIMGGMILLIVPGIVIAVWFAFSQYTCLLEEKNGFEALRASRTLVVGRWGA  173

Query  264  IFGRFVLLLVISLTLSFLTARI--------PYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
            +F R ++ + I +  +F    I              +NL   ++LTP S +Y YL+Y D
Sbjct  174  VFTRLLVFVAIYILATFAFMSIIDQAISDDALSAGLSNLLLVIVLTPISTIYPYLLYKD  232


>WP_158039999.1 hypothetical protein [Pseudoclavibacter chungangensis]KAB1659499.1 
hypothetical protein F8O01_06120 [Pseudoclavibacter chungangensis]NYJ67642.1 
hypothetical protein [Pseudoclavibacter 
chungangensis]
Length=620

 Score = 78.7 bits (190),  Expect = 2e-12, Method: Composition-based stats.
 Identities = 43/418 (10%), Positives = 88/418 (21%), Gaps = 63/418 (15%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             +    +L    +   +          L  +++   R V        L  LV     L +
Sbjct  130  ISFLGTVLVQGILVLVVTRGAVGERTTLGDALRRAWRRVLPLLGFAALTGLVPIVLVLGI  189

Query  222  IIPGLLFC-------------------------------VWFFFCQYVLADDNIGGLQAL  250
            I   ++                                 V   F    +  + +G   AL
Sbjct  190  IAIIVVPAALGADGMLVLGLGVLALLLGLTLAVGLLALYVKLLFTPSAIVAEGLGLRDAL  249

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISL-----TLSFL-TARIPYVGEAA------------  292
             +S  L +G +W I G  +L+ +I         + +  A + +VG  A            
Sbjct  250  VRSWQLTNGQFWRILGITLLVNLIVSTAASTVATVVQFAYLMFVGVFAPLGTDENQTVVL  309

Query  293  --------------NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAI  338
                              S +          L+Y D++    G        +      A 
Sbjct  310  VIVAIAFNLLLLLLQTVVSAVTLVLLAGNATLLYLDVRMRKEGLNLQLQAYRDATDRGAA  369

Query  339  FGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSAD  398
                      L                          Q                +  ++ 
Sbjct  370  PDEDPFRRSDLGPQPAPGAVPAGAGYPPGAPLVGGPQQFASPQQFASPQQFASPQQFASP  429

Query  399  YKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARI  458
             +     Q     +   +  P         +    P+       +  P  +    G+A+ 
Sbjct  430  QQPFAPPQPYGAPQPSGATSPYGPTRPTPPSGAAQPYGTTPPSGAAQPYGATQPSGAAQP  489

Query  459  EIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHS  516
             +                                     +   S  +  GT  +   +
Sbjct  490  YVPAQPYGGPARPSGAAQPPGAAQPSGAAQPYGVPRPFGATGPSDPVPPGTSPQTSGA  547


>PJA45116.1 hypothetical protein CO174_04745 [Candidatus Uhrbacteria bacterium 
CG_4_9_14_3_um_filter_50_9]
Length=258

 Score = 75.6 bits (182),  Expect = 2e-12, Method: Composition-based stats.
 Identities = 42/248 (17%), Positives = 87/248 (35%), Gaps = 0/248 (0%)

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            ++ +  +M           F  +K  L     F  ++ +  L++  G +  IIPG+L  +
Sbjct  1    MAPIALAMNKGSHLGFKSSFVHLKAQLPKARGFIWIMFIQSLLIMIGFVFFIIPGVLMAI  60

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            W+    +V   +   G+ ALE+S  L+ G+ W  F   + ++ + L +   +  + ++  
Sbjct  61   WWSQSLFVYLKEGKRGMAALERSYELMRGYGWYYFYMLIPVVFVILAIIVPSILVIFLLP  120

Query  291  AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLV  350
             + L   L  TP  ++YYY ++  +    +      I  ++       F    +  L  V
Sbjct  121  ISYLVLILAATPLMWVYYYHLFERIVRLNQQSARSMIAAKYKWSLVGAFVLYYVILLGSV  180

Query  351  SLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTT  410
             LS     +                        +   P E      ++ +L   +     
Sbjct  181  GLSSALWGSVLETKDSIGNDGSDSFNFDTWDTDDLYNPSETSDNELSEEELAEIEAWLEE  240

Query  411  SEGGLSLG  418
             EG     
Sbjct  241  IEGMSDEE  248


>WP_171809514.1 hypothetical protein [Corallococcus exiguus]NPD28632.1 hypothetical 
protein [Corallococcus exiguus]
Length=905

 Score = 79.0 bits (191),  Expect = 3e-12, Method: Composition-based stats.
 Identities = 29/177 (16%), Positives = 55/177 (31%), Gaps = 22/177 (12%)

Query  471  LYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIE  530
                          + G N +    + S      L+ G  A+ + +I G + + +P  I 
Sbjct  420  PLAASPDGMKVGEDFSGENAS-RGQVASHHAQFALKPGVDAQTLPAISGSVTVHVPTRIR  478

Query  531  SLQLTRNDIGKTLQ---------IGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHA  581
            S++L         +         + G ++ L  L      L   G    +L V A N+  
Sbjct  479  SIRLPFAAPFNRAEQTFPDGKLFVEGLRIHLGTL---TFQLGSEGTPPTVLAVRALNAEG  535

Query  582  EPLREIGFTW---------QKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
            + LR                      +     +G    I +++   +   + PF LT
Sbjct  536  KYLRTEARPGLTMVLSLPLFGVLSPDAPLFTVEGRPAQIELVLVDATTPYTRPFTLT  592


>MBF6612405.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Chloroflexi bacterium]
Length=489

 Score = 78.3 bits (189),  Expect = 3e-12, Method: Composition-based stats.
 Identities = 41/349 (12%), Positives = 80/349 (23%), Gaps = 21/349 (6%)

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT-LLLILLILVVGGGSLLLIIP  224
               L               + V L   + L    +G+    +      +     L L I 
Sbjct  141  IGPLFGIIGLQVAVWIAIASPVALLFIVGLTAAGLGASNAGVGFGAACLGLFLLLPLGIL  200

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS----F  280
                 V        +  + +G +QA ++S  LV  +WW   G  + L ++   +S    +
Sbjct  201  FFYIVVRLAVVIPAVMVEKLGPVQAWKRSWRLVMNYWWRTLGIIIALSLLGAVVSAGPAY  260

Query  281  LTARIPYVGE------------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
            L   +  +                 +   L+  P   +   L Y DL+    G       
Sbjct  261  LIQMLVLLAVKLDPVVSQALLGGVQILTGLVFVPLQLISMTLYYFDLRVRKEGFDIETAI  320

Query  329  RQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLP  388
             Q     A           +     + N       + G    Q    Q            
Sbjct  321  EQRYWPGAGGMMPQTAMAGVYPQYGQTNQGMIAPPALGYGAPQPTYQQGAYKQGGIADPI  380

Query  389  EEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNL  448
            E      +                            ++     Q P   + +    +   
Sbjct  381  EPVYGQPATIEGYAQPSGAPEAGNDTAPGNITPTAPEQALQAPQMPPPTVAVANRAYGQS  440

Query  449  SLAQKG--SARIEIDKVLDDDARDLYDRQ--HSFEHPAFHWVGINQTDE  493
            +       +   E +   D +       +     +          Q  E
Sbjct  441  TAPVPDWLARVREKEARADIEGSARIKSEAAAPEQTMEASDAPPEQRPE  489


>MBI2903216.1 hypothetical protein [Candidatus Methylomirabilis oxyfera]
Length=255

 Score = 75.6 bits (182),  Expect = 3e-12, Method: Composition-based stats.
 Identities = 34/201 (17%), Positives = 70/201 (35%), Gaps = 3/201 (1%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
              F         +   G +          L            +    I +     I +  
Sbjct  24   RSFDLYRRNFWPLTFFGGLFWIVLFVGPSLPNLEQEHYRVPYSIVAVISILAS--IQVSY  81

Query  172  SWMTGSMFIYICKTDVGLFRSMKLG-LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
              +    F  +    + +  +++   LR+V  F     L +++V  G L L+IPGL+  V
Sbjct  82   ICVIWGSFQALLDRRISIASALRQVSLRNVLRFVWTAGLSMVLVLIGVLALVIPGLVVGV  141

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
                   V+  +  GG QAL +S  LV G    +F   +++ + ++ ++     + ++G 
Sbjct  142  RLSMVAPVVVIEGKGGWQALVRSWHLVKGRSLGVFRSLLVVSLPTMLVNAANQLLGHLGL  201

Query  291  AANLAFSLLLTPFSFLYYYLI  311
                  + ++T        L 
Sbjct  202  VLEPLAATVITIVVLACSVLA  222


>HGE14881.1 hypothetical protein [Candidatus Parcubacteria bacterium]
Length=700

 Score = 78.7 bits (190),  Expect = 3e-12, Method: Composition-based stats.
 Identities = 64/379 (17%), Positives = 128/379 (34%), Gaps = 17/379 (4%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
              +S+ L+  R   L GI ++         F   +L    +      +  ++ +   +  
Sbjct  17   FKESFSLYRVRTKVLFGITIIA----VGVNFLGGVLLNYLFNTNIKYSLFFSFIWIIILL  72

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
              L +  +     I+  K ++G+  + + GL+   SF  +  L   +  GG LL +IP +
Sbjct  73   SSLFIWLLAIPSLIFAIKENIGVKEAYRKGLKIFPSFFWIYFLFNAIFVGGFLLFLIPAI  132

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            LF VWF    +VL  +   G+ AL KS+ L+SG +  +  R++   +  L +  L     
Sbjct  133  LFFVWFSLAIFVLVFEEKKGMDALFKSQHLISGKFLGVLRRYIGFGLFLLLILILVLSPL  192

Query  287  YVGEA-----------ANLAFSLLLTPFSFLYYYLIYSDLKAN-YRGPQHPPIKRQWLPL  334
            ++ +                  + LTP   +Y +LIY +L+      P   P K++ L  
Sbjct  193  FIFKVSEAKIEKISKAIGYLLQIFLTPLILIYGFLIYENLERIKKEIPYQEPSKKRKLKY  252

Query  335  TAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRL  394
               I    LI  L+                           +  +  +    L       
Sbjct  253  ILPIALGTLIFLLVFSLNFMNIFFGRDEPPPDDRDLWLSRIEIPKEENALYYLIPHFYLS  312

Query  395  SSADYKLL-LSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQK  453
                 ++       K + +         L  +     + +     +L   +   L   +K
Sbjct  313  QYEKEEVYKYWPDLKESKKIYWPAEKEDLIKNIISQKEWDEKFVKELLEENKEVLDSFEK  372

Query  454  GSARIEIDKVLDDDARDLY  472
                      +  D + ++
Sbjct  373  AVKSSYFQDPMTQDPKTVH  391


>WP_068790608.1 hypothetical protein [Phormidium willei]OAB55799.1 hypothetical 
protein AY600_08670 [Phormidium willei BDU 130791]
Length=246

 Score = 75.2 bits (181),  Expect = 3e-12, Method: Composition-based stats.
 Identities = 41/216 (19%), Positives = 83/216 (38%), Gaps = 12/216 (6%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
                 W +F       L + L+  +          +      +            +  + 
Sbjct  17   YFDIGWNIFKDNFSNFLVLALVIDLPLAIIQLFVPVSDNPEEMANPGDLLALGFAVLLIV  76

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
              ++ +          +    + +  +++ G   +    L++I+  ++V  G +LLI+PG
Sbjct  77   LSIVSVLSALILTESAVLNRPIEVASAIQQGFSRLIPSLLVVIISTILVAIGLILLIVPG  136

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV-----------I  274
            L      +F  Y +A    G L AL  SR LV G WW IFGR VL+ +           +
Sbjct  137  LYIANLLYFVLYAVALRGQG-LDALSYSRDLVKGQWWKIFGRSVLIGLGFGAVTFVVMVV  195

Query  275  SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
                S + ++IP+  E  ++  S++    ++L+  +
Sbjct  196  FGFASLVVSQIPFAAELVSVVGSVISGLLAYLFVAI  231


>MBE6446274.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=789

 Score = 78.3 bits (189),  Expect = 4e-12, Method: Composition-based stats.
 Identities = 52/513 (10%), Positives = 119/513 (23%), Gaps = 37/513 (7%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                    I+   I               +        +    +   T          + 
Sbjct  215  WAIDYWQVIFSQIISPLLEFGIVFGNQIISGTGVQSTASSVSTLGGGTTYITPSCTGVVA  274

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
              +   +C   +G   ++ L L    +     +          L       +  + F F 
Sbjct  275  AGLDKGVCDAILGFLSTISLTLVTWMAVGATFLAECWGKVLYVLPDFNMLFMGLIIFIFA  334

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP----YVGEA  291
              +     +  + AL +                ++ +V+   L      +P    Y G A
Sbjct  335  FMIYVSFPLKLIDALFR----------------LMFVVVLFPLWCAFWVVPQTRKYFGTA  378

Query  292  ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVS  351
             N+  ++L T  S     ++   + A+             L           I       
Sbjct  379  VNMFLNVLATFLSASVILVMIVSILASLFNGIDMRRIISLLKEGETNTAMQQIDFSTSGL  438

Query  352  LSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTS  411
                 L         K          Q +  +  ++        +A  KL    +  T  
Sbjct  439  FYAICLLYMAFHLVSKVDYFANIFVKQDSMGIGGAVSGAVTAGITASPKLYGMAKNATKW  498

Query  412  EGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDL  471
              G +       A+R  A   N ++            S     + +              
Sbjct  499  GVGKTKQIAGGIANRSRAKSYNNNVPGGTARQGPRLWSANPITAVKDWNQARKSLGGYAR  558

Query  472  YDRQHSFEHPAFHWVGINQTDENDLFSGIRSI--YLRQGTQAEQVHSILGKLELTLPLAI  529
                   +         +QT  N  +   +++   +++ T +      +G  ++    ++
Sbjct  559  TGDVSVGKVQKAADGSTSQTYSNATYDKDKNLLARMKETTSSSGNKVNIGGKKVPEKQSV  618

Query  530  ESLQLTRNDIGKTLQIGGK------------QLILQRLGSNAVTLRFLGDRTDLLNVHAS  577
            E  +   ++ G                       ++    N V +               
Sbjct  619  ERERTKFDENGNVKGTANVKKAFENGQAVRKTTTMKDKDGNRVDISVDKKTGLATKTK-Y  677

Query  578  NSHAEPLREIGFTWQKSGDAFSLRQMFDGNIES  610
            ++    L          G+        DG +E 
Sbjct  678  DASGNKLSTETRARG--GERTLETYKPDGTVEK  708


>MBE0521809.1 DUF975 family protein [Candidatus Methanoperedenaceae archaeon]
Length=540

 Score = 77.9 bits (188),  Expect = 4e-12, Method: Composition-based stats.
 Identities = 45/277 (16%), Positives = 86/277 (31%), Gaps = 13/277 (5%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
            +   + +   T            IL+     + L +S       I I       FR    
Sbjct  23   LILFISIMILTGFISVVTENSSDILILIGGILSLIISMGIIKTSIKITDNLNVRFRDFFS  82

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
                   + +  I  IL+V  G  LLIIPG++  + F F  Y + D  IG + AL+KS  
Sbjct  83   SSPLFLKYLISSIFYILMVYVGLFLLIIPGIILAIRFQFYAYFIIDKEIGPIDALKKSFS  142

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
            L  G  W +F               L   I  +G  A +    +  P ++L    +Y  L
Sbjct  143  LTKGALWELF-----------VFDLLLIGINIIGLFALIVGLFVTIPLTYLANAFVYRKL  191

Query  316  KANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGT  375
                +      I      +         +   ++  ++++  +     +   D    +  
Sbjct  192  L--NQTKLDNTIVEITENVEYNPIVIPAVEPEIIPIINKEMENRFNEETFVSDTTNDVIG  249

Query  376  QPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSE  412
                      ++  + +  +     +   K      +
Sbjct  250  IKIAFTYKGATIQYKVKVENPTPEPIADIKVTLYVPD  286


>MBI2166473.1 zinc ribbon domain-containing protein [Chloroflexi bacterium]
Length=262

 Score = 75.2 bits (181),  Expect = 4e-12, Method: Composition-based stats.
 Identities = 40/205 (20%), Positives = 80/205 (39%), Gaps = 3/205 (1%)

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
               F  I     +                +L+  V +++LG      ++        + +
Sbjct  54   FWPFLAISFVASIPALLLNVVPQWWLSLVLLVPAVFFLVLGEGAAIHAVGRQRLGHAIDI  113

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW--FFFCQYVLADDNIGGL  247
              + +   +  G   +  ++ +L++ G  L+++   LL   W  F F    +  +  G  
Sbjct  114  QAAYERAWQKAGPLGIATLVFLLLLLGLILIVVGIPLLPFFWVSFAFYVQAIMLEGKGAT  173

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN-LAFSLLLTPFSFL  306
             AL +SR LV G WW +FG  V+L +I + LS   A   ++   +N     +L+ P  ++
Sbjct  174  AALGRSRQLVRGGWWRVFGICVVLFLIVVGLSAALAIPGFLVSLSNETLAGVLVNPIIYI  233

Query  307  YYYLIYSDLKANYRGPQHPPIKRQW  331
               L+Y DL+          +  + 
Sbjct  234  GLTLLYLDLRVRKESYGLETLASEM  258


>TMM16922.1 hypothetical protein E6F98_00490 [Actinobacteria bacterium]
Length=239

 Score = 74.8 bits (180),  Expect = 4e-12, Method: Composition-based stats.
 Identities = 24/160 (15%), Positives = 51/160 (32%), Gaps = 2/160 (1%)

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
            +    +          +G+      +  L V  G +LL++PGL+    +     ++  + 
Sbjct  77   RAPAPINVYYDRTRGRLGTLVGASFMYGLGVFFGLILLVVPGLIAIARWSLIVPLVMIEG  136

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-YVG-EAANLAFSLLLT  301
            +G   A  +S  LV      +    ++  VI   +  L   +P ++G          +  
Sbjct  137  LGWRDAFRRSSELVRDRTGRVLVLVIIANVIKGIVGSLFGVLPGFIGAWIGGTIAGAVAV  196

Query  302  PFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
            PF      ++Y  L             + W  +       
Sbjct  197  PFEAHVLTVLYYRLTEPDVPILPEEGGKSWQSIWDEEHSQ  236


>NOZ86130.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=309

 Score = 76.0 bits (183),  Expect = 4e-12, Method: Composition-based stats.
 Identities = 34/197 (17%), Positives = 71/197 (36%), Gaps = 19/197 (10%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
               +    +    +T +  + +   +V +  + K  ++    +   +++  L    G  L
Sbjct  99   PFYLIVQPIVAGTITFTANLRLNGEEVSISGAYKGFMKIFWGYLSAVVMYSLAWILGGFL  158

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            L +PG+L  + F   Q     +   G +AL +S  L   HW  +F   +LL +IS  +S+
Sbjct  159  LFVPGMLAAIVFSLTQESAVVEKAYGTRALSRSWELTRKHWGKVFLLGLLLFMISAVISY  218

Query  281  -------------------LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
                               LTA    +    +    + + PF  + +  +Y DL++  R 
Sbjct  219  GIKSIADFALGVDSNNAEGLTAWAMALSGFVSSLVQIAILPFWHISWLFLYYDLESTTRE  278

Query  322  PQHPPIKRQWLPLTAAI  338
                 +    +      
Sbjct  279  MDVRNLAESLVKALPGR  295


>NIO19370.1 hypothetical protein [Candidatus Aenigmarchaeota archaeon]
Length=294

 Score = 75.6 bits (182),  Expect = 4e-12, Method: Composition-based stats.
 Identities = 39/230 (17%), Positives = 74/230 (32%), Gaps = 17/230 (7%)

Query  75   TVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFA  134
                         QP +           + +  ++SW  F       L +  +  +++ A
Sbjct  28   CGYNYNFTIQTEKQPWQHKFYRSYTDMGVKEFFSNSWRTFGANWSTFLMLAAVPPLISSA  87

Query  135  PIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMK  194
             + +                    + +  +   +L    +  +         +G+  S  
Sbjct  88   LVLAKASEVVIV------------LTIVNLIVWILTSMALIMAAHRTSDGEAIGIGESYT  135

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR  254
            L L     +    IL  L+V GG LL IIPG+++ V +    Y +  + IGG  A  +S+
Sbjct  136  LSLGLFWRYIWTGILYFLIVLGGLLLFIIPGIIWSVQYVLAPYAVIVEGIGGRDAFSRSK  195

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
             L       IF   +   +         + +  V     L    L  PF+
Sbjct  196  ALTKDRRGNIFFLELGFGLFFFL-----SIVIPVTLLILLVGVALGNPFT  240


>WP_130023871.1 hypothetical protein [Emticicia sp. 17J42-9]RYU92967.1 hypothetical 
protein EWM59_24440 [Emticicia sp. 17J42-9]
Length=533

 Score = 77.5 bits (187),  Expect = 5e-12, Method: Composition-based stats.
 Identities = 31/220 (14%), Positives = 70/220 (32%), Gaps = 13/220 (6%)

Query  367  KDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADR  426
               +     +  Q  ++      +P    +    ++  K   T          + LF + 
Sbjct  20   AQHRNLSKEEKIQAVEMISDFFGKPTPSKTPVPDVISEKGMPTNLNSADFDKKLALFTNS  79

Query  427  FWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWV  486
            F+  + +         +D  +L        +   +K    +   L               
Sbjct  80   FYFKESSFRANEARIEADVFHLPNMDLVKGQFVWNKATAKEGGTLEIE---------PKP  130

Query  487  GINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIG  546
              +     D F+ + +   + G     +++I G    T P     +  T+ DIGK  +I 
Sbjct  131  DPSMMPSMDFFNEVVTYKPKDGKT---LNTIEGTFSFTYPTEFVQITFTKADIGKEKEID  187

Query  547  GKQLILQRLGSNAVTLRFL-GDRTDLLNVHASNSHAEPLR  585
            G+++ +  + ++              L+ +A N+  EPL 
Sbjct  188  GERVKILSIENDIALFEIQRKTEESRLSYYALNAKGEPLA  227


>MBI5106072.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Solirubrobacterales bacterium]
Length=278

 Score = 74.8 bits (180),  Expect = 7e-12, Method: Composition-based stats.
 Identities = 29/167 (17%), Positives = 61/167 (37%), Gaps = 10/167 (6%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
            ++    L    +   +        V    + + GL         ++L  + +  G    +
Sbjct  79   SLVTTPLVTGMLVKVLLDDAAGRPVSAGAAGRTGLDAFAPLLGTVVLAGIGIVLGLAAFV  138

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            +PG+   V +F     +  +      AL +S  LV+GH W + G  +++ +++  LS L 
Sbjct  139  VPGIFLAVRWFVAAPAVVVEGRAPTDALRRSWDLVTGHGWWVLGVVIVVTLLAGVLSLLV  198

Query  283  ARIP----------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
            +              V         ++  PF+ +   L Y  L+A +
Sbjct  199  SLPFGAAADAADSQAVALVGTTLAEVVALPFTAIATTLAYFTLRARH  245


>MBF0618578.1 hypothetical protein [Candidatus Omnitrophica bacterium]
Length=236

 Score = 74.0 bits (178),  Expect = 7e-12, Method: Composition-based stats.
 Identities = 38/184 (21%), Positives = 70/184 (38%), Gaps = 7/184 (4%)

Query  144  PATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSF  203
             +   +P  +     ++ A+    +L       +M   +    V +   +      +   
Sbjct  41   LSVSPDPLMRGVALMLIPASAGVYVLFYLMTLIAMSKILDGEAVDVPEMVAAAQGKLWRA  100

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
                 L  L+   G  LLIIPG+ F V F F  YV+  ++     + ++S  +V G++W 
Sbjct  101  FGAYALFFLIFCAGFFLLIIPGIYFGVIFSFFFYVILFEDRRVWDSFKRSEEIVKGNFWK  160

Query  264  IFGRFVLLLVISLTLSFLTARIPYVG-------EAANLAFSLLLTPFSFLYYYLIYSDLK  316
            +FG   L+++IS  +        ++          A    S  L PF   +YY +Y  L 
Sbjct  161  VFGAHALVMLISAAVFGPVWGAMFLLGASKAWQAVATTVASAGLMPFFCCFYYQLYRALS  220

Query  317  ANYR  320
                
Sbjct  221  KRAD  224


>MBJ7342504.1 hypothetical protein [Solirubrobacteraceae bacterium]
Length=144

 Score = 71.3 bits (171),  Expect = 7e-12, Method: Composition-based stats.
 Identities = 27/143 (19%), Positives = 62/143 (43%), Gaps = 4/143 (3%)

Query  201  GSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGH  260
                LL I+  +++  G   LIIPG++  + +F    VL  ++ G   ++ +S  L  G+
Sbjct  1    MPLLLLGIVTSILITIGFFFLIIPGIILALMWFVAVPVLVAEDKGVFASMSRSSELTKGN  60

Query  261  WWAIFGRFVLLLVISLTLSFLTARI----PYVGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
             W +    V++ V+   ++F+   +    P +G    +  +++  P+  +   + Y +L 
Sbjct  61   RWRLIWLVVIIYVLLFIIAFIVGLLGAITPILGVIGGVLIAIVAYPYLAIIIAVTYYELL  120

Query  317  ANYRGPQHPPIKRQWLPLTAAIF  339
              +   +  P      P +  + 
Sbjct  121  EAHGESKPGPGVATMAPPSGEMP  143


>HIF31776.1 hypothetical protein [Planctomycetaceae bacterium]
Length=409

 Score = 76.3 bits (184),  Expect = 8e-12, Method: Composition-based stats.
 Identities = 58/381 (15%), Positives = 109/381 (29%), Gaps = 54/381 (14%)

Query  3    TVRCPHCGAERNTPSS-------------------KLPAKKSSARCPECCQTLIFDPAES  43
             VRCP+C  +   P                     KL   ++ +  PE  +       ES
Sbjct  25   LVRCPNCQTQLRIPDQTGGTAEDPESTNLKKNTNPKLDPDRAPSFEPETLEGPPVPGPES  84

Query  44   QRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCR------------RCNRSFCLQPER  91
            +    T    T P  GL    PS   EI     +                      +  R
Sbjct  85   ESDINTPRGQTLPPNGLDVAAPSTETEIGFGMEDVELELTLQPLESAGSTQEPVRQKITR  144

Query  92   EFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQ  151
                    L     +LA +W++F +    +L    L   +          L   T     
Sbjct  145  AGSYRRVKLEYADDVLAQAWKIFSKNARPILTAAFLFWFIPVLVAQPLGSLIALTDHVSI  204

Query  152  NQNWQWAILLATVA-----YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL  206
                     +            L    M   +      +   L +S +     +      
Sbjct  205  GSLNFSYRRIVLSLAIPAVVSFLAQPVMIYIVAGAYVGSVPTLAQSWEFIKPRIPKLVGT  264

Query  207  LILLILVVGGGSLL----------------LIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
            + L+  V+   + L                L IP +   +        +  + I G+ AL
Sbjct  265  IALVFAVMACWTALESCVLLVVGGKLGLYVLFIPHMFVTLSLVMVIPAVVCEGISGIPAL  324

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSF--LTARIPYVGEAANLAFSLLLTPFSFLYY  308
            ++S  L+  HW  ++  ++LL ++S+ ++          +   A++A    +  F     
Sbjct  325  KRSFTLMRAHWMMLWAIYILLGIVSIVIASFTAVMLPSLLAPIASVAIQAGMAAFITTVL  384

Query  309  YLIYSDLKANYRGPQHPPIKR  329
              +Y   +A         I++
Sbjct  385  VSLYFSSRATMEEFSVASIRQ  405


>WP_166233540.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Propioniciclava sp. HDW11]QIK72466.1 hypothetical 
protein G7070_09565 [Propioniciclava sp. HDW11]
Length=357

 Score = 75.2 bits (181),  Expect = 1e-11, Method: Composition-based stats.
 Identities = 31/238 (13%), Positives = 64/238 (27%), Gaps = 17/238 (7%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
             ++   ++ L++  +   V   +   ++A++ +    AL++S  L  G WW  FG  +++
Sbjct  120  FLIVVATIALMVAAVWVSVRITYALVIMAEEGLKAWDALKRSFALTKGAWWRTFGYQLVI  179

Query  272  LVISLTLSFL-----------------TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
             ++   ++ +                    + ++G       S+ L P S+++  L+Y  
Sbjct  180  GLVVGIITSIPQGMITGAAQSAAQGQGAGAMGFIGGLLAFVLSIALIPVSYIWIALMYLG  239

Query  315  LKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLG  374
                              P  A         G            A           +  G
Sbjct  240  RTREVANAGAGYPVGPSFPGGAPWNDPGQPFGQTPGQYDDGARYAADPQQPYPGAPEAPG  299

Query  375  TQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQ  432
                            P +    D       Q     E     G      D F     
Sbjct  300  QYGAPGQYDASGQYGAPDQPGQQDPFGRPDAQPGQGGEQKPGEGGSPTDDDFFGRPKN  357


>NCO75413.1 hypothetical proteinNCO78273.1 hypothetical proteinNCQ02910.1 
hypothetical proteinNCQ41186.1 hypothetical proteinNCS83499.1 
hypothetical protein
Length=223

 Score = 72.9 bits (175),  Expect = 1e-11, Method: Composition-based stats.
 Identities = 35/190 (18%), Positives = 76/190 (40%), Gaps = 8/190 (4%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
                   + L    +   +  +    +L+ +V   L   +         + + ++ + R+
Sbjct  25   VYFPLLIINLPDLVFSLIEGFSEVSYLLVISVILSLWYTATGYIYYHKVLNEQNITIGRA  84

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
             K  L  +G+  + +I++ L+V  G LL IIPG+      +   YV+  +N   +Q +++
Sbjct  85   FKASLSKIGNVLIAIIIISLIVFIGFLLFIIPGIYIANRLYLTLYVIVIENCSVIQGIKR  144

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY--------VGEAANLAFSLLLTPFS  304
            S  L  GH   IF     ++ I + + ++   I +        +          L +P  
Sbjct  145  SWELTKGHSRFIFLTLGQVIGIPILIVWIVFLIIFRDVETVENITNLLGSVIRYLWSPIL  204

Query  305  FLYYYLIYSD  314
            F+   L+Y+ 
Sbjct  205  FVSITLVYNR  214


>EQB62434.1 hypothetical protein RBG1_1C00001G0013 [candidate division Zixibacteria 
bacterium RBG-1]
Length=225

 Score = 72.5 bits (174),  Expect = 2e-11, Method: Composition-based stats.
 Identities = 40/202 (20%), Positives = 79/202 (39%), Gaps = 5/202 (2%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            I    ++            +   +   +     +   +  +    +  + +T  +  Y  
Sbjct  10   ILSFYLLPKIPLSDPETSQELPRFWILKQLPSMFIYSILGMILTFIKGALVTWMVSCYCL  69

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVV--GGGSLLLIIPGLLFCVWFFFCQYVLAD  241
              D  + +S +  LR  GS     IL IL V     ++  I   + F V + F       
Sbjct  70   GKDAEIGQSFQFALRRFGSLAGAAILYILGVTFLFITIFGIPFSIYFLVSWGFAVQTCTL  129

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            +++G  ++L++S  LV  HWW I G  ++ L+IS  +      +P+VG+      ++L  
Sbjct  130  EHLGPKKSLKRSSWLVEDHWWRIVGIMLVFLLISAPIYIGFHFVPFVGK---YIGAVLAG  186

Query  302  PFSFLYYYLIYSDLKANYRGPQ  323
            P   +   L+Y +L+    G  
Sbjct  187  PLFVIAGILLYFNLRVRKEGYN  208


>WP_156892035.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Gulosibacter molinativorax]
Length=595

 Score = 76.0 bits (183),  Expect = 2e-11, Method: Composition-based stats.
 Identities = 38/368 (10%), Positives = 93/368 (25%), Gaps = 39/368 (11%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
                L ++     + +  + +                   +  +I++V   S+ + +  +
Sbjct  174  FWPVLGYLGLYFAVILVFSAIFALLIFWAISLGTNENFGGMAAVIILVILLSVGVTVLAI  233

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
                   F    +  +  G ++A+ +S  L +G++W  FG  VL+ +I   +S + + + 
Sbjct  234  WVFTKLTFVMPAIVLETKGPIEAIRRSWTLSNGYFWRTFGIIVLMYLIVQVMSQVISGVF  293

Query  287  ---------------------------------YVGEAANLAFSLLLTPFSFLYYYLIYS  313
                                             ++    ++  + +          ++Y+
Sbjct  294  SLFTTFAPTFFIPTGDVATGQEAGFIALMLVFLFITILLSVLVAAIGQVLLNGSAVIMYT  353

Query  314  DLKANYRGPQHP------PIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGK  367
            DL+    G                 P         L P  +  + S    +      A  
Sbjct  354  DLRMRKEGLNIHLQNAVEEYSAGREPEQDPWLAPDLGPTTVPGTFSAGAGAGAGAYGAAG  413

Query  368  DIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRF  427
                  G           +   +     +   +    +                      
Sbjct  414  YGAAGYGAGGYGAGGYGAAGYPQQGYPQTVYGQAGYQQGSPQPGYAQPGHQQPGYQQADP  473

Query  428  WADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVG  487
              D   P          +P    +Q+G A  +       D R  +  +      +  W  
Sbjct  474  QQDAPQPGYAQPGYQQGYPQQDASQQGYAAADSYPQPQSDERPSHLPEQQAPWVSQPWPE  533

Query  488  INQTDEND  495
             + T   D
Sbjct  534  ESSTQRVD  541


>OGZ78234.1 hypothetical protein A2358_04180 [Candidatus Staskawiczbacteria 
bacterium RIFOXYB1_FULL_37_44]OGZ82998.1 hypothetical protein 
A2416_01890 [Candidatus Staskawiczbacteria bacterium RIFOXYC1_FULL_37_52]
Length=225

 Score = 72.1 bits (173),  Expect = 2e-11, Method: Composition-based stats.
 Identities = 42/148 (28%), Positives = 73/148 (49%), Gaps = 4/148 (3%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
            A +   +    W   ++F  I + ++    S+ +    +  F  + +L+ L V GG +L 
Sbjct  74   AGLFAAIFFGLWARVALFFSIKEQELDFKNSLSVSWPKMWQFFWVSLLVGLAVLGGFILF  133

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            +IPG++F VWF    +V   + + G  AL++SR LV G+WW +FGR  LL ++ + +S  
Sbjct  134  VIPGIIFSVWFCLAMFVFVAEGLKGTSALKRSRQLVQGYWWPVFGRLALLGILIMLISS-  192

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYY  309
               I + G   N+ F+         Y Y
Sbjct  193  ---IKFFGPIINIFFTAPFAVAFEYYLY  217


>WP_161390382.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Altererythrobacter xixiisoli]
Length=204

 Score = 71.7 bits (172),  Expect = 2e-11, Method: Composition-based stats.
 Identities = 29/146 (20%), Positives = 51/146 (35%), Gaps = 3/146 (2%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                  L  L       +I +             G R    F  L I++ + V  G +LL
Sbjct  36   LGGLVQLGVLIAAVIVGYIVLETMLKQAGLMAYSGPRRFLPFLGLSIVVGVGVILGMILL  95

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I+PG+   V +   Q  L     G   A++ S  L  G+   IF   + L ++ + ++ +
Sbjct  96   IVPGVYLAVRWSLAQPRLVGRGDGVFDAIKASWKLTDGYAMRIFLCILALGLVVVVIAII  155

Query  282  TARI---PYVGEAANLAFSLLLTPFS  304
               +     +G       +  LT   
Sbjct  156  AGLLGPTNIIGILIGQLANYALTMVM  181


>OGK18868.1 hypothetical protein A2799_02280 [Candidatus Roizmanbacteria 
bacterium RIFCSPHIGHO2_01_FULL_39_24]OGK26257.1 hypothetical 
protein A3D80_00915 [Candidatus Roizmanbacteria bacterium RIFCSPHIGHO2_02_FULL_40_13b]OGK48892.1 
hypothetical protein A3A56_01675 
[Candidatus Roizmanbacteria bacterium RIFCSPLOWO2_01_FULL_40_32]OGK57559.1 
hypothetical protein A3H83_01910 [Candidatus 
Roizmanbacteria bacterium RIFCSPLOWO2_02_FULL_39_8]
Length=417

 Score = 74.8 bits (180),  Expect = 2e-11, Method: Composition-based stats.
 Identities = 38/354 (11%), Positives = 114/354 (32%), Gaps = 6/354 (2%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +  +  +          +      ++        +    +L  +  +++G +  + ++ +
Sbjct  45   ICVLLFVISGAGATLFKTGASGLLSSITPSVFFGFGAVFVLFFLVMLVIGSAVSSAAILL  104

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
            +  +  + +  + + G+  +   T + ++ + +  GG  + + PG++      F  + + 
Sbjct  105  FDSEGKLSVLEAFRKGITLIIPLTAVGLIHLFLGMGGFFVFVFPGIIISYLLAFSSFEVI  164

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA----NLAF  296
             +    ++A+++S  +VS ++  IF R+ +L+++   ++F+      +G  +        
Sbjct  165  LNGKRPMEAIKRSVSIVSHNFGDIFVRWFVLILVYFGIAFVL--PGLLGAISKELKIYVT  222

Query  297  SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQN  356
             L      FL +Y++  ++    +       K         I   +     LLV  +   
Sbjct  223  GLSFIVNMFLGWYILAFNVTLYKQFKDLAHSKDVKSITWMWIVAIIGWIIALLVFFAGWK  282

Query  357  LSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLS  416
            L              +  + P   P  + +    P   S        ++      E    
Sbjct  283  LITSPAFLDAFTKNVKTNSAPYAFPTTSHTQVPLPTAGSYQPASSNCTQYPIREGEFTSD  342

Query  417  LGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARD  470
                           Q       +      ++++   GS   +     D   + 
Sbjct  343  KCYSAKDYSDLLYYLQRYDSAASVYNGAIASMNITCNGSEFFKDACARDTTDKT  396


>MBI4415174.1 hypothetical protein [Candidatus Kerfeldbacteria bacterium]
Length=338

 Score = 74.0 bits (178),  Expect = 2e-11, Method: Composition-based stats.
 Identities = 47/196 (24%), Positives = 83/196 (42%), Gaps = 7/196 (4%)

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
               +L +     S+         G    ++     V +     IL  LV  GG+ L +IP
Sbjct  136  LSGILTIMSNAASVLALAGNGQEGASALLRRAYGMVWAIVGAGILAGLVTFGGTALFVIP  195

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL-SFLTA  283
            G++  V   F   V+  +N  GL AL +S  LV+  W+ +FGR +LL ++   + +  T+
Sbjct  196  GIIVGVGLLFTTMVVVLENRRGLAALRRSMALVNPRWFGVFGRDLLLALVVWIVTAIATS  255

Query  284  RIPYVG------EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAA  337
             + ++            A + L+TPFSF Y+YL++   ++          K+        
Sbjct  256  ILSFILRGTVANIIVAFAVTTLITPFSFAYFYLLFQGARSVAGETSGAEQKQNTWIKVLL  315

Query  338  IFGWMLIPGLLLVSLS  353
            I G + I  L   ++ 
Sbjct  316  IVGVITIIILPTYAIV  331


>NHV97722.1 hypothetical protein [Thaumarchaeota archaeon]
Length=222

 Score = 72.1 bits (173),  Expect = 3e-11, Method: Composition-based stats.
 Identities = 43/207 (21%), Positives = 79/207 (38%), Gaps = 1/207 (0%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            + +  R    L+   +  ++       S L L              + +LLA      + 
Sbjct  16   FNILLRNPKILIPQLISSLIFTIMSGLSILQLLNGLGGWFLLVPLTFLLLLAGFIVYTVV  75

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
                   +   +      L  +    L+ + S     IL+ ++V  G +L I+PGL+F  
Sbjct  76   SGMYPLMVKDVVNGVSPNLTLAANTALKKLISLIAASILVSIIVAIGLVLFIVPGLIFLT  135

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            WFF+    L  ++ G L+A+  SR              +LL + +L   F  + IP+VG 
Sbjct  136  WFFYTIPALMLEDKGALEAMTASRYFGRDKKLNTLAVIILLGLAALVGGF-FSLIPFVGP  194

Query  291  AANLAFSLLLTPFSFLYYYLIYSDLKA  317
              +   SL++T ++ +    IY     
Sbjct  195  IISFIISLVVTAWASITQSYIYIKYTK  221


>PTL75252.1 hypothetical protein DAT35_55950 [Vitiosangium sp. GDMCC 1.1324]
Length=382

 Score = 74.4 bits (179),  Expect = 3e-11, Method: Composition-based stats.
 Identities = 38/251 (15%), Positives = 79/251 (31%), Gaps = 9/251 (4%)

Query  78   CRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIF  137
            C  C R+       +    G   R    +     + F               ++      
Sbjct  112  CCCCARAGGSPLLMQGCVMGLAARRCEMVTEILRDSFSLYRRFAGRFIATAAIVFAGLDL  171

Query  138  SALLLKPATWLNPQNQNWQWAILLATVAYI--LLGLSWMTGSMFIYICKT-DVGLFRSMK  194
             + L    +          W I+      +        +  ++        D+ + +  +
Sbjct  172  FSALSNVESRRGHTAGAVFWGIIALVAWVVGSFWIQGAIVEAVEDVRDGRADMTIGQLYE  231

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR  254
                 + +     +L  L +  G LL IIPGL     +     V+  +     +A  +S 
Sbjct  232  RVRPRLATLIGAGLLAGLGIAFGLLLFIIPGLYLLTRWIVLAPVIIIEKRSISEAFSRSN  291

Query  255  LLVSGHWWAIFGRFVLLLVISL----TLSFLTARIPYVG--EAANLAFSLLLTPFSFLYY  308
             LV G  W +F   ++  ++++     +  + + +P        NL    L+ PF  + +
Sbjct  292  NLVRGDGWEVFALVLITGILTVLAQAIIGAVFSWLPLFLDVWIGNLIAHSLVMPFVLIVW  351

Query  309  YLIYSDLKANY  319
             L+Y  L A  
Sbjct  352  TLLYHQLVARR  362


>KXA89225.1 hypothetical protein AKJ57_05660 [candidate division MSBL1 archaeon 
SCGC-AAA259A05]
Length=135

 Score = 69.4 bits (166),  Expect = 3e-11, Method: Composition-based stats.
 Identities = 27/114 (24%), Positives = 54/114 (47%), Gaps = 1/114 (1%)

Query  200  VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
              +F LL IL+  +   G L+ ++P +   V           +++  ++ L++S     G
Sbjct  16   FLAFILLAILVSPIGIIGGLIWLLPMIYIGVRLSLYAQACVIEDLRPVECLKRSWRTTKG  75

Query  260  HWWAIFGRFVLLLVISLTLSFLTARIPYVG-EAANLAFSLLLTPFSFLYYYLIY  312
            ++W IF   ++L +I   +S     IP +G    +L  +L + P + + + LIY
Sbjct  76   NFWRIFAIGLILAMIGGIISAAVNLIPVIGSTLGSLITTLFIAPATAIAFTLIY  129


>NQW20074.1 hypothetical protein [Chloroflexi bacterium]
Length=196

 Score = 71.0 bits (170),  Expect = 3e-11, Method: Composition-based stats.
 Identities = 30/166 (18%), Positives = 64/166 (39%), Gaps = 13/166 (8%)

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG-----GSLLLI  222
            ++ +     ++       +V    S++ GL  +    +  IL +L +         L+ I
Sbjct  1    MIAVGATIDAVARQYGGRNVDALVSLRRGLSKLWILIVSSILAMLAISLSGVLILILIGI  60

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
               +   + + F   ++  D  G + AL  S  LV G  W + G  ++  +IS+ +  L 
Sbjct  61   PLIIFLMISWAFIFPLILIDGAGPVSALTGSYDLVKGSRWRVLGIAIVFFLISIVIQILI  120

Query  283  ARIPYV--------GEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
              +  +            +     LL P + +   ++Y DL+A  +
Sbjct  121  RIVTGIVDGFSEPLAVIMSSIGMALLAPVAAIGLTVVYFDLRARKQ  166


>OGY30342.1 hypothetical protein A3F35_00750 [Candidatus Woykebacteria bacterium 
RIFCSPHIGHO2_12_FULL_45_10]
Length=256

 Score = 72.5 bits (174),  Expect = 3e-11, Method: Composition-based stats.
 Identities = 37/174 (21%), Positives = 70/174 (40%), Gaps = 7/174 (4%)

Query  146  TWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTL  205
                 Q  +    +   +              +        V     +   L+   S+ +
Sbjct  60   FINFTQLFSGVIVLFFVSFILGTAFAIMQILLVIKTDKGEAVNFSDLLSASLKLFFSYLI  119

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
            L   + + V  G +LL++PG++F  WFF   ++LA +   G++A++ S+ LV GH   + 
Sbjct  120  LGAFVGITVLVGIVLLVVPGIVFATWFFASFFILAQEGKRGMEAMKASKALVQGHTMDVL  179

Query  266  GRFVLLLVISLTLSFLTARIP-------YVGEAANLAFSLLLTPFSFLYYYLIY  312
             RF L +++   +S +   I         VG   +   S +L   S  + Y+IY
Sbjct  180  TRFFLGILVYALVSIVVGLILLPLEAIYLVGSVVSALISAVLGAVSLSFSYVIY  233


>MXX63576.1 hypothetical protein [Acidimicrobiia bacterium]MYD03594.1 hypothetical 
protein [Acidimicrobiia bacterium]
Length=239

 Score = 71.7 bits (172),  Expect = 4e-11, Method: Composition-based stats.
 Identities = 26/196 (13%), Positives = 58/196 (30%), Gaps = 6/196 (3%)

Query  134  APIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM  193
                    L  A      +      ++L       +                     R++
Sbjct  26   FGSMVMASLIVAVPYALSSIPGLDLLVLLAALLTPIAFGATIFLAAGAYVGVAPDWQRAL  85

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
                +      +  ++  + +  G   LI PG+   + F      +  +  GG  +L +S
Sbjct  86   SYAWKEYLPLLIASVVAAIAIMAGLPFLIFPGIFLAISFALALESVMLEGRGGFSSLGRS  145

Query  254  RLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY------VGEAANLAFSLLLTPFSFLY  307
              LV+ H   IF   ++  ++      ++  I +      V +      +L+  P +  +
Sbjct  146  WRLVNEHRGRIFWAGLIFFLVIGLAIGISYGIVWGLSNQTVADIWAGITNLVTIPLTSAF  205

Query  308  YYLIYSDLKANYRGPQ  323
                Y DL+       
Sbjct  206  GVAFYFDLRVRKENLD  221


>MAF79665.1 hypothetical protein [bacterium]
Length=458

 Score = 74.4 bits (179),  Expect = 4e-11, Method: Composition-based stats.
 Identities = 65/372 (17%), Positives = 126/372 (34%), Gaps = 15/372 (4%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            LL + L G V              + +    +    ++++   +  IL  +      ++ 
Sbjct  45   LLFLILGGGVYGVIYWEQQTSPFVSGFSFLFSPGILFSLMFFGMLVILFHVWANAALLYS  104

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                       + +     +GS+  + +L  L V GG LL +IPG++  V   F  ++L 
Sbjct  105  ITSGAKDNFLVAYRESFSKIGSYIWIGVLSTLAVLGGLLLFVIPGIIISVAITFAAFILF  164

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP--YVGEAANLAF--  296
             +   G  AL KSR  V G+WW + GR +   +I      L   +     G  A+     
Sbjct  165  VEGDKGSAALFKSREYVRGNWWGVLGRVLFAFLIVSLFLNLLQALIRYVFGSMASEIVVD  224

Query  297  ---SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLS  353
               S ++ PFSF++ + IY  L+          +  + L  +A +   +++ G ++++ +
Sbjct  225  AVSSFVVIPFSFVFLFEIYKSLRNGKPWVAEGILSGRLLAFSAILGFLLILAGPVVLTTT  284

Query  354  RQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEG  413
                +     S+       L  Q           P                 + +   + 
Sbjct  285  GIFRTVFSFYSSNISDFSGLNFQSFFAGGAFTESPVVSGFS------GEDLGRLQRIRDA  338

Query  414  GLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYD  473
                    L         Q+P+       + + + S +  GSA + ID  L     +   
Sbjct  339  RRISDLSRLGIVLSVISRQDPNFCNAPRGAVYRSTSPSGGGSAWLPID--LSLSGLNDLA  396

Query  474  RQHSFEHPAFHW  485
              H    P    
Sbjct  397  VTHPPRDPMNTS  408


>OQB13217.1 hypothetical protein BWY16_00266 [Candidatus Omnitrophica bacterium 
ADurb.Bin205]
Length=210

 Score = 71.0 bits (170),  Expect = 5e-11, Method: Composition-based stats.
 Identities = 29/159 (18%), Positives = 54/159 (34%), Gaps = 1/159 (1%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                 I   L  +                +     L       L  I+  L    G +LL
Sbjct  50   IVYLLISPYLFGLIMRFVFESIDKKPSWNKLNSFVLNKYPLILLAHIIYYLACFVGMMLL  109

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            +IPG++  +    C   +  DN   + +L +S  +  G WW +F   +   +  +  +F 
Sbjct  110  VIPGVILSIRLLLCDGGILFDNDSAIVSLRRSWRITKGSWWRLFVLVLGCSLPVILFAFF  169

Query  282  TARIP-YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
             + +P  V     L  S++   +    + L Y  L+   
Sbjct  170  ESLLPKTVYSFVYLLLSIITCVWYQCVFTLAYLHLRERE  208


>RTK93754.1 hypothetical protein EKI60_05420 [Candidatus Saccharibacteria 
bacterium]
Length=228

 Score = 71.0 bits (170),  Expect = 6e-11, Method: Composition-based stats.
 Identities = 29/141 (21%), Positives = 57/141 (40%), Gaps = 0/141 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                  L+  +   G +          L    +   +    +  L IL  LV+  G +LL
Sbjct  76   ILGIISLVLQTMALGLVLKAAQDKTANLNELTETARKFTLKYLGLSILSGLVIVLGFVLL  135

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I+PG +     F   YV+  +N+G ++++++S  L  G+  AI+G   + +++ +    L
Sbjct  136  IVPGFIAITRLFLAPYVMVSENLGVVESMKRSSELSKGNAGAIWGVIGVTILLGIIGQIL  195

Query  282  TARIPYVGEAANLAFSLLLTP  302
                P VG        +  + 
Sbjct  196  GFISPVVGAVVASLLGIGYSV  216


>MBM98530.1 hypothetical protein [Planctomycetaceae bacterium]
Length=366

 Score = 72.9 bits (175),  Expect = 6e-11, Method: Composition-based stats.
 Identities = 36/348 (10%), Positives = 85/348 (24%), Gaps = 40/348 (11%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                CP C      P     ++  +A CP+C + +I     +    +    A        
Sbjct  3    IEFDCPQCQNRLRVPD---GSEGQTALCPKCSEQMIVPQQNAGTASSNPPSAGDGGNSNP  59

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
               P       +   +            +    +S   +  +     + +         L
Sbjct  60   FSNPPADSSESNPYASPTSTTEIPTEGSQASLASSTISVNQVLTETWNGFTENLGAFLLL  119

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              I     +         +    A          +    +  +  +    +         
Sbjct  120  GVILFGMYMATSIVTIPIIFAAAAAGDPRFMVIAEILGGILNLFLVAFIYAVGLRYTLDV  179

Query  182  ICKTDVGLFRSM--KLGLRHVGSFTLLLILLIL-----------------------VVGG  216
            +  +     R+      L  +    +++ LL                         VV  
Sbjct  180  LSGSRSPFERAFKVFPFLLRIVFTNIVVGLLAFAGMLAITLPILPLIFFLQRQEGAVVIV  239

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
              L  +   +            + D   G   ++  S     G+ W IF   +++ +  +
Sbjct  240  AFLAGLTGYIFIFTRLCLPLLFIIDRGQGVFDSISSSFTYTKGNVWTIFFSILIIGLAGI  299

Query  277  TLS------------FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             L             F+      +   A +   ++  PF  + + +IY
Sbjct  300  ALFILLPLPGILLSNFIVVAGIALTFVACVLGMIVFLPFITIAFSIIY  347


>WP_169451565.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Marinobacterium sp. LSUCC0821]QJD71594.1 
hypothetical protein HH196_07725 [Marinobacterium sp. LSUCC0821]
Length=195

 Score = 70.2 bits (168),  Expect = 6e-11, Method: Composition-based stats.
 Identities = 31/185 (17%), Positives = 65/185 (35%), Gaps = 0/185 (0%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
            +F               Q     +    + ++ + L+     M   I    +   +S  +
Sbjct  1    MFLQTSSTIGMADGQDMQKQLMMLTFLDLLFVPIYLAATLFYMQAVIDGRTLTPMQSWLM  60

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
            G+R          L  +    G +L+IIPG+   V       +   +N G + A+++S  
Sbjct  61   GVRCWFRLFATFFLSAIATALGLMLMIIPGIYVGVRLALANAICVLENKGPMDAMKESWS  120

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
               G +W +F    L+      +  + + +   G   ++  S++L  F+ L     Y   
Sbjct  121  ATDGMFWTLFKGLALIYGGLFVVEVILSPLVEPGSLISMLISVVLDFFNVLGVIFSYRVY  180

Query  316  KANYR  320
            +A   
Sbjct  181  RAWRD  185


>HGV67288.1 hypothetical protein [Candidatus Moranbacteria bacterium]
Length=265

 Score = 71.7 bits (172),  Expect = 7e-11, Method: Composition-based stats.
 Identities = 36/194 (19%), Positives = 85/194 (44%), Gaps = 1/194 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L  +   G+      I  ++    +             +++      +  LS  +  + +
Sbjct  58   LFLMIFGGVTAVNGLISGSIWSGLSISETAAITIITLFLVVFIALIYVSLLSQCSLFLIV  117

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
               KT++ +    K   +++  +  + +L+ L V  G +L IIPG++F + + F  +V  
Sbjct  118  KDRKTNLKVLDIFKEAKQYLSGYFYVSLLVGLRVLVGLILFIIPGIIFAIRYSFSSWVYI  177

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-YVGEAANLAFSLL  299
            D+ + G  AL++S+ LV+G  W + G   + L+I+  + ++    P  +    +   SL 
Sbjct  178  DEGLKGGVALKRSKELVAGFGWQVLGIMGIFLLINFVIEWVLKIFPKQISSILSFPISLF  237

Query  300  LTPFSFLYYYLIYS  313
               ++ + ++ +Y 
Sbjct  238  FGIYAIIVFFNLYH  251


>PIS40798.1 hypothetical protein COT26_01430 [Candidatus Kerfeldbacteria 
bacterium CG08_land_8_20_14_0_20_43_14]
Length=241

 Score = 71.0 bits (170),  Expect = 7e-11, Method: Composition-based stats.
 Identities = 38/228 (17%), Positives = 94/228 (41%), Gaps = 9/228 (4%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATV-A  165
               +WE+F +    +  + ++  +              ++            I+      
Sbjct  15   FRHAWEIFSKNFQSIALLTIIINLPLNLISALFASRAASSASVAGVSAGALGIMAILSAL  74

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
              ++    +   +   +    V    +++   ++  +  +   L+ +++   ++LLI+P 
Sbjct  75   LGVIIPLGIVFIIQSGLNGQRVDYQAALRTAFQNWRAGVVTSFLMAILLILLAILLIVPA  134

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            ++F V++ F  Y + D+ + G+ AL++S+ +V G WW + G  +    ++   S++  RI
Sbjct  135  VIFGVFWAFAIYAVVDEKLSGMAALKQSKGVVQGRWWKVLGNLIAFGFVAGIASWVVNRI  194

Query  286  P-------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPP  326
                    ++   A+   S+  + FS +  YL+Y +LKA       P 
Sbjct  195  FGLLPSNVFISAIASTVASIATS-FSLVGGYLLYQNLKAVKPTTTPPA  241


>NDV08681.1 PQQ-binding-like beta-propeller repeat protein [Rhodococcus sp. 
IEGM 248]
Length=1427

 Score = 74.4 bits (179),  Expect = 7e-11, Method: Composition-based stats.
 Identities = 37/327 (11%), Positives = 82/327 (25%), Gaps = 7/327 (2%)

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
             L  +         +      +F  + L L  V S  +      +     S+ L    L+
Sbjct  349  ALAATRSRMFALCRLSLIFSAIFYLILLILFPVASIIITDRFGWIAFLASSIALSGATLV  408

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
              V F      L  +  G   +L++S +LV   W  +        + +  ++ +      
Sbjct  409  ATVLFSLAPVPLMLEGTGVFASLKRSAILVRPAWLRLTAVHAAWGLGT--ITLILMVNSI  466

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGL  347
            +    +L    +  P    +  L+Y+DL+                   A+          
Sbjct  467  LSGLGSLIGLAVAFPIFRTFQTLLYADLRTRAG---GGAHPGGLHLDDASASTTSSTALR  523

Query  348  LLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQR  407
               + S  +       S    +      +  + P    + P  P  + ++  + +     
Sbjct  524  AQSTPSEDSSVRTDDRSTPSALPLPTRERTDEPPQNTATAPPTPVDVPTSAPEAIPISPI  583

Query  408  KTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDD  467
               +  G +               ++       +    P L  A   S  +  D  +   
Sbjct  584  AAFTPPGQTHPAPPSAESTPADRPEHTGRAQ--DTPAQPALDPAVFVSPGVTEDAFVTPS  641

Query  468  ARDLYDRQHSFEHPAFHWVGINQTDEN  494
            A                          
Sbjct  642  AAHDTPPAPIPSLSTNTNPMTVPAPPA  668


>ERH20767.1 hypothetical protein HMPREF1978_00028 [Actinomyces graevenitzii 
F0530]
Length=210

 Score = 70.2 bits (168),  Expect = 8e-11, Method: Composition-based stats.
 Identities = 31/191 (16%), Positives = 60/191 (31%), Gaps = 34/191 (18%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            + + T     L +       F  +           +           L IL+++    GS
Sbjct  15   LNIITSIISGLMMIIGIAVFFALLAGVASTAKTDREFLQDLSIVLVGLFILMVISTLVGS  74

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
             L         + F      +  +N+G   A+ +S  L  G++W +FG  +L  +I+  +
Sbjct  75   YL--------SIKFSVASPAMVLENLGVFAAIGRSWSLTRGNFWRLFGINILTAIITSIV  126

Query  279  SFLTARIP--------------------------YVGEAANLAFSLLLTPFSFLYYYLIY  312
            + +   I                            +    +    LL+ PF+     L+Y
Sbjct  127  AGIFGGIAGALGAIFVVVGSSSPEDVIASLNTTYILTMVMSTIAQLLILPFTSSVNALLY  186

Query  313  SDLKANYRGPQ  323
             DL+    G  
Sbjct  187  IDLRMRKEGLD  197


>CAB1321354.1 unnamed protein product [Coregonus sp. 'balchen']
Length=1821

 Score = 74.4 bits (179),  Expect = 8e-11, Method: Composition-based stats.
 Identities = 15/249 (6%), Positives = 39/249 (16%), Gaps = 2/249 (1%)

Query  23    KKSSARCPECCQTLIFDPAESQRTQ--TTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRR  80
             + S A C    + +    A           + ATC                  + +    
Sbjct  1146  EPSHATCCWWWEDMEPSHATCCWWWEDMEPSHATCCWWWEDMEPSHATCCWWWEDMEPSH  1205

Query  81    CNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSAL  140
                 +  +      A+        +    +   +                       +  
Sbjct  1206  ATCCWWWEDMEPSHATCCWWWEDMEPSHATCCWWWEDMEPSHATCCWWWEDMEPSHATCC  1265

Query  141   LLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHV  200
                     +     W W  +  + A  LL +          +   +              
Sbjct  1266  WWWEDMEPSHATCCWWWEDMEPSHATCLLVVGRHGAQPRNVLLVVEDMEPSHATCCWWWE  1325

Query  201   GSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGH  260
                                          +           +++    A         G 
Sbjct  1326  DMEPSHATCCWWWEDMEPSHATCCWWWEDMEPSHATCCWWWEDMEPSHATCCWWWERHGA  1385

Query  261   WWAIFGRFV  269
                     +
Sbjct  1386  QPRNVLLVM  1394


>NLI34789.1 DUF975 family protein [Deltaproteobacteria bacterium]
Length=325

 Score = 72.1 bits (173),  Expect = 8e-11, Method: Composition-based stats.
 Identities = 42/272 (15%), Positives = 81/272 (30%), Gaps = 18/272 (7%)

Query  6    CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
            CP CG E     S++PA  + ARC  C +    DP +     T     T    G +    
Sbjct  1    CPQCGREYRIEDSRIPAGGTVARCNGCGRRFKVDPPQPGEGFTCPKCGTRQPPGDECLRC  60

Query  66   SDRLEI------------------QSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLL  107
                +                     K  +    +    +      +   +G        
Sbjct  61   GVIFKKLSSWQGQGESSASPSLDSPEKMSSESSPSSPILVSFGERAQGHEAGGFMHDSRF  120

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI  167
            +    +          +      L    I  AL    A     +       +       +
Sbjct  121  SLGSAIRFGWDRTRENLGFFIGFLILGLILIALPGVLADVAEERAPAVTVILFRIVAVVL  180

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
               ++     + + +         ++   L     +    +L  L+V  G LLL+IPG++
Sbjct  181  ESTVTMGFIKVALMVHDGVKPEISNLLDCLPLFFRYFFASVLYGLIVVLGILLLVIPGVI  240

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
            + + + F  Y + D+  G   A++ S  +  G
Sbjct  241  WGIKYMFYGYFIVDEGRGPWDAIKASGAITRG  272


>HBF67379.1 hypothetical protein [Candidatus Magasanikbacteria bacterium]
Length=365

 Score = 72.5 bits (174),  Expect = 9e-11, Method: Composition-based stats.
 Identities = 46/357 (13%), Positives = 96/357 (27%), Gaps = 33/357 (9%)

Query  96   SGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLN-----P  150
              +   S  Q L  +WE +      L GI  L +V+A  P   ++               
Sbjct  1    MRTAKFSYFQSLRFAWETWKHNVLFLAGIIFLLMVVASLPSLVSIAGGFLFEDPSSNAAY  60

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
                    IL+     + L +      + +  C         +         +   ++L 
Sbjct  61   VVFQIFVFILIIVSYILQLIMGIGFIKILLNFCDQKKSTVSDLFRVKGMFWRYVGGVLLY  120

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
             L++  G +L I+PG+ + V + F  Y+L D  I    A   S  L   +    F  F +
Sbjct  121  GLLILAGMILFIVPGIYWAVRYLFVPYLLVDKKISIGDAFRVSSKLTK-NTKWNFILFGV  179

Query  271  LLVIS----------------------LTLSFLTARIPYVGEAANLAFSLLLTPFSFLYY  308
             + +                       L L  +   I ++   A +  ++ +   ++++ 
Sbjct  180  FIGLLNLSYFVGFLFIFFGGFTLTAGNLALGIILLVIGFLLILAAIFITMPIMGLAYVHA  239

Query  309  YLIYSD-----LKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLL  363
            Y   +        A+ +             +   +   + I   + ++            
Sbjct  240  YRTIAYTTGAGFDADRQKGVTVSSGEHATKIVMVVVFVLAILFSIALAGWSIMRLRTMSY  299

Query  364  SAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPV  420
               +         P    D  R           A  +             G      
Sbjct  300  YPTQYQDFPSEVAPVPLKDYIREDGSFDAEAYQAALEEAYGSDSGFDLTEGQFDESF  356


>PWM08302.1 hypothetical protein DBX98_00680 [Clostridiales bacterium]
Length=266

 Score = 71.0 bits (170),  Expect = 1e-10, Method: Composition-based stats.
 Identities = 33/136 (24%), Positives = 56/136 (41%), Gaps = 1/136 (1%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                    G   M       +         ++ +   + G F   +I+  + V   ++L 
Sbjct  95   IDFILSPFGALAMIFLTKEGLEGKMPTYKEALSIAFSNAGRFICSMIVYTICVFVLTMLG  154

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIPG+   V + F    +  DN  G+++L  SR LV G WW  F   ++  +IS  LS+L
Sbjct  155  IIPGIFLSVVWSFYLQAMVLDNCKGIKSLGYSRELVKGRWWRTFLYIIVFNLISYALSYL  214

Query  282  TARIP-YVGEAANLAF  296
               I  +VG +  +  
Sbjct  215  IGVIFSFVGASYFMIV  230


>MBA3723719.1 hypothetical protein [Candidatus Levybacteria bacterium]
Length=329

 Score = 72.1 bits (173),  Expect = 1e-10, Method: Composition-based stats.
 Identities = 43/200 (22%), Positives = 85/200 (43%), Gaps = 4/200 (2%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
               +I+  L    G +FI   +  + +          +  +    + +  V  GG  +LI
Sbjct  116  AAVFIIFFLFLSIGGIFIIANQKRLSVGELFTRAKPLIVPYLFASVTVGFVTTGGWFVLI  175

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            IPGLLF ++F F  Y++  +   G+ AL++S  LV  ++W + GR +++ VI    S+L 
Sbjct  176  IPGLLFSLFFIFVSYIVVLEKKRGIAALKRSYALVKANFWKVVGRILIIQVIIFAGSYLF  235

Query  283  ARIPY---VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIF  339
              +     +    +  FS++   F+ +Y YL+Y    A    P    ++  W+       
Sbjct  236  ETLAEQNDIFGLFSFIFSIVAGWFTQVYMYLLYKQ-LAADAYPAQVNLRWIWITAVIGWL  294

Query  340  GWMLIPGLLLVSLSRQNLSA  359
              + +   L  +      ++
Sbjct  295  LAIGLIMALGTTQLPALFNS  314


>PIS21728.1 hypothetical protein COT51_01250 [candidate division WWE3 bacterium 
CG08_land_8_20_14_0_20_41_15]PIZ43199.1 hypothetical 
protein COY33_01965 [candidate division WWE3 bacterium CG_4_10_14_0_2_um_filter_42_7]
Length=334

 Score = 72.1 bits (173),  Expect = 1e-10, Method: Composition-based stats.
 Identities = 34/172 (20%), Positives = 62/172 (36%), Gaps = 12/172 (7%)

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
             G +     +   +   +V      K G R +  F     L  LVV  G +LLIIPG+LF
Sbjct  155  WGQTTALEGVIAKVWGREVRAGDCYKNGFRKIWGFITTGFLYGLVVFFGFVLLIIPGILF  214

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP--  286
              W+    YV   +   G  A+++S+ +++G      GR +L  ++S  +      I   
Sbjct  215  AFWYSLAPYVFLVEGKRGFSAMKRSKEIITGRILGYTGRNILFALLSGVILIPIMGILVG  274

Query  287  ----------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
                      + G  A     + +     +    +  ++       +  P  
Sbjct  275  LTSATSGVEVFGGLLAFALGLITVLASGTISIAGMIFNMLMIKELIETTPEM  326


>EFD92724.1 hypothetical protein BJBARM5_0577 [Candidatus Parvarchaeum acidophilus 
ARMAN-5]
Length=218

 Score = 70.2 bits (168),  Expect = 1e-10, Method: Composition-based stats.
 Identities = 34/180 (19%), Positives = 69/180 (38%), Gaps = 4/180 (2%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            + +  + ++  V +     S      +   +        + L+  +   +L   ++    
Sbjct  20   YFIFALAVIYTVASVLFALSISSPISSFITSKGTVISNVSFLVIHMILFMLFSVFLQDIT  79

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
            FI +   +  +  ++K+ +     F    I+  ++V  G +  IIPG+     F      
Sbjct  80   FIRVFNKNKNIKYTLKMAVYRFPVFLATDIITGVIVTLGFIAFIIPGIYLLFKFILAPVS  139

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
             A +    L AL +S  L  G+WW +F  F++L +I   LS L    PY+         +
Sbjct  140  SAVEKKSPLDALRRSWQLTRGNWWVLFAVFLILGIIVSILSLL----PYISYFFEFIIII  195


>MBI4281800.1 hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=261

 Score = 71.0 bits (170),  Expect = 1e-10, Method: Composition-based stats.
 Identities = 49/249 (20%), Positives = 90/249 (36%), Gaps = 2/249 (1%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
                  A  L  +  L   +       L     ++    +         +    + L  +
Sbjct  10   LPFFKQAGALTSSVLLMVMSGLLAIFFLAVLGLFMFWIGTTQALFYLALLDGKALELGEA  69

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
            +K G +    F    +L  L+VGGG LL I+PG+++ + +FF   +   + + G  AL +
Sbjct  70   LKNGWKRFLGFLWTSVLTGLIVGGGLLLFIVPGIIWGLRYFFAPLLCLSEGVSGRAALRR  129

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL-AFSLLLTPFSFLYYYLI  311
            S  L  G  W +  R  ++ +I   L  L   IP VG    +  F+  L      Y    
Sbjct  130  SAELTHGVRWVLLARGYVVSIIGFVLG-LLQLIPLVGPFLFVPFFTTPLIALLSWYLLRA  188

Query  312  YSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQ  371
            ++  KA     + P  K +   L     G+ ++   + V++       + L    K  + 
Sbjct  189  FTTAKAANASLETPYGKGKKFALLLPQIGFGIVLIGMFVAIIYGATHRQALQCDPKTGEG  248

Query  372  RLGTQPQQT  380
             L +     
Sbjct  249  CLSSLQNYQ  257


>KKR53597.1 hypothetical protein UT90_C0006G0009 [Parcubacteria group bacterium 
GW2011_GWA1_40_21]
Length=247

 Score = 70.6 bits (169),  Expect = 1e-10, Method: Composition-based stats.
 Identities = 52/221 (24%), Positives = 94/221 (43%), Gaps = 12/221 (5%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W ++ +R    LGI    I  +       LL       +    N     +L    ++ LG
Sbjct  14   WGIYKKRLRVFLGIMAFPIFASLLLTAFLLLDVSLLRAHSAFFNMLGLGILILSGFLTLG  73

Query  171  LSWMTGSMFIYI------CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
            +  +     I +       + ++G+  S + G   + S+  +  L  +++ GG+ L  +P
Sbjct  74   VIVIQLWSQIALLYAIKDREENIGIKESYRRGWSKIISYFWVSFLSGIIIAGGTFLFFVP  133

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            GL+F +WF    +V   +N  G  AL  S+  V G+W A+F RF+ + +I+    +L   
Sbjct  134  GLIFFIWFSLASFVFISENKKGWAALAASKDYVKGNWRAVFWRFLFIGIIAYLFLYLFNA  193

Query  285  IPYVGE------AANLAFSLLLTPFSFLYYYLIYSDLKANY  319
            +    +              LLTP   +Y +LIY +LK  +
Sbjct  194  LAVFTDTLVIDKIYIYIGYALLTPMVTIYSFLIYENLKKIH  234


>HFH10638.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=394

 Score = 72.5 bits (174),  Expect = 1e-10, Method: Composition-based stats.
 Identities = 32/219 (15%), Positives = 74/219 (34%), Gaps = 22/219 (10%)

Query  134  APIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM  193
                +  +                  +L  V   +     +  ++       + GL  S+
Sbjct  175  LLGGAEGMGFSPASFGSPEIVVLAMGVLIFVVLFIWTEGALIYAVSEIHLGHEAGLRGSL  234

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
               L  +G+    ++L+  ++  G L  +IPG +F + +     V+  + + G QA+ +S
Sbjct  235  GAVLPKLGALLSTILLVWFLIFLGVLCFVIPGFIFLLRWLMADKVVVLEGLRGTQAMRRS  294

Query  254  RLLVSGHWWAIFG----------------RFVLLLVISLTLSFLTARIP------YVGEA  291
            R L+   + A F                   + + ++ L   ++ + +       Y+GE 
Sbjct  295  RELMRFRFGAGFWSRPWARVCLLGCGVGLVCLGMYLVFLIPGWILSYLFPGALSSYLGEG  354

Query  292  ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
              L    L   +  +   + Y D++       H  +   
Sbjct  355  LELLAETLTAAYGSIALVIYYYDIRVRKENFDHRSMAEH  393


>HGX05283.1 hypothetical protein [Gemmataceae bacterium]
Length=338

 Score = 71.7 bits (172),  Expect = 1e-10, Method: Composition-based stats.
 Identities = 47/327 (14%), Positives = 86/327 (26%), Gaps = 24/327 (7%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCP-----------------ECCQTLIFDPAESQ  44
                CP C A           K    +C                  +    L       +
Sbjct  3    IPFTCPACNASFRVKEEMAGRKGKCPKCGAGVVVPHSGPPDAVQAADPPTPLARPRPPVE  62

Query  45   RTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSIS  104
                    A  P             E +          R    + +       SG R  S
Sbjct  63   ERVAASAPADEPVPDRYGADEHWDREERPYQTEAPDLERLRDYRIDMGEWYRYSGGRHNS  122

Query  105  QLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWL-NPQNQNWQWAILLAT  163
              +      F         + +L +V      F  LL + A        +  +       
Sbjct  123  AFMGPMIGFFFIMWAVTFAMMMLSMVFIGYLGFFLLLPQLAAGPTIVCLRQLKGQEWKFG  182

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
              +      W T ++ +      +       + L      T  + +    + G S L II
Sbjct  183  DFFGGFKYYWTTLAIELLAGLMQLVTLAPGFILLIIAKQITDAMDIGPNPIFGFSALAII  242

Query  224  PGLLFCVWFF------FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
               +    +       F + ++ +   G + A + S  L  GH+W +FG F+L+L +   
Sbjct  243  GISIAATVYVWFRLNVFARQIMIETQCGPIAAYKGSWRLTRGHFWGLFGSFMLMLFLVYL  302

Query  278  LSFLTARIPYVGEAANLAFSLLLTPFS  304
            ++F+T  I  +                
Sbjct  303  IAFVTCGIGLLFALPRGLLFWNAAYLL  329


>WP_115032115.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Dermatophilus congolensis]STD15348.1 Membrane 
domain of membrane-anchored glycerophosphoryl diester phosphodiesterase 
[Dermatophilus congolensis]
Length=394

 Score = 72.1 bits (173),  Expect = 1e-10, Method: Composition-based stats.
 Identities = 26/212 (12%), Positives = 57/212 (27%), Gaps = 22/212 (10%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            + +    + + +      + + F    L  +   GL  L +S  L  G +W +FGR +L 
Sbjct  179  IALLITLIAISVIVTYVAIRWCFYLATLTLETGKGLNPLRRSWHLTKGAFWRVFGRLILF  238

Query  272  LVISLTLSFLTARIPYVGE----------------------AANLAFSLLLTPFSFLYYY  309
             +    + F+      +                          +   ++L  P    +  
Sbjct  239  NIAVGIVWFIVTAALGMAFGAPMEETFGNNPATSGGAVTANILSTLLNILFAPLMACFMS  298

Query  310  LIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDI  369
            ++Y D +         P          A       P       +    +AE     G+  
Sbjct  299  ILYLDERRRRNEEDPAPTAHVIDAPWQAETTQWGQPQGQRYGATTPEPAAETPQRYGQQS  358

Query  370  QQRLGTQPQQTPDLNRSLPEEPQRLSSADYKL  401
                    Q+    N +        ++ +   
Sbjct  359  PTEEPRYGQRVETNNETSQRSDDDKNTGNTPP  390


>TAN40314.1 hypothetical protein EPN25_08115 [Nitrospirae bacterium]
Length=206

 Score = 69.4 bits (166),  Expect = 1e-10, Method: Composition-based stats.
 Identities = 57/213 (27%), Positives = 96/213 (45%), Gaps = 8/213 (4%)

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            +SW   ++   +    +G+  S+ LG + VG+F     +   ++ GG LLLI+PG++F V
Sbjct  1    MSWGLAAVVFAVTDESLGIRDSLALGWQKVGAFIWFFSIAGYIIFGGFLLLIVPGVIFLV  60

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            WF F Q++LA +++ G+ AL KS+  V G+W  +F R  L+ + S  +  +         
Sbjct  61   WFAFGQFILAREDLRGMDALLKSKEYVRGYWPDVFLRLFLIWIASGVVGIV--------P  112

Query  291  AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLV  350
               + F++   PF  ++ +LIY DLKA      +     +      A     LI    ++
Sbjct  113  CIGILFTVAFMPFMMIFIFLIYEDLKAAKGDIAYHSSTGEKFKWIGAGTLGYLIIPAFIL  172

Query  351  SLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDL  383
             L   +LS   LL  G   Q             
Sbjct  173  LLLGVSLSIPLLLLKGLLNQTAREMIMIPAQFW  205


>OVE80851.1 hypothetical protein BVY04_04755 [bacterium M21]
Length=297

 Score = 71.0 bits (170),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 39/267 (15%), Positives = 75/267 (28%), Gaps = 0/267 (0%)

Query  48   TTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLL  107
                        L                              +      +        L
Sbjct  7    CWGCGEEYSEDMLAAFGERLICGGCKPNYVQSVKENVQPKSAIQVGPDWLATSDLGLGGL  66

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI  167
                     R +    +  + I+     +  AL +      NP        + L      
Sbjct  67   LGRTFETYGRVFKPCLLMAVVIMGPLYLLMLALKVIAIQTSNPLLLMGNGILGLIAAILG  126

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
               +  +  S+        +    ++  G R  G       L  L+V G   L I+PG++
Sbjct  127  FAPMIGVVHSVHRSTYGEMIDWRTALSFGFRRFGDVFGTAFLGGLIVLGLFFLGIVPGII  186

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
            + +++ F   V+A     G  AL+ S+ LV G WW IFG   ++ ++ +  + L A    
Sbjct  187  WAIYYSFIIAVVAVTYQTGKDALDYSKSLVKGRWWRIFGYIFVINLLVVFSTILFAGAGA  246

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYSD  314
                 +     ++         L+   
Sbjct  247  GLGIVHPVAGAIVGTGGEFINILLSYM  273


>WP_013269169.1 hypothetical protein [Brevundimonas subvibrioides]ADL01067.1 
protein of unknown function UPF0259 [Brevundimonas subvibrioides 
ATCC 15264]
Length=320

 Score = 71.3 bits (171),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 35/156 (22%), Positives = 69/156 (44%), Gaps = 6/156 (4%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                   +    +            + +  +   G +       + IL+ L  G G +LL
Sbjct  136  LWFVGTYMLQGMVVKVTVASFNDKAMSIGAAFAAGSKLFLPLLGVGILVGLGTGLGYILL  195

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            ++PG++  V +      +A ++ G  ++L++SR L  G+ W IFG  V+L + S+ +  L
Sbjct  196  VVPGVILAVIWSVATAAVAVEDRGVTESLQRSRELTKGYRWPIFGLAVILFLGSVMIGML  255

Query  282  TARI------PYVGEAANLAFSLLLTPFSFLYYYLI  311
             A I       +VG +A+L  +++ T  S +   +I
Sbjct  256  VAGIGAATGGSFVGGSASLGVNMITTALSNILTSVI  291


>MBI2506872.1 hypothetical protein [Candidatus Colwellbacteria bacterium]
Length=247

 Score = 70.2 bits (168),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 46/196 (23%), Positives = 89/196 (45%), Gaps = 4/196 (2%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             +++I   + ++    FIY  +   G+  S +   R++  +  L IL  L++ GG ++ I
Sbjct  35   IISFIGGIIYFIAELAFIYQLRDQAGVNDSYRNAFRNILPYIWLTILPGLIILGGFVMFI  94

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            +PG++F +W  F  YV   +N  G+ A+ +S+  + G+WW +FGR + L V  L + +L 
Sbjct  95   VPGIIFIIWLLFPLYVFVFENQRGMNAVLRSKEYIQGNWWQVFGRILALFVAILIIYYLP  154

Query  283  ARIP----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAI  338
              +      +         L+  P S +Y+YL+Y   K         P          + 
Sbjct  155  VALTSDSVVISNGIAGLLELIFVPVSIIYFYLMYQSSKQMKTELAGQPPTGPRGFFYFSA  214

Query  339  FGWMLIPGLLLVSLSR  354
               ++   LL++  + 
Sbjct  215  IMGIVGIVLLVLVATY  230


>PYQ65644.1 hypothetical protein DMF53_05085 [Acidobacteria bacterium]
Length=312

 Score = 71.0 bits (170),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 38/214 (18%), Positives = 72/214 (34%), Gaps = 4/214 (2%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +    I+     +   +   PA    P        +LL+ V    +    +T  +F  
Sbjct  98   FVLLTAVILAPLYILRGYVAAMPAGAATPVAVISALVLLLSAVLCPYIATGAITYGVFQQ  157

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +   D  +   +  GL  V     L I+   +V  G +  +IPG+L  + +      +  
Sbjct  158  LRGKDTTIGDCLGRGLSAVLPVLGLAIVQTFLVALGLVACLIPGILLALRWAVAVPAMVA  217

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI----PYVGEAANLAFS  297
            +  G   +L +S  L  G    +FG   +L  + L   F    +    P +    +   S
Sbjct  218  ERTGISDSLSRSTFLTEGSRGEVFGVLFVLGALQLGAGFAVTLVALKNPTLSLILSGVQS  277

Query  298  LLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
            L     S     ++Y  L++         I   +
Sbjct  278  LFTVGLSATGSAVLYYRLRSLRESIDVDQIASAF  311


>MBA3550783.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Patescibacteria group bacterium]
Length=316

 Score = 71.3 bits (171),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 30/159 (19%), Positives = 56/159 (35%), Gaps = 0/159 (0%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
             A  F A + +       Q+    + +       ++                    +   
Sbjct  43   LAVSFLASVFQSIITFTLQDSLGLYLLASVLAFIVIYISYLALVIAVADEQNQHADISAL  102

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
             +        +  + IL  L V  G  L +IPG+   ++  F  YVL  +    + AL K
Sbjct  103  YQSAWHVFFPYVGVFILTTLTVIAGLTLFVIPGIAVAIFLSFSMYVLVVEKKHWMSALTK  162

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
            S   V G+WW +FGR ++  ++   L  +T  +  +   
Sbjct  163  SWYYVRGNWWKVFGRMIVFAILMGVLMIITEIVTSLLGL  201


>HEV45185.1 hypothetical protein [Caulobacterales bacterium]
Length=484

 Score = 72.5 bits (174),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 39/121 (32%), Positives = 61/121 (50%), Gaps = 0/121 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +A   +  + +  +    +    V L  S+K+GLR       L IL+ L +  G++LL
Sbjct  293  IMLAAAFVLQAAIVHATVTDLNGRRVVLGDSLKVGLRDCLPLIGLAILMGLGIALGTMLL  352

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIPGL+  V +         + +G LQA+++SR L  G  W IFG FVL ++ +  LS L
Sbjct  353  IIPGLILAVLWSVAVPAKVVEKLGVLQAMQRSRDLTRGRRWPIFGLFVLYVIANWMLSAL  412

Query  282  T  282
             
Sbjct  413  I  413


>MBA3422132.1 hypothetical protein [Thermoleophilaceae bacterium]
Length=282

 Score = 70.6 bits (169),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 47/241 (20%), Positives = 83/241 (34%), Gaps = 11/241 (5%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
            G++ +    A   + +AL++     +          IL        L  +     M + +
Sbjct  13   GVFDIYRDQASVLLPAALVVFVVVGVIAAVLVAISPILGILAVIAQLIGTAFFQGMVVQL  72

Query  183  CK------TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
                     D  +          V +     +L  L +G G +LLI+PGL     +    
Sbjct  73   VGDVQDGRRDTSVGDLFASVSPVVAALIGASLLQGLGIGVGFILLIVPGLFLLTIWAVVA  132

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-----PYVGEA  291
             V+  +  G L A  +SR LV GH W + G   + +V+ + +S +   I           
Sbjct  133  PVVVLERPGVLPAFSRSRELVRGHGWQVLGVLAVFIVVLIVVSLIFGLIGGALGSVGAVI  192

Query  292  ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVS  351
            A++  S+L  P   L   ++Y +L+A       PP          +       PG     
Sbjct  193  ADIVSSVLTAPLLALAAAVLYFNLRAVKGEVAPPPGAVGLHTPAGSPGIAPESPGPQAAP  252

Query  352  L  352
             
Sbjct  253  P  253


>PIZ48299.1 hypothetical protein COY32_00210 [candidate division WWE3 bacterium 
CG_4_10_14_0_2_um_filter_41_14]PJA39718.1 hypothetical 
protein CO180_00120 [candidate division WWE3 bacterium CG_4_9_14_3_um_filter_41_6]
Length=459

 Score = 72.1 bits (173),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 50/361 (14%), Positives = 121/361 (34%), Gaps = 14/361 (4%)

Query  99   GLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWA  158
                    L     LF         + +  I++  + +    ++ P T            
Sbjct  84   FEIFKQHWLQFIGFLFLPSLVVGTLLVIFIIIVVLSGVELTSVVNPETLQRTVIIVGLLM  143

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            I  + +   +   S  +  + +      V     MK  +++V    ++      +V GG 
Sbjct  144  IPSSLLIGFIGMWSTASIMVRVRDRNEQVTFVEIMKRSVKYVWPLLIVAFYTGFIVHGGM  203

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            LL I+PG++F +WF F   ++  D++ G+ AL KS+  VS  +  IF R++++++ ++ +
Sbjct  204  LLFIVPGIIFAIWFMFASMIVVFDDVKGMDALLKSKAYVSNIFGKIFSRWIVVILFAMGI  263

Query  279  SFLTARIP--------------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
            +   A I                +       F+L ++    ++  L++S +K      + 
Sbjct  264  TISLAIISGSITTVLGKDNPAIILLYLIETPFNLAVSIVGTIFGVLLFSYVKKAKGSFEF  323

Query  325  PPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLN  384
                   + L       ++    + + +    +S+   L    +    L         ++
Sbjct  324  VAKTSTKIWLWLVALFGLVAFIAISIGIMGMVISSGGALFNMTEGADSLIPSNSSHSIID  383

Query  385  RSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSD  444
             +   +    S         +  K   +         +         +     L  + +D
Sbjct  384  STFSIDDTYYSDVFEVQFGIEMYKFFEDSYPETLSQIVPEYVSTDPSETFTYELTADGTD  443

Query  445  F  445
            +
Sbjct  444  Y  444


>RKY68944.1 hypothetical protein DRP97_05810 [Candidatus Latescibacteria 
bacterium]
Length=237

 Score = 69.8 bits (167),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 41/217 (19%), Positives = 81/217 (37%), Gaps = 6/217 (3%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
             +     +      G     +  +  + A    ++  +   + +     +    A+ L  
Sbjct  12   YRTHFRLFWKMALIGELPAYVGTVIFLSAMGRPYALGMEPISIFSGAGWKVIGGALALIG  71

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKL-GLRHVGSFTLLLILLILVVGGGSLLLI  222
            +    + L  +  ++        + +  + +   +    +  L  IL++LV+  G +  I
Sbjct  72   LMLSAMFLVAVVYAVEDLDEGKTLTVMAAYRSVPMGMAVAVLLAFILVMLVISIGMMAFI  131

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            +PG+L  VWF+   Y +  D   G  A  +S+ LV GH   + G       I L  S L 
Sbjct  132  VPGILLAVWFYLSPYAIVIDEAEGWDAFRRSKQLVRGHGGKVLGIIGASFGIELAGSLLI  191

Query  283  ARI-PYVGEAANLAFS----LLLTPFSFLYYYLIYSD  314
            A + P  G   +        L++TPF  L    +Y +
Sbjct  192  AGLRPLFGSLGSGVLLTGWDLVITPFQILLMIFLYDE  228


>WP_050702637.1 EI24 domain-containing protein [Dysgonomonas sp. BGC7]MBD8388045.1 
EI24 domain-containing protein [Dysgonomonas sp. BGC7]
Length=289

 Score = 70.6 bits (169),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 38/245 (16%), Positives = 74/245 (30%), Gaps = 23/245 (9%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYIL---L  169
              C      L    L        + S         +          I        +   +
Sbjct  36   PICIFIPIFLIAIYLIPNTQNVSMSSMAAYDNPIDMYKSLFPIGAIIAYFITGISMYLTI  95

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
              +    + +       V          + +    L  IL  L+VG G++  IIPG++  
Sbjct  96   LYTITYMATYAKSTDGIVKSSDIWNRVKKVMIPLFLGSILFSLLVGIGTIFCIIPGIIIY  155

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI----  285
            V+  F  Y   ++++G + + ++S  LV  +WW  FG  ++  ++   +S + +      
Sbjct  156  VYLGFYMYTYINEDLGIIDSFQRSFNLVKNNWWVTFGFGLIFGILFFIVSMIFSIPSYIA  215

Query  286  ----------------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
                             YV    +   S LL P  ++   ++Y    A   G        
Sbjct  216  ILGPTLDIEFLKSDIYMYVATLISSLGSFLLYPLLYMAMGILYYSHVAKLDGTDMDSEIE  275

Query  330  QWLPL  334
                 
Sbjct  276  NIGTY  280


>MBI5175733.1 hypothetical protein [Candidatus Melainabacteria bacterium]
Length=353

 Score = 71.3 bits (171),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 37/302 (12%), Positives = 81/302 (27%), Gaps = 9/302 (3%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
             L+D W+        L G+ ++  ++         ++  +  +           L     
Sbjct  16   CLSDGWKGMISHFLPLFGVMVVSFLIMAMATLPTAVVIVSHIMKFPLPAIADVFLSVLGM  75

Query  166  YILLGLS----WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                 +            + +   +    + + +   +   F     L+  +     L  
Sbjct  76   VAGFVVGPMVQMGMLRACLKVIDNETPSPKDLFVCWPYFWQFFSAGFLMKCMRFPAYLCF  135

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I PG+   + F F +Y + D  +G +QA   S  +  G    I    V+LLV++     +
Sbjct  136  IFPGIWLDLSFRFYEYFVVDRGMGPIQAFRASNQVTKGTLVRIALTEVILLVVAAIGGMV  195

Query  282  TARIPYVGEAANLAFSLLLT-----PFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTA  336
                        +     +       F          D +A     +     R     + 
Sbjct  196  LIVGAVPAAMIVMLARASMYRQVLQSFLGENKVEYDPDDEARDFDDKLRRQARLAGDPSL  255

Query  337  AIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSS  396
               G +++  +         +SA   + A             Q      +    P     
Sbjct  256  KESGAVIVGSMQKPPQEEPRVSAPSFVDAPPVDDLEAAFANLQKQAQKDAAEGAPTPAPQ  315

Query  397  AD  398
             D
Sbjct  316  GD  317


>OQX00469.1 hypothetical protein BWK69_01345 [Candidatus Parcubacteria bacterium 
A4]
Length=219

 Score = 69.4 bits (166),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 38/215 (18%), Positives = 65/215 (30%), Gaps = 3/215 (1%)

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP---YVGE  290
               Y+L  ++I G+ AL KSR  + G W ++F R +   ++          I      G 
Sbjct  1    MAVYILIVEDIKGMDALMKSREYIRGRWLSVFWRLLFPSLLVAIFFLPLFFISKFIPFGF  60

Query  291  AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLV  350
                 FSL   P   +Y++LIY +LK+        P K +  P        +LI   +L 
Sbjct  61   FVEFIFSLFFVPLLMIYHFLIYKNLKSVKGEFIFEPAKIKKWPFILTAIIGLLIVPAILA  120

Query  351  SLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTT  410
             +     ++ +  +     Q  +                 P  L         S      
Sbjct  121  LIVSTGTNSAREKARDAQRQLDIMHIQMALEFYQMDNDGYPSSLDKLSSSGTYSSNIVDP  180

Query  411  SEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDF  445
                     V      +    +      K   S +
Sbjct  181  KTKKPYQYRVLKGGSDYEVCAEMETKEEKCLTSQY  215


>AGW14098.1 hypothetical protein DGI_2345 [Desulfovibrio gigas DSM 1382 = 
ATCC 19364]
Length=257

 Score = 69.8 bits (167),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 22/170 (13%), Positives = 55/170 (32%), Gaps = 3/170 (2%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
               +  +    +T      +    +   ++++   +      + + L  L +  G LL++
Sbjct  87   AFLFFPVSSGALTLLFTADLFGESMTWQQALRQAWQRKVPLCVTMALSTLCIILGMLLIV  146

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL--TLSF  280
              G+ F + F      +  +      AL +S +L+  ++W      +L+  +++   +  
Sbjct  147  -LGVYFALRFTLIYQTVMLEQADPKTALRRSGVLMKSNYWTALALVLLMWGLTIGVVIGV  205

Query  281  LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
                 P V                   Y  +Y   +    G Q  P   +
Sbjct  206  SVTASPLVAGVVQWVVEAGTMCAVSACYTALYFSARHAKEGFQLQPGASR  255


>WP_183343261.1 hypothetical protein [Conexibacter arvalis]MBB4663537.1 hypothetical 
protein [Conexibacter arvalis]
Length=217

 Score = 69.0 bits (165),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 32/153 (21%), Positives = 61/153 (40%), Gaps = 6/153 (4%)

Query  167  ILLGLSWMTGSMFIYICKT-DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
             +     +   +        D  +    +          LL I++ + V  G +LLIIPG
Sbjct  59   TIFYQGMVVRLVDDVRDGALDSSVGELFRSVAPVALPLFLLAIVVGISVAIGLVLLIIPG  118

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            L     +      +  +  G   AL +SR LV G+ W +FG  V++  + + +  + A I
Sbjct  119  LFLLTIWAVSAPAVVLEGKGVFAALGRSRELVRGNGWNVFGVIVIVWALMIGVGIVGAAI  178

Query  286  PYVG-----EAANLAFSLLLTPFSFLYYYLIYS  313
              +G          A ++L+ P + L   +++ 
Sbjct  179  GALGGDVLRVLVQWAVNVLVAPVAALATAVLFF  211


>CCY66643.1 putative uncharacterized protein [Clostridium sp. CAG:678]
Length=372

 Score = 71.3 bits (171),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 34/374 (9%), Positives = 88/374 (24%), Gaps = 13/374 (3%)

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLR------HVGSFTLLLILLILVVGGG  217
            + ++      ++     Y+   +      +K   R      ++    +  +  +L++   
Sbjct  1    MIFLAPLSVALSYFYVEYVTGKEFEFDSGLKSVFRNAFKVTYLKKVAVAFLKELLILLLS  60

Query  218  SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
             L LI   +     +F  + +     +   QA+  S+ +V G+   +F   +  +   L 
Sbjct  61   ILFLIPGIVFNYSSYFAFEIMAEYPELSPWQAISLSKKMVKGNRTELFVLDLSFIPWMLL  120

Query  278  LSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAA  337
              F+   I            +  T   +   + + +               ++       
Sbjct  121  CVFIFPVIYVW-------PYMFTTRALYYENFKLRALAMHRITEDDFLSDAQKMNRAMNG  173

Query  338  IFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSA  397
               +         +                  Q   G         N   P+      + 
Sbjct  174  GAPYQQQAPYGTYTAQNAAYPPNGAAYQQNPQQNTQGYGNAYPGAQNPQQPQAQYGGYNP  233

Query  398  DYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSAR  457
                  +       +G             +    Q P+      +   P +   +   A 
Sbjct  234  AGNRQYTNNYAPPQQGAPYQPTPPYPQGGYAPTMQQPYGPAVSSVYFTPVMPAQRAAQAA  293

Query  458  IEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSI  517
             E +   +  +          E+              +    I     ++  + E     
Sbjct  294  AEFNSAANAASAQSTPGMRQNENSPAATDDSKTAANGEQKPDITITEPQEPAEPEYAEPQ  353

Query  518  LGKLELTLPLAIES  531
                  T P   E 
Sbjct  354  EPTESFTEPQEPEE  367


>MBI2910977.1 hypothetical protein [Chloroflexi bacterium]
Length=539

 Score = 72.1 bits (173),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 40/250 (16%), Positives = 77/250 (31%), Gaps = 4/250 (2%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
                       +        L    +   W + L       L  + +  +         V
Sbjct  83   HYTAFSLLGLLSQGPSFLLLLGAITEEVSWVLYLLLAPVETLLGALIILATARDQAGEPV  142

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
                ++   LR + +      ++ L +   S L +   +   V  +F Q  +  +  G L
Sbjct  143  SFTEALGPLLRRLPALFG-GWVVFLALLVVSALGLPLFIYLLVSLYFFQQPIVIEGAGPL  201

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLY  307
              L +   LV G WW +FG  ++L+++   L+ +       G   +L    L T    + 
Sbjct  202  AGLRRGHALVKGSWWRVFGIGLVLMLLLTILASVAG---LFGRLPSLFAGALTTALGNIG  258

Query  308  YYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGK  367
              L+Y DL+    G     +  +      A           L      + S  ++     
Sbjct  259  ATLLYLDLRVRKEGCTAERLAWELGGRVGASPEPAPRRRARLRPTPPWSPSGPKVSRRRP  318

Query  368  DIQQRLGTQP  377
               +R G  P
Sbjct  319  HEPRRRGWGP  328


>MBI4252880.1 hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=330

 Score = 71.0 bits (170),  Expect = 2e-10, Method: Composition-based stats.
 Identities = 45/210 (21%), Positives = 85/210 (40%), Gaps = 8/210 (4%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             +  Y     + +   +        + L  S ++  +      ++ +LL L +  G L +
Sbjct  82   ISGFYFSFIFAAIVYLVDEKYRGRTLTLVESFEMAAQRYVDVFVIGLLLFL-IMNGGLAI  140

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV---LLLVISLTL  278
            ++    F +WF+F  +++  D   G  AL KSR LV G ++ +FGR+V   LL+ ++  L
Sbjct  141  VVMPFFFSIWFYFAFFIVLLDKERGWNALAKSRYLVHGMFFRVFGRYVAITLLVFLAFVL  200

Query  279  SFLTARIPYVGEAANLA----FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
             +L   +P++G           +    PF  LY Y  Y DL A  R       + + + L
Sbjct  201  VWLLLALPFIGWLLFTLCFIALAFFAFPFYILYEYFRYQDLVAVERNIPFHAFRGERVGL  260

Query  335  TAAIFGWMLIPGLLLVSLSRQNLSAEQLLS  364
                   ++I  ++           E+   
Sbjct  261  RMWAVAGLVITLMVWSYNVLGAQGRERFAQ  290


>EKE14382.1 hypothetical protein ACD_12C00540G0002 [uncultured bacterium]
Length=244

 Score = 69.4 bits (166),  Expect = 3e-10, Method: Composition-based stats.
 Identities = 54/213 (25%), Positives = 99/213 (46%), Gaps = 4/213 (2%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI---  167
            W  F +        Y    +   +       +             + ++L A ++ I   
Sbjct  29   WWNFDKIIRTSFRFYSKNFLKLISFTMITYGIGWLITFFIGRSLSEESLLSAILSLIKGI  88

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
            L      T  MF+   + ++ +   + LG         L ILL +++G G++LLIIPG++
Sbjct  89   LYLWGMATIIMFLGNAEKNLSIKEYLFLGFPKTWETFWLQILLGIIIGIGTILLIIPGII  148

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-P  286
            F +W+ F  +    +N G +++L +S+ LV G+ +++F R  +L +I+L L  L   I P
Sbjct  149  FAIWYNFSYFTCLLENRGVIKSLGESKKLVKGYGFSVFLRLFVLGLITLALLILFYFILP  208

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
                  +L F++ L PF  +Y+YLIY +L+   
Sbjct  209  KNLNILDLIFAIFLQPFVLIYFYLIYKNLREIK  241


>NQU78030.1 hypothetical protein [Candidatus Falkowbacteria bacterium]
Length=392

 Score = 71.3 bits (171),  Expect = 3e-10, Method: Composition-based stats.
 Identities = 58/356 (16%), Positives = 131/356 (37%), Gaps = 23/356 (6%)

Query  109  DSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKP--ATWLNPQNQNWQWAILLATVAY  166
             ++ +F +R   L+ I ++  VLA      ++ +    A             ++L     
Sbjct  28   MAFSIFKKRWKTLIMIQIIPAVLALIFGVLSVFMNSGEARGEELSPLLGLGVLVLVIPVI  87

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
             L   S     + +   K + G     +  +        + IL+ L+  GG LLLI+PG 
Sbjct  88   FLGIWSSYAMMLAVINHKQEKGWKDFYRESIHLFWGLLGITILVTLITFGGFLLLIVPGF  147

Query  227  LFCVWFFFCQYVLADDNIG--GLQALEKSRLLVSGHWWAIFGRFVLLLVISLT-LSFLTA  283
            +F VW+ +  ++ A+   G   + A+ +S+ +V G++W+I  R ++  V++   +  +  
Sbjct  148  IFAVWYGYATWIYAEKGKGSGLINAIRESKRIVKGYFWSIVWRNLVFGVLAGIAIMIVAG  207

Query  284  RIPYVGEAANLAFS------------------LLLTPFSFLYYYLIYSDLKANYRGPQHP  325
             +  +G A    F                   +++ P S L+ YL++S+++         
Sbjct  208  ILGLIGLAFGGIFGQSEVVVTLINDIFGLPIQVIVAPVSMLFAYLLFSNVRDLKNKGIDK  267

Query  326  PIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNR  385
                +W+ +   I   +++    ++SL          + +    + +   Q +   D+ +
Sbjct  268  KPLERWVKILVPIIIVVVLIVPGMLSLVAVKALNVARMKSRDATRTQELNQIKIILDIYQ  327

Query  386  SLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLE  441
                      +    LL  +       G L        +  F    +   +  + E
Sbjct  328  GQNGIYPDDLTELNILLPVRNMSDPLTGELYNYQALEGSSGFKVCAELEVIEQEGE  383


>RJQ14347.1 hypothetical protein C4553_01375 [Candidatus Parcubacteria bacterium]
Length=323

 Score = 70.6 bits (169),  Expect = 3e-10, Method: Composition-based stats.
 Identities = 27/165 (16%), Positives = 53/165 (32%), Gaps = 0/165 (0%)

Query  129  IVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG  188
                     +   L              + I        LL        + +    +   
Sbjct  29   YWRIILLSGAVPSLLYLPVTFGSFGKIPFLIFSLLGVIFLLLFRLALIELAVSQDSSTPS  88

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
                 K GL+       +  +    V GG  L +IP +   +      Y+L  +    + 
Sbjct  89   FGFIWKEGLKRFFPIIWISTITAFAVFGGFFLFVIPAIFLAIILSLGSYILFAEKASPII  148

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
            +L +S   V G+WW I  R+V   +I + ++     +  +G +A+
Sbjct  149  SLARSWHYVRGYWWGIVLRYVYFGIIVVLIALAVGFLAGLGFSAS  193


>OGC78213.1 hypothetical protein A2619_02620 [candidate division WWE3 bacterium 
RIFOXYD1_FULL_39_9]
Length=309

 Score = 70.2 bits (168),  Expect = 3e-10, Method: Composition-based stats.
 Identities = 38/172 (22%), Positives = 74/172 (43%), Gaps = 2/172 (1%)

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
            ++      ++  +    +          G+   +K  L    S   LL++  LV+  G L
Sbjct  129  MVVAFVVAMMQYNVSIMTALNIYNSNLAGIKGMLKTALFRSLSMFFLLVVYGLVILIGLL  188

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            L IIPG++F V F F  YV+ D+N G L ++++S  LV G+   ++ + +   +    L+
Sbjct  189  LFIIPGIVFAVRFGFAPYVMLDENKGALASMKESWRLVKGNTLNVYLKLIGFTITIWFLA  248

Query  280  FLTARIPYVGEA--ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
             L + +  +G     +     +      L+  ++Y DLK     P+     +
Sbjct  249  LLLSPVLALGMYTGISALLLFIGQLVFMLFSVVLYKDLKRIKTAPEPFATPQ  300


>OLD18896.1 hypothetical protein AUI91_09650 [Acidobacteria bacterium 13_1_40CM_3_56_11]
Length=292

 Score = 70.2 bits (168),  Expect = 3e-10, Method: Composition-based stats.
 Identities = 39/201 (19%), Positives = 62/201 (31%), Gaps = 18/201 (9%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                  L        ++          + +S+      +GS   ++IL  LV G   LLL
Sbjct  90   VYFIAYLFSQGATVYAVSELYLGRPTTIGQSLSRVRGELGSLFGVIILNGLVTGLCFLLL  149

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIPG+        C      +N+G   +LE+S  L   +    F   +L +VI     FL
Sbjct  150  IIPGIYMACRLCVCIPAALLENLGPRDSLERSFGLTKDNAGRAFLILLLYVVILYAALFL  209

Query  282  TARIPYVGE------------------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                   G                     N   S+L+TP   +   + Y DL+       
Sbjct  210  FDMPFAFGVQSAAHDPAMLRVWTALMQVGNFVASILVTPVFTIAASIFYFDLRVRKEAFD  269

Query  324  HPPIKRQWLPLTAAIFGWMLI  344
               +         A      +
Sbjct  270  LQLMMNPQAAGVPAPRSATGL  290


>RDZ91371.1 hypothetical protein DEQ92_22740, partial [Haloferax sp. Atlit-6N]
Length=149

 Score = 67.1 bits (160),  Expect = 3e-10, Method: Composition-based stats.
 Identities = 31/148 (21%), Positives = 60/148 (41%), Gaps = 2/148 (1%)

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMK-LGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
              L+       +  I +   +  +   +    +        +  ++   +     +L+IP
Sbjct  1    LFLVLQYMALVATRILVGGYERTIPNDLLTRNIPLAIVNLFVGGIVYSALVVIGSILVIP  60

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            G++  + F F    +A ++   + AL  S  L  G+W  +F  FV++ VI   L  L + 
Sbjct  61   GIIAYLAFVFMTVYIAVEDENFVAALGDSWSLTRGNWLRLFLLFVVIGVIGGVLGVLFSI  120

Query  285  IPYVGEAANLAFS-LLLTPFSFLYYYLI  311
               +  AA+   S L+  PFS L   +I
Sbjct  121  GSMLSPAASTILSALVFLPFSVLSLGII  148


>MBC7853613.1 hypothetical protein [Pirellulaceae bacterium]
Length=259

 Score = 69.4 bits (166),  Expect = 3e-10, Method: Composition-based stats.
 Identities = 29/179 (16%), Positives = 60/179 (34%), Gaps = 4/179 (2%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F       L + ++ + +     F     +   +             +    + ++    
Sbjct  40   FQLYRRNFLAVAVVTLFVFCPIEFMESYCEYFVFEADDIGAIFRLDCVLEGLFGIIAYGS  99

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +       +    +    +++ G+          IL    V  G L+L+IPG+   V F 
Sbjct  100  VIAIGLSDLRGQPLSSLAALRQGIAAWPRLFWTGILSQFAVAIGLLILVIPGVFLAVRFS  159

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
                V  ++ + G+  L++S LL  G+    F  F+ L + SL    L   I  +    
Sbjct  160  LYNCVAVNEGLNGVGCLKRSMLLTKGN----FAGFLSLGIFSLVAMTLLGAIAVIPLIV  214


>MBA3787561.1 hypothetical protein [Actinobacteria bacterium]
Length=231

 Score = 68.6 bits (164),  Expect = 3e-10, Method: Composition-based stats.
 Identities = 39/180 (22%), Positives = 69/180 (38%), Gaps = 16/180 (9%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
              +       ++             +++   R + S   + +L  L+   G LLL+IPG+
Sbjct  1    ATVATGACFKAIADGYLGERAEWRPALRFAARRLHSILWITVLGGLLSILGLLLLVIPGV  60

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-  285
               + F     VL  + + G +AL +SR LV G WW  FG   L  ++   +S   A + 
Sbjct  61   YLYIAFSVAVPVLLTEGLRGRRALGRSRRLVKGRWWGAFGVVALGTILVGIVSGALAGLA  120

Query  286  ----------PYVGEAA-----NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
                      P +G         +  SL+ TP +  +  ++Y DL+          +  Q
Sbjct  121  GAFTTFDTSNPTLGSFLVNTGATVLASLVATPLTAAFVTVLYFDLRVRKEAFDLQLLAEQ  180


>KKW00375.1 hypothetical protein UY34_C0030G0007 [Parcubacteria group bacterium 
GW2011_GWA2_48_9]HCM68769.1 hypothetical protein [Candidatus 
Kerfeldbacteria bacterium]
Length=450

 Score = 71.3 bits (171),  Expect = 4e-10, Method: Composition-based stats.
 Identities = 52/375 (14%), Positives = 116/375 (31%), Gaps = 31/375 (8%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              I      L         L+     L     +  ++  L     ++L + + T S+   
Sbjct  67   WQIVKSRWKLLAGIALIQALIITGVQLLITATSASFSSFLLYTTLLVLMVFFCTLSLTHT  126

Query  182  ICK-TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
            + + T+  +       ++  G +    +L +L   GG +  +IPG++  +      +V+ 
Sbjct  127  VSRVTEGSVSAVAHATIKTYGFYIWTAVLGVLATLGGLVAFVIPGIILSIMLIPLPFVVV  186

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE----------  290
            ++ + G+ AL++   L     W  F + ++L +  L +  +   I +             
Sbjct  187  EEKVHGMAALKRCFALTRDFRWDTFLKILVLGLAFLAVFIVLFLIIFAMWFAVSASRGAA  246

Query  291  --------------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTA  336
                                 LLL  FS  YY +IY DL A +     P    +      
Sbjct  247  LSLGGFLAGEIGFLVIQAILYLLLPAFSQAYYAVIYRDLSAIHPRENDPEPIIRQGKKIM  306

Query  337  AIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSS  396
              F    +   + +S+S   L++  +     +  +      +               +S+
Sbjct  307  LGFMIAGMVFAIPLSISVGFLASTGVYDEFLNYGKITQESVR------IEREYYNYLVSN  360

Query  397  ADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSA  456
             +  +     R         +G      D +  +   P    +L  +  P + +      
Sbjct  361  TEELITDEADRNDIVRSINIIGLQVSLQDYYLKNSVYPATLDELIPTFLPEMLVDPATGE  420

Query  457  RIEIDKVLDDDARDL  471
                    +    +L
Sbjct  421  SYGYALSENGKGWEL  435


>WP_191349191.1 hypothetical protein [Candidatus Neoanaerotignum galli]
Length=265

 Score = 69.4 bits (166),  Expect = 4e-10, Method: Composition-based stats.
 Identities = 32/202 (16%), Positives = 70/202 (35%), Gaps = 3/202 (1%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             L  + L   +         ++    +          + I++  V    L +  +  S  
Sbjct  53   YLQNLMLQVDMDTVMQSTQNMIRFAQSSQWQAILAVFFGIMILNVILTPLLIMAVAASTA  112

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
              +    +  F ++K            +I+ IL VG G +  ++PGL   + FFF +Y +
Sbjct  113  SCLKGVPIRAFSAIKQSFARGFVVVPAVIVYILCVGIGLVFFVVPGLYLAIVFFFFEYAV  172

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE---AANLAF  296
              +  G   +L++S  LV G +W      V L ++    S++   +  +      A +  
Sbjct  173  ILEEKGVFGSLKRSMFLVKGFFWKTALAAVCLFLMRYAASYVLNVVASLLGGSYLAGVLV  232

Query  297  SLLLTPFSFLYYYLIYSDLKAN  318
             +        +  ++       
Sbjct  233  GVCAMAVESYFAVVMTLYYLNR  254


>MBI5092810.1 hypothetical protein [Candidatus Hydrogenedentes bacterium]
Length=268

 Score = 69.4 bits (166),  Expect = 4e-10, Method: Composition-based stats.
 Identities = 33/204 (16%), Positives = 67/204 (33%), Gaps = 1/204 (0%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI-LLGLSWMT  175
                 L I L+    +         +   T            + L        +  + + 
Sbjct  35   WFLPRLAISLISETYSVTHENIGFAVWSGTMYLSHPPLSVAILNLFVFILTDTICTAAVA  94

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
             S+       D+ +          +G      IL  L +  GS+L +IPG+   + +F  
Sbjct  95   WSVTQRFLGFDISVRDCFLAVSSRIGRIFGASILSGLGIVFGSILCLIPGVYLALAWFIL  154

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
              VL  +++   +A+ +SR L+ G         V+    +L +S L + IP +  +   +
Sbjct  155  YPVLMYEDLRATEAMARSRNLMRGQKLRALAVIVITTAAALAVSLLFSIIPGLYFSIVAS  214

Query  296  FSLLLTPFSFLYYYLIYSDLKANY  319
             ++      F         + A  
Sbjct  215  AAIATIVLVFNAAMASVMYVSARC  238


>RJR30374.1 hypothetical protein C4564_00115 [Candidatus Microgenomates bacterium]
Length=410

 Score = 71.0 bits (170),  Expect = 4e-10, Method: Composition-based stats.
 Identities = 55/374 (15%), Positives = 119/374 (32%), Gaps = 16/374 (4%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
             ++    ++L  +     + I+L  +      +   L         P +      + +A 
Sbjct  16   WRVYKKHFKLIIQVNILSVLIFLGALAFFVVLLLGLLGTSIVVKQLPLSTTVALIVPIAI  75

Query  164  VAYILL----GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
            +  +L+     L+     + +    + +   +  K     V ++  L+IL   +  GG  
Sbjct  76   IFLLLVQSLTALAIAFLVVDVSGQGSSLSAPKYFKKAKPLVLAYFPLIILSAFLTFGGYF  135

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
              + PG+LF +WF F  Y    +   G +AL  SR  + GH + +F R  L  + +  LS
Sbjct  136  FFLFPGILFSIWFSFSVYTF-IEGKRGFEALFTSRDCIKGHTFGVFWRVALFGLSTYLLS  194

Query  280  FLT------ARIPYVGEAANLAFS-LLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWL  332
             L         +  +G+ A    +  ++ P S L+ Y I+  LKA            + L
Sbjct  195  ALLKYFFDKLGLSVLGDIATAVINWAIIVPLSLLFSYQIFLSLKAMKPELVSALTLNRKL  254

Query  333  PLTAAIFGWMLIPGLLLVSLSR--QNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEE  390
                     +++   ++        ++    +    +   +R G    Q      +    
Sbjct  255  KYFTVSLIGIVVFTGIVALFVPRVNDIKNVFISPDYEGTYKRNGEISNQYNQAYDTKRRM  314

Query  391  PQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADD--QNPHLWLKLELSDFPNL  448
               + +       ++               T         D  ++       E+   P  
Sbjct  315  DISIITNAVYQYAAENNGVLPSDTEFPATPTCIGTAPECFDLAKDIFPTYISEMPMDPED  374

Query  449  SLAQKGSARIEIDK  462
               +     I +  
Sbjct  375  GSEENTGYTIYVKP  388


>KKU10417.1 hypothetical protein UX13_C0012G0001, partial [Candidatus Woesebacteria 
bacterium GW2011_GWB1_45_5]
Length=205

 Score = 67.9 bits (162),  Expect = 4e-10, Method: Composition-based stats.
 Identities = 33/128 (26%), Positives = 62/128 (48%), Gaps = 0/128 (0%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
              +      L   +    ++   +    +    + K   +++ +F +  +L+ L+V GG 
Sbjct  49   WSVVAFIAGLWAQAAGYEAVKRSVKGGALEFKDTFKSSRKYLLTFFITNLLVGLIVVGGF  108

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            +LLIIPG++F VWF F  + + D   G  ++L++S+ LV G +W + GR  + +V     
Sbjct  109  ILLIIPGIIFAVWFSFSLWGVVDKGYGVGRSLKESKALVKGRFWKVLGRIFVYVVFVTLF  168

Query  279  SFLTARIP  286
              L A  P
Sbjct  169  QVLFAAFP  176


>NIM49202.1 hypothetical protein [Gemmatimonadales bacterium]NIN10613.1 hypothetical 
protein [Gemmatimonadales bacterium]NIN49375.1 hypothetical 
protein [Gemmatimonadales bacterium]NIP06839.1 hypothetical 
protein [Gemmatimonadales bacterium]NIR01513.1 hypothetical 
protein [Gemmatimonadales bacterium]
Length=250

 Score = 69.0 bits (165),  Expect = 4e-10, Method: Composition-based stats.
 Identities = 25/221 (11%), Positives = 59/221 (27%), Gaps = 8/221 (4%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
                    +   ++                   Q+    +A  + +    L+        
Sbjct  22   YRTYFATLVSIAIVCEGVPAVMNTYVELGGGPLQHPVMWFAAFVLSGLGGLVAAGATIWV  81

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
            +       D  +  ++   L  +    +  +   ++V   SL  +IPG++    +     
Sbjct  82   ISEVYLGRDPLIGNALGYALGKIVQLFIAGLAKYILVFITSLFFLIPGIIVACGYAVVTQ  141

Query  238  VLADDNI-GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV-------G  289
             +  + +     AL +S  L  G         +++  + +                    
Sbjct  142  AVVLEKLPSATDALGRSWKLTKGFKGKALVLGIVVFALIMLPLMAAGAFAVFVPGLETTF  201

Query  290  EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
                    LLL P     + L Y DL+          + +Q
Sbjct  202  TVGGQLLQLLLYPIVACAFTLFYYDLRVRKEAFDLEHLSQQ  242


>PIE49413.1 hypothetical protein CSA39_02750 [Flavobacteriales bacterium]
Length=289

 Score = 69.4 bits (166),  Expect = 4e-10, Method: Composition-based stats.
 Identities = 33/155 (21%), Positives = 55/155 (35%), Gaps = 0/155 (0%)

Query  144  PATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSF  203
               +L        + +L   V Y LL ++           K DV         L+ +  F
Sbjct  71   SGGFLASFFGVAFFFVLAIIVLYSLLMITSFYYIKSYVDNKGDVSFQEVKSNVLKKIWKF  130

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
             +L  ++      G  L +IPG+   V F     ++   + G   AL  S  LV G WW 
Sbjct  131  LILSFIVGFTTVIGMYLCLIPGIYIGVVFSMAAPLMIFKDYGVGDALSNSFNLVKGVWWP  190

Query  264  IFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
             FG  + + ++   L    +    +     +  SL
Sbjct  191  TFGVLITVYLLIAILGQAFSLPALIYMFVKMGLSL  225


>MBI4252560.1 hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=255

 Score = 69.0 bits (165),  Expect = 4e-10, Method: Composition-based stats.
 Identities = 41/196 (21%), Positives = 75/196 (38%), Gaps = 1/196 (1%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
             +   W LF R  W LL I ++  VLA  P     + +    L            +A + 
Sbjct  45   YMGAVWPLFKRHFWLLLAIVIIQQVLANLPSAITGVAQGLFSLGENEPISAILSFVAMIV  104

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
             + +  + + G M   +      L   +    R    +    IL  L+  GG  LL+ PG
Sbjct  105  IVTILQAGLVGIMLTVVSGGTPRLGD-LFSKTRVFWRYLGCSILYGLITVGGFFLLVFPG  163

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            +++ + F    Y + D N G ++AL+ S     G+  ++F  ++ L+ ++L         
Sbjct  164  VIWFLKFSLWSYFIVDKNAGVIEALKMSSQATKGYKPSLFVLYIYLMSLNLLGLSALFVG  223

Query  286  PYVGEAANLAFSLLLT  301
             +V     L     + 
Sbjct  224  FFVTAPMTLLILAWVY  239


>AHB41123.1 Integral membrane protein [candidate division SR1 bacterium RAAC1_SR1_1]
Length=272

 Score = 69.0 bits (165),  Expect = 5e-10, Method: Composition-based stats.
 Identities = 30/186 (16%), Positives = 67/186 (36%), Gaps = 11/186 (6%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
               +L    +F   +L+       Q      AIL   +    L + +    + + + +  
Sbjct  90   WKYMLGVVLVFFLQILQQEISEPNQPMTLAIAILTILLGIAYLRVDFGLKGLSLSLVEDK  149

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            +     + +       + +  I++++    G +L I+PG+   +      Y++   N+G 
Sbjct  150  IIKSLDIFVSAEKFVKYFVAYIIIVVFSLIGIILFIVPGVFVALRLNMVPYLILSKNLGP  209

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFL  306
             +A+++SR L  G    +F   V           L   I  +G  A +       P  ++
Sbjct  210  WEAIKQSRKLTKGKVSNLFALNV-----------LLGFINILGFVALIVGLFWTLPLFYI  258

Query  307  YYYLIY  312
               + Y
Sbjct  259  ANAVFY  264


>TLX97152.1 zinc ribbon domain-containing protein [Thaumarchaeota archaeon]
Length=163

 Score = 66.7 bits (159),  Expect = 5e-10, Method: Composition-based stats.
 Identities = 28/155 (18%), Positives = 56/155 (36%), Gaps = 4/155 (3%)

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
            +  ++ ++V  G + L++PG++  + F      L  +  G  ++L +SR LV   W   F
Sbjct  1    MSSVVGVIVVAGLIALVVPGIILAIMFSLAFPALLIEGTGVSKSLGRSRELVGHRWLKTF  60

Query  266  GRFVLLLVISLTLSFLTARIP----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
               ++  +I+   S + + I     +     +   S L  P   +   + Y    A    
Sbjct  61   ALALVFGIITAIASAIVSAISGPFGWASNIVSSILSALYVPLIPIALTVYYYSNVARLAQ  120

Query  322  PQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQN  356
             Q   +         A   +    G  L S +   
Sbjct  121  LQVSQVPIAPAAAVQAGMKFCPSCGTQLASSATFC  155


>QNN23843.1 hypothetical protein HED60_16730 [Planctomycetales bacterium 
zrk34]
Length=287

 Score = 69.4 bits (166),  Expect = 5e-10, Method: Composition-based stats.
 Identities = 25/197 (13%), Positives = 60/197 (30%), Gaps = 23/197 (12%)

Query  455  SARIEIDKVLDDDARDLYDRQ--HSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGT-QA  511
            +  IE   V D +   L   +        +   +   Q       +   S+ L     Q 
Sbjct  74   AYIIEPKSVTDVEGHKLLPPRAGRWSTSSSHPPIQRLQMGPGRGGTQGVSLSLNNLKGQP  133

Query  512  EQVHSILGKLELTLPLAIESLQLTRNDIGK-TLQIGGKQLILQRLGSN----AVTLRFLG  566
              + SI G + L        +++      + T      ++ ++++        V   F G
Sbjct  134  AAIRSIRGVIHLLEVTGRREVEIPLESDDQWTDVADDVKVRIKKVQKQSQNYTVQYEFSG  193

Query  567  DRTD----------LLNVHASNSHAE--PLREIGFTWQKSGDAFSLRQMF---DGNIESI  611
                          ++     +S     PLR   + +    +  ++R            +
Sbjct  194  ADPKAMTELDSPPFVMMGLPLDSDGNTMPLRSSPYHYVSGSNLATVRFYMTDPAAKPAKL  253

Query  612  TVLVAGDSMTQSYPFEL  628
             +++A ++  +   F+L
Sbjct  254  QLVLATETTERELEFDL  270


>KKS32243.1 hypothetical protein UU93_C0008G0005 [Candidatus Amesbacteria 
bacterium GW2011_GWA2_42_12]
Length=243

 Score = 68.6 bits (164),  Expect = 5e-10, Method: Composition-based stats.
 Identities = 50/201 (25%), Positives = 86/201 (43%), Gaps = 7/201 (3%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            + G+   GI L         L   A               +     I+   +     +  
Sbjct  38   IFGLVQTGITLVSIIPLIVGLTLIAVKSAILGTIITVIGAIIGFFAIVALQAAGFYQIQS  97

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +  +   +   + LG +      L L+LL LV   G LLLIIPG++F VWF F   ++ 
Sbjct  98   IVNTSRKSIRELLSLGKKLALPLFLTLLLLSLVTILGYLLLIIPGIIFSVWFIFTIVIMI  157

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGR----FVLLLVISLTLSFLTARIP---YVGEAAN  293
            ++++ GL AL+ SR LV GH+W +        +L  + S  +S L + IP   ++G+  +
Sbjct  158  NEDVWGLAALKSSRNLVKGHFWKVASYLGFCLILTFIYSFAVSLLVSIIPGGKWIGQLLS  217

Query  294  LAFSLLLTPFSFLYYYLIYSD  314
               ++ +   S L+ Y +Y  
Sbjct  218  NILNIPVQTISLLFIYNLYHQ  238


>OGF21061.1 hypothetical protein A2257_01450 [Candidatus Falkowbacteria bacterium 
RIFOXYA2_FULL_38_12]OGF32480.1 hypothetical protein 
A2316_03040 [Candidatus Falkowbacteria bacterium RIFOXYB2_FULL_38_15]OGF42438.1 
hypothetical protein A2555_00645 [Candidatus 
Falkowbacteria bacterium RIFOXYD2_FULL_39_16]
Length=240

 Score = 68.6 bits (164),  Expect = 5e-10, Method: Composition-based stats.
 Identities = 36/181 (20%), Positives = 64/181 (35%), Gaps = 1/181 (1%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
             +       ++L  I   F P     ++  +             +    +   L     +
Sbjct  13   WKIYKDNFLLFLKIISWLFIPTVIWTIIAVSDLTKVAAVPIDICLAFIYIILSLFVSIAL  72

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
              +        +V L     L    +  F  + IL  L + GG LLLI+PG++F V F F
Sbjct  73   ILASDNLAKGKNVDLKELFNLTYSKLLPFLWVSILANLAIFGGILLLIVPGIIFAVLFSF  132

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS-LTLSFLTARIPYVGEAAN  293
                   D   G  AL  S+ LV  ++W +  R+V    +  + +S +   + Y+     
Sbjct  133  APMATLLDGEKGTLALSYSKKLVKDNFWGVLWRWVASYFMYGVIISVIALGLTYIIGIVT  192

Query  294  L  294
             
Sbjct  193  G  193


>MYH55686.1 hypothetical protein [Acidimicrobiia bacterium]
Length=193

 Score = 67.5 bits (161),  Expect = 5e-10, Method: Composition-based stats.
 Identities = 31/186 (17%), Positives = 63/186 (34%), Gaps = 10/186 (5%)

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG  215
                L+  +    +  + +   +             +    +  + S T+L ++  + V 
Sbjct  1    MGPWLVIRLLITSILFAAVIRMVGEVFLGVKSSWSENTAASISRMASLTVLTVVFWVGVT  60

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
             GS L ++PGL   V +     VL  +  G + A  +S  L SG    +F   V L  + 
Sbjct  61   MGSALFVLPGLFLVVSWSASLGVLIIEGAGPMAAFRRSWELTSGRRLIVFVTLVPLTFMV  120

Query  276  LTLSFLTARIP----------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
            +  + L   +                A+    ++  P   +   L+Y DL+         
Sbjct  121  VVTNILILLVLGPLGNYLAGDLGVFLASELAWVVTQPLIGVLLGLLYIDLRVRRDQLNAN  180

Query  326  PIKRQW  331
             + R+ 
Sbjct  181  MLTREM  186


>HHY99918.1 hypothetical protein [Tissierellia bacterium]
Length=542

 Score = 71.0 bits (170),  Expect = 5e-10, Method: Composition-based stats.
 Identities = 47/386 (12%), Positives = 114/386 (30%), Gaps = 13/386 (3%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F         +  + ++   + I +                    +L+ ++  I L LS 
Sbjct  65   FEILIKHWFILLFILVLTLLSSIINLPFTGYIYANPESAIRTLLLMLIISLPIISLVLSV  124

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
                +   +    +         +R + +  +  ++ +++   G +LLI+PG++F +   
Sbjct  125  FVRIILNMLDNRTIEFRN--LTSVRIIINVLIYSVMYLIITMVGYILLIVPGVIFSIRLL  182

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
               Y++ D+N+   +A++KS  +  G+ W IF   VL+ + SL L            AA+
Sbjct  183  LGFYLIVDENLHAFEAMKKSWAITRGYSWKIFWYLVLICIFSLILELTLKHWFVYICAAS  242

Query  294  LAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLS  353
            +   +      FLY  ++                   +       F   ++  +++VS  
Sbjct  243  IFSVIYWFATVFLYRTILSQWQSNVGPIEDMRYNALNYNFPMLNNFIIAVMVIIIIVSSI  302

Query  354  RQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEG  413
                  +           +  +              + Q             +      G
Sbjct  303  IYAKIGKPSFEDMYQPDFQYSSINHNDYLSYLDTYSDTQIPLPLPPVSTDMNRISIPEIG  362

Query  414  GLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARD-LY  472
             + +       ++               +  +         + +  I +  D D    + 
Sbjct  363  NIDIPDSMEIQNQADRQS----------VEAYRKAVDLPPSNYKGVIFRQKDKDKYSCIL  412

Query  473  DRQHSFEHPAFHWVGINQTDENDLFS  498
               +  E   F  +  + T      S
Sbjct  413  VNTYPGEPGDFFKLSEHFTLSEKELS  438


>MPZ48671.1 hypothetical protein [Dehalococcoidia bacterium]
Length=243

 Score = 68.6 bits (164),  Expect = 5e-10, Method: Composition-based stats.
 Identities = 33/216 (15%), Positives = 68/216 (31%), Gaps = 8/216 (4%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               + ++        I  AL+      +   +  +    L   +    L  +     +  
Sbjct  28   WTQLCVVVAPAVLVSILIALISVLVADVVWLSTVFLLISLPIDLMAYELVSAAGIALLIA  87

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS--LLLIIPGLLFCVWFFFCQYV  238
               + D+    ++        +  +       +       ++ I   +   V + F    
Sbjct  88   REQRQDISTGDALDAAQDRFRAVIVAAFKTTAISVLLCLTIIGIPWAIKRLVLWAFIIQA  147

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY------VGEAA  292
            +  D   G  +L  S  LV GHWW  FGR V   +++   + + ++I        +G   
Sbjct  148  IMLDRQTGEASLGYSAGLVKGHWWNTFGRLVACFLVAGIPALIVSQIVLEAVPGTLGLIL  207

Query  293  NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
            +   S +  PF  +   L+Y DLK            
Sbjct  208  SSTASFISLPFGIIATTLLYFDLKVRGAANDDLSPA  243


>MBI4010427.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Candidatus Aenigmarchaeota archaeon]
Length=226

 Score = 68.3 bits (163),  Expect = 6e-10, Method: Composition-based stats.
 Identities = 42/206 (20%), Positives = 74/206 (36%), Gaps = 7/206 (3%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            + +  +     L + +L I       +                     I    +    + 
Sbjct  13   FTIVTKNKIVFLPLTILTIFSTIFTFYYYGYPSSMFNGAGVTPTLFEIIAAILLVVFSIF  72

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            +   T  +     +  V L  S K+  +    F +  IL  L+V GG +LLIIPG++  +
Sbjct  73   VHLWTIHLTSSAVRRRVSLESSAKVASKSFFKFAIATILYGLIVFGGFILLIIPGIILSI  132

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-----  285
               F  Y +  DN   + +L+KS  +  G WW  FG   L+ VI++ + F          
Sbjct  133  RLLFYPYAIVLDNSKIVDSLKKSWRVTKGRWWKTFGISFLIGVITMLVIFPIYFAVIFIY  192

Query  286  --PYVGEAANLAFSLLLTPFSFLYYY  309
                +        S+ L+ +S   Y 
Sbjct  193  RDLVLATVLLDILSIFLSAWSIATYT  218


>MBC8521355.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Methanomicrobia archaeon]
Length=172

 Score = 66.7 bits (159),  Expect = 6e-10, Method: Composition-based stats.
 Identities = 22/143 (15%), Positives = 56/143 (39%), Gaps = 5/143 (3%)

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLR--HVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
                 ++    + ++ +  +         + S  +  +L+ ++V  G L L+IPGL    
Sbjct  4    IGISMVYDLEKRGEIDISSAFNKCFSFDRLPSILVSSLLVEIIVLAGLLALVIPGLYLMC  63

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV---LLLVISLTLSFLTARIPY  287
                 ++  A +  G +++++ S  +  G    IFG  +   L  ++ +  + +   I  
Sbjct  64   RLSLTEHATAIEEKGAIESIKMSWDITKGRAGEIFGILLESGLFSIVFVIPALILILIGI  123

Query  288  VGEAANLAFSLLLTPFSFLYYYL  310
            V        +++ T   F+ +  
Sbjct  124  VSGIEVAVITIIATGIGFIAFIF  146


>WP_119319608.1 hypothetical protein [Capsulimonas corticalis]GCE52186.1 hypothetical 
protein CCAX7_07000 [Capsulimonas corticalis]
Length=335

 Score = 69.8 bits (167),  Expect = 6e-10, Method: Composition-based stats.
 Identities = 33/231 (14%), Positives = 69/231 (30%), Gaps = 32/231 (14%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
               +    + L  +T +         V +  +    LR +    +  IL  +++  G++L
Sbjct  87   PLYLIVFAVELCVLTAATSARYLGEPVTMRGAYGSVLRRIIPLIVTSILYGVLISVGAVL  146

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS-----  275
             +IP +          +V   + +   +AL +SR LV      +FG   LL +I      
Sbjct  147  CVIPVIFPLTLLAMLGHVFTIEKLSYFKALGRSRALVGWDALRVFGSLFLLWLIGSILIL  206

Query  276  -------LTLSFLTARIP--------------------YVGEAANLAFSLLLTPFSFLYY  308
                     ++ +   +P                     V E  +    L++TPF     
Sbjct  207  AFEMAIRFLVTSIIQALPGAQAMTSGNSVVGGYTVTDHVVSEIGDGLGQLIVTPFIVCVL  266

Query  309  YLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSA  359
             ++Y DL+          + +        +              +      
Sbjct  267  TVLYYDLRVRREAFDIALLAKDLGYPPLNLRPNAPGHVPAPPPAAPLPQQR  317


>PIS20176.1 hypothetical protein COT53_01965 [Zetaproteobacteria bacterium 
CG08_land_8_20_14_0_20_55_17]PIW42702.1 hypothetical protein 
COW19_06665 [Zetaproteobacteria bacterium CG12_big_fil_rev_8_21_14_0_65_55_1124]PIY53716.1 
hypothetical protein COZ01_02790 
[Zetaproteobacteria bacterium CG_4_10_14_0_8_um_filter_55_43]PIZ38823.1 
hypothetical protein COY36_05025 [Zetaproteobacteria 
bacterium CG_4_10_14_0_2_um_filter_55_20]PJB81449.1 
hypothetical protein CO089_04515 [Zetaproteobacteria bacterium 
CG_4_9_14_0_8_um_filter_55_31]
Length=516

 Score = 71.0 bits (170),  Expect = 6e-10, Method: Composition-based stats.
 Identities = 43/448 (10%), Positives = 114/448 (25%), Gaps = 18/448 (4%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
            +   +          L G  ++  ++    +    ++                ++   + 
Sbjct  1    MWDATLFSRLVWSIKLFGRTIMVQLVVGLVVLVLGIICGILAPVAPALIVFVMVVGYLLF  60

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV--------GGG  217
               +    M              L  +++ GL  +      ++L++L+           G
Sbjct  61   VSWMIQVLMAAFAHCMSTDESPPLKTTLRGGLGRLPVMGNAMLLVMLIFAPLGLAVSLLG  120

Query  218  SLLLIIPGLLFCVWFFFCQYVLAD------DNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
             +  ++  +   V ++F   +         +++G  QA+++S  L SG    + G  + L
Sbjct  121  QIHPLVSFIGMLVIWWFSLRLYTLAGVVAMEDVGPWQAIKRSWQLSSGFVLRMLGNALFL  180

Query  272  LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
             +I + L  +   I ++         +     +     +      A+            W
Sbjct  181  ALIFVVLMLILVGISWL----TGLSGVTQQMSAMANSAVAAGGPAADPMSMLFGMGSAAW  236

Query  332  LPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEP  391
            LP    I G +L+  L+ V +   +           D                  L    
Sbjct  237  LPFGILIVGGLLLEVLISVLMMAYSFFFYVEQKMAHDGGTPKWQPAGMADRKQWVLYLAL  296

Query  392  QRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLA  451
              L +    L ++    T              +             +          +  
Sbjct  297  VGLLALAPSLAVALAPPTVENEPARAISQAPQSTSPKTTPAVKLAPVVRSEVAKREGAKP  356

Query  452  QKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQA  511
                  +  +  +      + + +   E  A                    + +      
Sbjct  357  ADKVVAVAKNTAVTAAQSGVSEDEQIPESNAHLVKAPELDIAKMRDPFESYLTVLDQQSK  416

Query  512  EQVHSILGKLELTLPLAIESLQLTRNDI  539
            +++    G      P  +E+  L    +
Sbjct  417  QRMEQHRGSGADHDPEPLEAFDLGALRL  444


>NCB78264.1 hypothetical protein [Negativicutes bacterium]
Length=242

 Score = 67.9 bits (162),  Expect = 8e-10, Method: Composition-based stats.
 Identities = 37/198 (19%), Positives = 72/198 (36%), Gaps = 4/198 (2%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            W LL  +   IVL         +    T      ++  + ++LA+    +L    +   +
Sbjct  22   WKLLREHFSPIVLLAGLGALPGIYLVHTMPETALESPPFLLILASALISMLSYMSIIILI  81

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
               +    + L    +     +       +L +L + G  LLL IPG++  V + F    
Sbjct  82   DDVVADRKISLVHIFERASARLPLAITTGLLCLLRLFGWFLLLFIPGIIKSVRYSFSLQA  141

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA----RIPYVGEAANL  294
            +        +A+  S  LV GHWW++    + + +    LS   A        V      
Sbjct  142  VVLREKAHGEAIHYSINLVRGHWWSVVIAGICISLPQFILSLPFAASHPASGSVTSLLLA  201

Query  295  AFSLLLTPFSFLYYYLIY  312
               +L T F ++   +++
Sbjct  202  VVQILTTAFWYVGETVLF  219


>PSQ32180.1 hypothetical protein BRD09_04410 [Halobacteriales archaeon SW_10_68_16]
Length=252

 Score = 67.9 bits (162),  Expect = 1e-09, Method: Composition-based stats.
 Identities = 30/169 (18%), Positives = 59/169 (35%), Gaps = 7/169 (4%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVG--SFTLLLILLILVVGGGSL  219
             ++   +L       ++  ++      +              +F +  I   +VVG G +
Sbjct  78   LSLLVTILSALVAIAAIRTFVAGETESIPEEYFTRSIVWVAVNFVIGGIAFAIVVGIGLV  137

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
             L++PGL   V  FF Q  +A ++   L+    S  L  G  + +F   V+++ ++L +S
Sbjct  138  FLVVPGLFLLVSLFFWQVFIAVEDENFLEGFRHSWQLTKGRRFGLFVLGVVVIFVALVIS  197

Query  280  FLTA-----RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                         +        S L+  F        Y+ L A      
Sbjct  198  IAFGIPGVFFPAILALLVEQVGSALVFVFVLAATAEAYNQLTAADHEGD  246


>MBI3790869.1 hypothetical protein [Gemmatimonadetes bacterium]
Length=258

 Score = 67.9 bits (162),  Expect = 1e-09, Method: Composition-based stats.
 Identities = 27/191 (14%), Positives = 62/191 (32%), Gaps = 8/191 (4%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             +     L  + +T             L ++ +  +    +  +  ++  ++V  G++ L
Sbjct  68   ISTVSYGLMSAVVTQLGSAAYLGEQPDLAQAFRQAMPKSVTLIVAGLIRSVLVAMGAVFL  127

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            ++PGL     +F    V+  +  G   A  +S  L +G    IF    L  ++   +   
Sbjct  128  LVPGLYLFARWFAIIPVITIEGRGLGAAFTRSSALSAGRKRHIFNTLGLAWLLFWIVGMG  187

Query  282  TARIPYVGEAANLA--------FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
               +  +    +           ++LL P   L   ++Y D +    G     + +    
Sbjct  188  LGIVFILLSFGSPVLATLLTTLVTILLYPVIGLTETVLYYDARIRGEGYDLERMAQALAV  247

Query  334  LTAAIFGWMLI  344
              A        
Sbjct  248  PEATPGTPRPA  258


>MBO10016.1 hypothetical protein [Planctomycetaceae bacterium]
Length=361

 Score = 69.0 bits (165),  Expect = 1e-09, Method: Composition-based stats.
 Identities = 51/333 (15%), Positives = 92/333 (28%), Gaps = 37/333 (11%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                C HCG    T    +       +CP C  ++    A +            P     
Sbjct  54   FEFNCVHCGYYLKTDDRDVAEA---TQCPVCGNSVDPPAAGNNEPPGIGVGNELPGQDDA  110

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                 + +      ++          +        G  +                     
Sbjct  111  SGPSPESISPGRLAIDDVFSTAWSMFKDHLGLAMGGVWIHG-------------------  151

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                +LGI+        +LL +                       +   ++       + 
Sbjct  152  ---IVLGIISTPTSFAQSLLQENPPEELAGVLVAVVIGGNILSLLLAAYMNGGLMLFLLK  208

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            + + +      +  G R+ G   L  I   ++VG G L  +IPG++F + +   Q+VL D
Sbjct  209  LARGERAEIADIFRGGRYFGRMLLCTICFGIMVGLGCLACVIPGVIFALMYGVYQWVLVD  268

Query  242  DNIGGL-QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
             +  GL  AL +S+ +  GH   +             + F    I  VG  A     +  
Sbjct  269  RDPSGLTDALSQSKQVTDGHKLQLL-----------VIGFAVGCINIVGVLACCVGYIFT  317

Query  301  TPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
             PFS L   + Y  +                 P
Sbjct  318  VPFSNLIMAVAYLRMTGQPTVMDVGWTSANDQP  350


>ELY56510.1 hypothetical protein C491_13262 [Natronococcus amylolyticus DSM 
10524]
Length=193

 Score = 66.3 bits (158),  Expect = 1e-09, Method: Composition-based stats.
 Identities = 28/164 (17%), Positives = 59/164 (36%), Gaps = 5/164 (3%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             +   +  +             + +      +   R   S  +  I++ + +  G  LLI
Sbjct  23   YLVMAVYFVVVARAFARPAHKLSSIPSELYSRRIGRATLSMIVAGIIVFVSIMIGFFLLI  82

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL-  281
            IPG+   V F F  +V+  ++ G + +L +S  L  G+   +    V L  I  ++  + 
Sbjct  83   IPGIFLSVCFLFFLFVIGVEDRGVIASLRESWDLSRGNRLKLAVVVVFLGAIGASIGMIG  142

Query  282  ----TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
                   +P  G+  ++  + +L  F +      Y  L      
Sbjct  143  AIFELIGVPVAGDLLSILVNAVLFIFIYGLLADAYLQLSGESER  186


>WP_199556060.1 hypothetical protein [Sandaracinobacter sp. SZY PN-1]
Length=248

 Score = 67.5 bits (161),  Expect = 1e-09, Method: Composition-based stats.
 Identities = 38/229 (17%), Positives = 78/229 (34%), Gaps = 8/229 (3%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            + L  +     L +  +          +      AT     +    + + L       L 
Sbjct  19   FLLLRQNAVPFLILGFVFYGFPGLAAAAVRGFPTATQQLQASFGAAYFVPLLLTLVGSLV  78

Query  171  LS-WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
                M   +          L   +  GL+   +   +L+L +L    G L L++PG++  
Sbjct  79   ALPAMLRFLGEKRAGETPELGGMLLEGLKLAPATFAVLLLHLLASMVGWLFLLVPGIIIY  138

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
            + F     VL ++  G   +L++SR L  G  W IF   ++  ++ + +S       ++ 
Sbjct  139  IMFCVSLPVLTEERAGVTGSLKRSRELTKGSRWRIFLLLLIGWLVLMVISAPIYAAVFLS  198

Query  290  -------EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
                      + A + +  P S      +Y +LK    G     +   +
Sbjct  199  PDSGVLNALISAAANTIFLPLSATMLASLYFELKDVREGAGTETLAEIF  247


>NIP23869.1 hypothetical protein [Phycisphaerae bacterium]NIX27684.1 hypothetical 
protein [Phycisphaerae bacterium]
Length=197

 Score = 66.3 bits (158),  Expect = 1e-09, Method: Composition-based stats.
 Identities = 39/133 (29%), Positives = 61/133 (46%), Gaps = 1/133 (1%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             V   L     +T  +     +  V + +  +    ++     + IL  L V GG+LLLI
Sbjct  11   AVIIGLWVYVTLTRVIAKLYKEEKVNVKQVYQKAWDNIIPLLWVSILTGLAVFGGTLLLI  70

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF-L  281
            IPG+LF VWF F   +   +   G  AL +S+ LV+G WW++  RF+   V+   L + +
Sbjct  71   IPGILFAVWFAFSSPINVLEGTRGSAALSESKQLVAGKWWSVLWRFLATYVVYGILFYTV  130

Query  282  TARIPYVGEAANL  294
            T  I      A  
Sbjct  131  TILIMVAIGLATG  143


>KKR21395.1 hypothetical protein UT48_C0009G0027, partial [Parcubacteria 
group bacterium GW2011_GWE2_39_37]
Length=133

 Score = 64.8 bits (154),  Expect = 1e-09, Method: Composition-based stats.
 Identities = 36/132 (27%), Positives = 68/132 (52%), Gaps = 6/132 (5%)

Query  198  RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLV  257
             +V +F    +L+ ++   G +LL +PG+++ V + F  YV+  + +   QA+++S+ LV
Sbjct  1    PYVLNFITTSLLVAVICLLGFVLLAVPGIIWTVVYAFASYVVVFEGLKNWQAMKRSKELV  60

Query  258  SGHWWAIFGRFVLLLVISLTLSFLTARIP------YVGEAANLAFSLLLTPFSFLYYYLI  311
             G WW++  R +++L IS+ +S  +A +P       V +  +   S  + P    Y YLI
Sbjct  61   KGFWWSVALRSLVILGISIVISIPSAILPDKSGSQTVYDIVDSIISFFIAPIFITYSYLI  120

Query  312  YSDLKANYRGPQ  323
            Y +L        
Sbjct  121  YKELTKIKEIKH  132


>MAQ77366.1 hypothetical protein [Candidatus Campbellbacteria bacterium]
Length=259

 Score = 67.5 bits (161),  Expect = 1e-09, Method: Composition-based stats.
 Identities = 29/154 (19%), Positives = 52/154 (34%), Gaps = 4/154 (3%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSM-KLGLRHVGSFTLLLILLILVVGGGSLL  220
                       +    +   +   +            +  G   L  +L+ + +  G LL
Sbjct  61   LIQLVSFFVTLYSMKYLLNLVDGKETSFQGVWESFTWKQFGYGLLAYVLMSIAILAGYLL  120

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL---T  277
            LI+PG++    F      +AD+    L+AL++SR L  G+   IF   + + +       
Sbjct  121  LIVPGIILTYMFAMVVPTIADNTTKPLEALKESRRLTQGYKMKIFLTTLSVAIHYYALPI  180

Query  278  LSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
             + L   I  +     LA  L      F  Y   
Sbjct  181  FAVLLGVIAGMMGIIWLAGILFAVAVGFGIYASF  214


>WP_051192030.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Microbacterium luticocti]
Length=558

 Score = 69.8 bits (167),  Expect = 1e-09, Method: Composition-based stats.
 Identities = 28/332 (8%), Positives = 61/332 (18%), Gaps = 40/332 (12%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
            + + L I +     L  I+ G             +  +     + + +S  L  G +W I
Sbjct  171  VAVPLAIALTVLAVLGGIVAGFWISTKLALVAPTIILEKATIREGIVRSWRLTRGRFWPI  230

Query  265  FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP----------------------  302
            FG  V+   I L    +   I +         S  + P                      
Sbjct  231  FGIVVV---IQLIFGTIAQVISFPFSLFGGILSAFIAPTGAPDTSAIISMLVTVGAAEVM  287

Query  303  ----------FSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSL  352
                           Y ++Y D +  + G     +         A               
Sbjct  288  VLLIQSIATVVQSTAYSILYIDCRMRHEGLDLDLLAYVERRDAGAHDL-----PDPYTQH  342

Query  353  SRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSE  412
              +  +          +  +                   +           +   +    
Sbjct  343  IGRAFARPVAAPGYPSVPGQPYPPGAYPQAPGYPPQYGARMPQPYPSAQPPAPYGQAPQA  402

Query  413  GGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLY  472
                               Q P              +   +    +          +   
Sbjct  403  PYGQAPQAPYGQAPQAPYGQAPQAPYGQAAQAPYGQAPYGQAPQALYGQAPQAPSGQASQ  462

Query  473  DRQHSFEHPAFHWVGINQTDENDLFSGIRSIY  504
                    P     G  Q  +       ++  
Sbjct  463  APYAQAAQPQPPSAGPQQPTQAAYGHASQAPS  494


>NPA67212.1 hypothetical protein [Chlorobi bacterium]
Length=265

 Score = 67.5 bits (161),  Expect = 2e-09, Method: Composition-based stats.
 Identities = 39/243 (16%), Positives = 81/243 (33%), Gaps = 0/243 (0%)

Query  77   NCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPI  136
               +       Q        G+    I Q     +    R  +  L +  +   +    I
Sbjct  1    MDNKKEVICFKQKRDFGEILGAPFYFIIQEYKPFFSALFRYTYPYLFLLFVSFAMLSDDI  60

Query  137  FSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLG  196
            +     +P            +   L     I++ +++   +M++   K         +L 
Sbjct  61   YEMSAEQPRFSALSTVYFSFFLAALVLCFLIVVTVTYSYIAMYVKKGKDGFVAEEVGELY  120

Query  197  LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLL  256
             ++V +  +   L+ + V  G LLL IPG+       F   +L  +N    +A   +  +
Sbjct  121  KKNVLNVFIAGFLVWISVLAGLLLLYIPGIYLSTALSFVFIILVYENKTIGEAFSGTFEI  180

Query  257  VSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
            + G+WW  F    +  +I+  +S++     Y+      A        SF+    +     
Sbjct  181  IKGNWWYTFALIFVFGLIAGLMSYVVLIPIYLVVLTTFAGGGEFGYISFVILSFLVFLYF  240

Query  317  ANY  319
            A Y
Sbjct  241  AIY  243


>OVE77039.1 hypothetical protein BVX98_03860 [bacterium F11]
Length=412

 Score = 69.0 bits (165),  Expect = 2e-09, Method: Composition-based stats.
 Identities = 52/383 (14%), Positives = 104/383 (27%), Gaps = 9/383 (2%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
             + +   L L      N    +    +    V         +     + +    VG+  +
Sbjct  1    MSLVVPFLQLIDIQVKNWFLFSLLGFVWALGVVAEYCADISLILFFSVAVLNQKVGVLEN  60

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
             K       S+     L  ++V  G L  IIPG+     F F   +   D        + 
Sbjct  61   FKNIRGVFWSYIYYSFLWGIIVVLGVLAGIIPGIYLGTIFAFVIVISVLDRGKDRSPFKL  120

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLT----ARIPYVGEAANLAFSLLLTPFSFLYY  308
            SRLLV G++W +F   +++L +++    L          + E       L+L P      
Sbjct  121  SRLLVKGNFWVVFFIHLIVLFLTVESPGLLDESDQFSKRMSEFGFGLLGLILVPIYPALM  180

Query  309  YLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW-MLIPGLLLVSLSRQNLSAEQLLSAGK  367
              ++S L A   G       R    +    F   + +  L++V +     S +      K
Sbjct  181  VPLFSKLMAEKEGSGEIAEVRGHRWVGFKAFSGVVGLILLIVVPIVLAFWSFDTFSKDIK  240

Query  368  DIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRF  427
                            +    E                 RK T +   S   +       
Sbjct  241  KHPIFGKYIFAPFIKYSSPDDEVIFSNGINMEFEWHWNIRKHTEKEEFSGFSIRNEHITE  300

Query  428  WADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVG  487
            +       +  K +    P+ +             + D         +    +     + 
Sbjct  301  FTLG----MIDKAQFQVIPSTANMNYHKYVRRGLALADKAGNQYGLEEPIETNRHDQKIW  356

Query  488  INQTDENDLFSGIRSIYLRQGTQ  510
             ++   ++         L    +
Sbjct  357  KHEWFIDEKSYKRHEYSLEANEK  379


>MXZ07158.1 hypothetical protein [Acidimicrobiia bacterium]MYD04937.1 hypothetical 
protein [Acidimicrobiia bacterium]MYF25736.1 hypothetical 
protein [Acidimicrobiia bacterium]
Length=262

 Score = 67.1 bits (160),  Expect = 2e-09, Method: Composition-based stats.
 Identities = 31/187 (17%), Positives = 63/187 (34%), Gaps = 10/187 (5%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
                 L+  +    +  + +   +             +    +  + S T+L ++  + V
Sbjct  69   VMGPWLVIRLLITSILFAAVIRMVGEVFLGVKSSWSENTAASISRMASLTVLTVVFWVGV  128

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              GS L ++PGL   V +     VL  +  G + A  +S  L SG    +F   V L  +
Sbjct  129  TMGSALFVLPGLFLVVSWSASLGVLIIEGAGPMAAFRRSWELTSGRRLIVFVTLVPLTFM  188

Query  275  SLTLSFLTARIP----------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
             +  + L   +                A+    ++  P   +   L+Y DL+        
Sbjct  189  VVVTNILILLVLGPLGNYLAGDLGVFLASELAWVVTQPLIGVLLGLLYIDLRVRRDQLNA  248

Query  325  PPIKRQW  331
              + R+ 
Sbjct  249  NMLTREM  255


>MBD0371881.1 hypothetical protein [Pyrinomonadaceae bacterium]
Length=295

 Score = 67.9 bits (162),  Expect = 2e-09, Method: Composition-based stats.
 Identities = 31/169 (18%), Positives = 62/169 (37%), Gaps = 2/169 (1%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY--ILLGLSWMTGSMFIYI  182
              L   L F       + K  ++ +   +    A  L       +L+  S +   + +  
Sbjct  81   IWLITKLIFVIFAPFEIFKALSFDSKDTRWQVVAGGLFLALVCKMLVAPSLIYALVTVMR  140

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                 GL    + GL  +        +  L+   G + LIIPG++  + F     + A +
Sbjct  141  TGVAPGLNECYRWGLSKLWIMIACAFMSWLLQVLGLICLIIPGIILGLAFELVYPIAALE  200

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
            N   ++ L++S  L  G+ W I G  ++L ++   +      +  V   
Sbjct  201  NRTPVEVLKRSYELTKGYRWKILGAGIVLGILCYVILIPAGLVTGVLAM  249


>WP_175193162.1 hypothetical protein [Achromobacter deleyi]CAB3719008.1 hypothetical 
protein LMG3458_03786 [Achromobacter deleyi]CAB3861405.1 
hypothetical protein LMG3412_02281 [Achromobacter deleyi]CAB3876257.1 
hypothetical protein LMG3481_03021 [Achromobacter 
deleyi]CAB3883349.1 hypothetical protein LMG3482_03414 
[Achromobacter deleyi]
Length=565

 Score = 69.0 bits (165),  Expect = 2e-09, Method: Composition-based stats.
 Identities = 40/282 (14%), Positives = 78/282 (28%), Gaps = 18/282 (6%)

Query  332  LPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEP  391
            +         +L+   L ++L    + A  +++  +  +            L   L    
Sbjct  1    MTRLCPTLARLLLTASLGLALPAHAVPATPVVTEARQTRIAQVADAPTRQALTEMLFILH  60

Query  392  QRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLA  451
            Q+  S D       Q         ++                    L            A
Sbjct  61   QQWRSEDMLRQPPAQTLHAGSPFQAVNDYPDTGRMADVFAAGVVGALDERGDIGVVEPFA  120

Query  452  QKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQA  511
                      KV   D  D+         PA   V I+   +    +  +          
Sbjct  121  LPYPHTYTWRKVRFLDGGDIALDA-----PARSGVTISVGQQGQALTLSQ-----PAGAL  170

Query  512  EQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFL------  565
                S+ G+L + LP +      T  D+GK   +G     L+ +  + V +         
Sbjct  171  RTPESLDGELTIALPKSTPHADFTPADLGKPRTLGDFAFTLRSIDGHRVEIGVTQADGVS  230

Query  566  --GDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFD  605
              G  ++ + + A ++  +PL      W  +    +  Q FD
Sbjct  231  PQGFNSNAVLIEALDAGGQPLFNHVRLWGSATPLDARVQAFD  272


>WP_071163930.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Actinomyces tangfeifanii]AOZ72464.1 hypothetical 
protein BK816_03445 [Actinomyces tangfeifanii]
Length=407

 Score = 68.6 bits (164),  Expect = 2e-09, Method: Composition-based stats.
 Identities = 31/179 (17%), Positives = 60/179 (34%), Gaps = 16/179 (9%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                 + L +  +   + I +    VG               +   I ++  V    LL 
Sbjct  223  ILYTLLQLVVEILVMIIPIGLLVGLVGYIIYTASDSDPSSFGSGAAISMVFGVLLLFLLA  282

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIP +   +       V+  +  G ++A+++S  L  G ++ IFG   +  +I   +  +
Sbjct  283  IIPLIFVTIRLSVGSVVIVLEKSGPVEAMKRSWQLTKGRFFPIFGYTFVFSLIIGIIVGV  342

Query  282  TARIPYV----------------GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
             ++I  +                 +      S L  P    +Y LIY DL+        
Sbjct  343  LSQILTLPLMLGLASDPQLTFNFSQLIGTISSALTIPVMAAFYTLIYHDLRIRKEAFAQ  401


>MBC7807260.1 hypothetical protein [Akkermansiaceae bacterium]
Length=286

 Score = 67.1 bits (160),  Expect = 2e-09, Method: Composition-based stats.
 Identities = 33/247 (13%), Positives = 72/247 (29%), Gaps = 26/247 (11%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
                Y+L   +  A +     +                ++       +L       ++  
Sbjct  40   FAPAYILASTVGVAAVSDFNNIFEGPDELATFAISMAWLIPVLTGAYILHFGATALAVRD  99

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +      L    K   R +    L   L+I  +G       +  +L   ++ F  + + 
Sbjct  100  ILTGETATLVNVYKRAFRRIFPLLLAS-LVIGAIGFVVACTTVGPILVAAYYCFTVHGIL  158

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL--------TLSFLTARIP------  286
             +     +AL++S  +   ++    G   L+  I L         ++ L A +P      
Sbjct  159  LEERKLGEALKRSVDMTKSYFGKSLGLLCLMASIILALVVGIESLVALLFAIVPKESGTA  218

Query  287  -----------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLT  335
                        V   A    ++++ P   +   L+Y DL+    G        +W  + 
Sbjct  219  GTAFEMKTAEDVVSAVAGSVIAVVIAPLPAIATTLLYYDLRVRREGLDVESEAAEWGVVL  278

Query  336  AAIFGWM  342
            A      
Sbjct  279  APDPFGG  285


>MBI2084736.1 hypothetical protein [Candidatus Aenigmarchaeota archaeon]
Length=211

 Score = 65.9 bits (157),  Expect = 2e-09, Method: Composition-based stats.
 Identities = 40/209 (19%), Positives = 75/209 (36%), Gaps = 11/209 (5%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
               +  S+E+          + LL   L    +   LL      L+  +      I+LA+
Sbjct  8    WDTMVKSFEISMNNPELFWFLVLLMSGLILFLVGVLLLFFSVASLHVLSLFIGAVIMLAS  67

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
            +  ++ G+          + +    ++  MK G+      +L  +L    +  G LLLII
Sbjct  68   LVLLMGGIGGTILYAERLVKRKKASVWTIMKKGITESPRLSLAYVLEQGFIMLGLLLLII  127

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            PGL+  V           +  G    +++S     G+ W I    ++  +I +  + +  
Sbjct  128  PGLIVAVRLSLVTPACILEKKGLG--IKRSWRATKGNSWQIALLLLVWGLIFMLFAIIPF  185

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             I           S LL P   +   L+Y
Sbjct  186  FI---------VASFLLLPVYLVNLTLVY  205


>RYF91076.1 hypothetical protein EON95_15685 [Caulobacteraceae bacterium]
Length=248

 Score = 66.7 bits (159),  Expect = 2e-09, Method: Composition-based stats.
 Identities = 37/232 (16%), Positives = 79/232 (34%), Gaps = 4/232 (2%)

Query  100  LRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAI  159
               + ++ + +W+         L I  L +V   A +   +  + +      ++N     
Sbjct  10   WFQMGRVFSRTWQSVREMPQLGLVIIGLFVVAPEALVAVLVSQQASESTQVLSENMFNIP  69

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
                    ++G + +  +         V +  S+ +          + IL  L +G G +
Sbjct  70   ---LGLIAMVGQTALVHAALNRQQGKSVTVGSSLSVAGSLFLPMWGVSILTSLGIGLGLI  126

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL-TL  278
            LL++PGL     +         +  G   A+  S  L  G  W +F   +L  ++    +
Sbjct  127  LLVVPGLYLMTLWSVAVPARIMNGPGVSDAMSASAELTKGVRWQVFALILLAGIVLGSGV  186

Query  279  SFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
              +   + Y+     L  S +L P +     LI+S   A          ++ 
Sbjct  187  GAVYIGVEYLPGVGRLVGSAVLAPIALGLITLIFSYGSAALYHELKWGPEQG  238


>MAG61737.1 cysteine--tRNA ligase [Candidatus Pacearchaeota archaeon]
Length=643

 Score = 69.0 bits (165),  Expect = 2e-09, Method: Composition-based stats.
 Identities = 34/163 (21%), Positives = 58/163 (36%), Gaps = 0/163 (0%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
            L F+                        +L       L         +     K    L 
Sbjct  427  LWFSVQDPVFYEMLIAEDPENISVKFAMVLFLLSLITLALYFIFEAGLVKDSIKGKFKLN  486

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
            +++  G ++   F    I++   +    L LIIPG++F V++    Y+  D     +Q+L
Sbjct  487  KTINSGKKYFWKFVWFSIVIFSFLILLFLALIIPGIIFAVYWTLAVYIYLDSKKTVVQSL  546

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
              S  ++ G+WW IFG  VLL +I   +  +   I    E   
Sbjct  547  SASYHMIKGNWWRIFGYTVLLFLILGVVGMIINLIGLPIEMLF  589


>OHE89611.1 hypothetical protein A3G75_05400 [Verrucomicrobia bacterium RIFCSPLOWO2_12_FULL_64_8]
Length=698

 Score = 69.0 bits (165),  Expect = 3e-09, Method: Composition-based stats.
 Identities = 35/384 (9%), Positives = 89/384 (23%), Gaps = 29/384 (8%)

Query  143  KPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGS  202
                           ++L   +         +  ++           F  +        +
Sbjct  323  HWLLLAGVSFVAGLMSMLSCGILGGAAQGGMILLALRALRRPDRRVEFSDLFGAFNRWFA  382

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW-  261
              ++L++  L             ++  + + F  Y + +   G ++AL  S  LV     
Sbjct  383  LLVVLMVYSLGSLL---------IVPGILWMFAVYFVIERRRGPVEALCNSWRLVRNVGL  433

Query  262  WAIFGRFVLLLVISLTL-----SFLTARIPYVG-EAANLAFS---LLLTPFSFLYYYLIY  312
             + F  ++++ +I +        F    IP  G            + + P   L     +
Sbjct  434  GSCFFLWMVMFLIMIGPQALPYMFSLQEIPIFGDAVVGSVGGGLQMFVAPLGILMIASGW  493

Query  313  SDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQR  372
            + L  +    +      +   L  A    +   G + +  +   L     +     + + 
Sbjct  494  NQLMRSKAAEKAEGPVSRKSVLVVAAILLLGFCGFVTLIGTVITLQVMPEIRRSLILPRE  553

Query  373  LGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQ  432
             G                         +   +       +  +   P+ +        D 
Sbjct  554  AGAIAALRNVAVAQERFRADDADRNGERDYWTHDLAGLHDFNVDGNPIGMIDASLANADA  613

Query  433  NPHLWLKLELSDFPNL----------SLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPA  482
             P     L  +                    G   +             +      +  +
Sbjct  614  APSNVYDLTTARSREQRNGYWFRLIPDARPNGYMVVAYPDEYGTTGERTFILNEDGKIYS  673

Query  483  FHWVGINQTDENDLFSGIRSIYLR  506
                G   T        +R   L+
Sbjct  674  NDNGGRVVTAWPSESELVRDWKLQ  697


>GBD34248.1 hypothetical protein HRbin34_00577 [bacterium HR34]
Length=443

 Score = 68.6 bits (164),  Expect = 3e-09, Method: Composition-based stats.
 Identities = 50/336 (15%), Positives = 104/336 (31%), Gaps = 14/336 (4%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            + ++ I   +        +     +     N    I           LS++  +M     
Sbjct  81   VGIIFITFLYVLFLIIFNISVEIAIIYAIHNKNVRISECFTFAFKKVLSYLGFNMTQGFL  140

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
               + L   + L L  +  F L +++ I  +   +L   IP  +F +WF   +Y+   DN
Sbjct  141  IILIPLLLFIPLTLFFIQFFNLGIVVTIYSLAIFALFFFIPVFVFYIWFIIARYIFILDN  200

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA------------  291
             G   ++ KSR  + G+ W  F R V + ++ +    +   + + G              
Sbjct  201  NGIFTSISKSREYIRGYGWKTFWRLVPIFIMYIIPYLIMFGLMFFGNIDVSLYKNSLLTM  260

Query  292  --ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLL  349
                  + + +  FS +Y YLIYSD +      +    K+  +    A+    +    ++
Sbjct  261  NLIFSLYGIFVMIFSLIYLYLIYSDFQKIKPELKISSTKKYKIGFIIAVIFIFIDIVFII  320

Query  350  VSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKT  409
              L        +     + I             L  +L +      + +   L       
Sbjct  321  SWLPSILYQKIKNYMIPQPIITNNQNTTLPNKMLPYNLNKVEDTKRAGELAQLQYPIISY  380

Query  410  TSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDF  445
              E G     +           +   +    E   +
Sbjct  381  RIEKGQIPDNLDELKQFLVEKKEVSLVDAIDEGIFY  416


>PWL88827.1 hypothetical protein DBY14_02330 [Escherichia coli]
Length=469

 Score = 68.6 bits (164),  Expect = 3e-09, Method: Composition-based stats.
 Identities = 42/339 (12%), Positives = 93/339 (27%), Gaps = 18/339 (5%)

Query  135  PIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMK  194
                      A  L+         + L     +L   S + G     I      +   +K
Sbjct  139  LPNYMFDNYAAPLLSSIALMIVSLLSLFASLVLLPNQSTLQGYYVSVIRGRKPVISDGIK  198

Query  195  LGLRH-----VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQ  248
               ++           L I++ + +   S+L IIPG++  + + F + +L D+ ++    
Sbjct  199  YVYKNAFSGSYFKRLALEIIMNIAISFASMLFIIPGIILNLHWAFARQILNDNPDMDVFD  258

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYY  308
            AL  S  +  GH   +F   +  +               +         +   P+ +   
Sbjct  259  ALRISGQMTKGHKGELFVLELSFIGWF-----------LLSAVTLGIGLIYSLPYYYTTM  307

Query  309  YLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKD  368
             L Y + K          +                       S +  N + +   ++   
Sbjct  308  ALYYQNFKMRSIQEGTVDMNEFMSNAERNRRAQNGYYNGNQYSQNFSNAAPQYNGNSQSQ  367

Query  369  IQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFW  428
               +     + +         EP   +    +   S+  +               AD   
Sbjct  368  HLGQQNNGAKPSFIPTEVPSSEPNSDTVTFTEAEYSEPTEPQYAEETKPIDAFETADSQT  427

Query  429  ADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDD  467
               + P    + + + F +   + K     + D    DD
Sbjct  428  DFAE-PVEAEETDKTGFESADNSDKDEYIPKDDNSDTDD  465


>OGI62560.1 hypothetical protein A2818_02395 [Candidatus Nomurabacteria bacterium 
RIFCSPHIGHO2_01_FULL_40_12]
Length=323

 Score = 67.5 bits (161),  Expect = 3e-09, Method: Composition-based stats.
 Identities = 49/277 (18%), Positives = 102/277 (37%), Gaps = 0/277 (0%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             I     + +       +L  P  + +  +         + + +  L + ++        
Sbjct  13   LIREAWKIYSAKFSILFVLTLPFLFWSIVDVYLDRTYDTSFIIFASLFVLFILVVFTQAA  72

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
               ++    S  L L+    +    IL  L++  G  L IIPGL++  WF F   ++AD+
Sbjct  73   LFGNLANNGSFSLALKTFPKYLWTDILTSLILTTGLFLFIIPGLIWIFWFAFSVPIIADE  132

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             I G+++L KSR  + GH+  IFGRF++L+ I    + + + +           S+    
Sbjct  133  GISGMKSLLKSREYMRGHFLQIFGRFIVLIFIIAIPAIIFSILTKHYPLFLYVQSIYGAL  192

Query  303  FSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQL  362
               + Y  +Y+  +                    A +  +L+P +++V +   +LS   +
Sbjct  193  ILPISYAYLYNLYQHIKSNKPTLNPLGPGTGFKIAAWVGLLMPVVIIVLVILLSLSQVFI  252

Query  363  LSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADY  399
                     +         D++  L  +P        
Sbjct  253  PGLQHSKDIQNTNYTISPSDISNILDGQPLSDYVISE  289


>MBF0521684.1 hypothetical protein [Candidatus Omnitrophica bacterium]
Length=236

 Score = 66.3 bits (158),  Expect = 3e-09, Method: Composition-based stats.
 Identities = 33/211 (16%), Positives = 77/211 (36%), Gaps = 6/211 (3%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
               S+ ++       + + L+        +    + K       ++      I++ +   
Sbjct  17   FKTSFNVYKEHFQSFVSLSLI--YALLTLLIRIAIKKILGEAEAESFRALGIIVMLSTLV  74

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
            +      M         K  + + +S         ++ L+ I +  ++  G L  ++PGL
Sbjct  75   LCRMGCVMIDMASKNYRKEPLDIRKSFLETKDRYVTYLLVYIGVFFMMAAGLLFFVLPGL  134

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
             F   FFF   ++  +     +A  +SR LV   +  +F  F+ ++++SL        + 
Sbjct  135  YFGAIFFFADTLVILEKKSFGEAFRRSRELVRKQFGMVFLFFLTIILVSLFPVICLQIVG  194

Query  287  Y----VGEAANLAFSLLLTPFSFLYYYLIYS  313
                 + ++ +  FS L  PF  +    +Y 
Sbjct  195  VQYINLIKSLSEVFSTLFIPFYMIAQVQLYH  225


>KKW30364.1 hypothetical protein UY74_C0043G0008 [Candidatus Kaiserbacteria 
bacterium GW2011_GWC2_52_8b]
Length=329

 Score = 67.5 bits (161),  Expect = 3e-09, Method: Composition-based stats.
 Identities = 31/222 (14%), Positives = 69/222 (31%), Gaps = 28/222 (13%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            +  F    +     + +G  + ++ +                      +       +   
Sbjct  10   FISFGWETFKKRPWFFIGATIIYSILAWCASFVSGFVGAFFGSGVAGLVSFVASFSLNTL  69

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            +     ++FI           S     +   S+  + +L  +++ GG +L IIPG +   
Sbjct  70   IGIGWLALFIKAHDDTAFAALSAFWHPQKFWSYLGVTVLSGIIIVGGFILFIIPGFMALT  129

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS---------------  275
             F F  Y + D  +  ++AL+ S  +  G+   + G      +IS               
Sbjct  130  AFLFAPYFVVDKGMSPIEALKASARITKGNRLRVLGLVAATGLISLLGFVALYRRLSAAA  189

Query  276  -------------LTLSFLTARIPYVGEAANLAFSLLLTPFS  304
                         + L  + A +P +     +  S++L   S
Sbjct  190  DVHQTHQPLTGGEIVLVIVGAIVPLLLIVIGILASIVLATLS  231


>MBI1270537.1 hypothetical protein [bacterium]
Length=235

 Score = 65.9 bits (157),  Expect = 3e-09, Method: Composition-based stats.
 Identities = 37/213 (17%), Positives = 78/213 (37%), Gaps = 12/213 (6%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
            ++  ++    R W  LG+  L + +   P  ++  LK               +   +   
Sbjct  22   MSRGFDDLKSRFWKYLGLLALAVFVPMMPALASFALKWLGESTTLTIVIGSVLSFTSSIL  81

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
              L    +       +   ++     +   +  V +F    ILL +++  G +  I+PG+
Sbjct  82   SFLMTIGLLRIQIRIVRGEEI-HSDDLWRSVGRVWAFMGASILLGIMLAFGFICFIVPGI  140

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            +  + F F  Y + +  +G + AL+ S  +  G  W +F   ++L+V+            
Sbjct  141  ILYLTFQFYPYFIIEHKLGPIAALKASAAITKGVMWELFFLHLVLMVVGS----------  190

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
             +G    L  ++    F  L     Y+DL    
Sbjct  191  -MGWLLLLIGAIPAEMFCRLTITHAYADLLKRC  222


>MBI5170268.1 hypothetical protein [Candidatus Eisenbacteria bacterium]
Length=250

 Score = 66.3 bits (158),  Expect = 3e-09, Method: Composition-based stats.
 Identities = 34/184 (18%), Positives = 67/184 (36%), Gaps = 4/184 (2%)

Query  152  NQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLI  211
            +        L   A  +L +     ++       ++ + + ++ G R   S   +++L  
Sbjct  66   SMAVVIGAYLVFGALYVLLIGTSALTLVAAARGENLSIAQGLREGARRFLSLFAVVLLYA  125

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            L V  G +LLI+PG++  + F         + IG L A+ +   L   H   IFG  +LL
Sbjct  126  LTVLVGLVLLIVPGVIAAIRFSVAVPACVVEGIGPLAAMRRRSELTKDHDGTIFGARLLL  185

Query  272  LVISLTLSFLTAR----IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
             ++ L  + +       +P       L           L   + Y  L++   G     +
Sbjct  186  GLLLLPANLVFRAGTEKLPLAVSFLVLLVYFGQILVEMLVPAVTYFRLRSAREGFGTEDV  245

Query  328  KRQW  331
               +
Sbjct  246  METF  249


>HBP55355.1 hypothetical protein [Verrucomicrobiales bacterium]
Length=205

 Score = 65.2 bits (155),  Expect = 3e-09, Method: Composition-based stats.
 Identities = 28/173 (16%), Positives = 62/173 (36%), Gaps = 12/173 (7%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
                        T ++       +V + +++  G   +G+   + + + L +  G LL I
Sbjct  4    GFVIFPWAQGASTYAISECYLNREVSIGQAIGFGWSRLGTLFNVSVSVGLRLIIGLLLFI  63

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS---  279
            IPG+++   +      +  +       + +SR L  G  W++F   + + ++SL ++   
Sbjct  64   IPGIVWACSYVAAMPAVIIEGCKAKDGMRRSRELAKGRRWSVFAILITIWLLSLVVTASI  123

Query  280  -----FLTARIPYVGEAANLAFS----LLLTPFSFLYYYLIYSDLKANYRGPQ  323
                         +G       S    + L P   +   L+Y D +    G  
Sbjct  124  YFAVDVTLGMNTTIGTMTLNIASQGVTIFLMPLGVIAATLLYYDFRIRKEGFD  176


>KKR48633.1 hypothetical protein UT86_C0004G0119 [Candidatus Magasanikbacteria 
bacterium GW2011_GWC2_40_17]KKS57177.1 hypothetical protein 
UV20_C0003G0119 [Candidatus Magasanikbacteria bacterium 
GW2011_GWA2_42_32]OGH85303.1 hypothetical protein A2294_00840 
[Candidatus Magasanikbacteria bacterium RIFOXYB2_FULL_38_10]
Length=338

 Score = 67.5 bits (161),  Expect = 3e-09, Method: Composition-based stats.
 Identities = 42/195 (22%), Positives = 75/195 (38%), Gaps = 0/195 (0%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                 L  +  L I      I +             +      +   +    L     + 
Sbjct  103  NCKTFLKILGWLLIPAVLLIILNIFDTLTNLKYVNYSFPIYLLLSALSFIIGLWTQIVLI  162

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
                  + K  +          R    F  + IL+ L++  G++LL+IPGL+F +W+ F 
Sbjct  163  RLTSASLTKEPLNEKILYTESWRDTAPFLWISILMGLIIMAGTILLVIPGLIFTIWYLFS  222

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
             Y+ A +   G  AL++S+ LV G +WAI  R V+       + F+   IP +       
Sbjct  223  LYIFAVEGKRGYSALQRSKELVQGKFWAIVWRLVVTGFFYGLIIFVIIAIPTLIIGLITQ  282

Query  296  FSLLLTPFSFLYYYL  310
            F+   + FS + ++L
Sbjct  283  FNQFSSVFSTMPWWL  297


>NOZ68443.1 DUF975 family protein [Deferribacteres bacterium]
Length=281

 Score = 66.7 bits (159),  Expect = 4e-09, Method: Composition-based stats.
 Identities = 38/251 (15%), Positives = 76/251 (30%), Gaps = 11/251 (4%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
              ++          L        +        L     ++   +      + +  C    
Sbjct  25   FFIVLLLVASLIQNLPGIIGRFAEKFPLISLTLFLAGWFLGFVVQMGLIKVSLKFCDGTK  84

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
            G    +      +  F     L  L++  G +LL++PG+++ V F    Y + D N+G +
Sbjct  85   GKLDDLLSSFDLLFKFIGGTFLYGLIIMAGFVLLVVPGIIWAVKFSLTPYFIVDRNLGPI  144

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLY  307
            +AL+ S     G  W +F   +LL             I   G  A +    +  P + L 
Sbjct  145  EALKASSKATKGAKWDLFLFGLLLG-----------LINLAGALAFVVGLFVTMPVTMLA  193

Query  308  YYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGK  367
            Y   Y  L    + P   P           +            +   +         +G+
Sbjct  194  YAHAYRTLAGGEKSPAESPETGDGEGPEELLHASPEAAQGEAPAEPAEPSEETYEAVSGE  253

Query  368  DIQQRLGTQPQ  378
              ++    +P+
Sbjct  254  SPEKVPEKRPE  264


>WP_133516618.1 hypothetical protein [Methanimicrococcus blatticola]TDQ71032.1 
hypothetical protein C7391_0131 [Methanimicrococcus blatticola]
Length=333

 Score = 67.5 bits (161),  Expect = 4e-09, Method: Composition-based stats.
 Identities = 31/260 (12%), Positives = 72/260 (28%), Gaps = 8/260 (3%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                  L I  +G+ +                               +          + 
Sbjct  74   FNFARFLAITYIGLFIFVIFAVFIEAGLTGMSKEATLTGETSLRDFFSYGAKYFLKLLLL  133

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
              +   I    +G+     + L         + L IL+     L +I+      +  +F 
Sbjct  134  TIVIGIIVGVPLGIVFGFLIILIIALIAAGSIALAILIGFISVLAVIVIVFALSLILYFT  193

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF-------LTARIPYV  288
             Y L  +N G + +++KS  L   +   +F   ++L  IS  +SF       +   IP V
Sbjct  194  TYALILENCGVIGSIKKSYNLFMENKGEVFLFALVLFAISFGVSFVMNIVVTILGFIPLV  253

Query  289  GEAANLAFSLLLTPFSFLYYYLIYSDLKANY-RGPQHPPIKRQWLPLTAAIFGWMLIPGL  347
            G       ++++         +    +  +            ++   +    G +   G 
Sbjct  254  GFIVGTLINVIVVSVLTGLQTVWCVRMYYSLVEEKTDEEPAEEYSEYSRIEQGILDADGN  313

Query  348  LLVSLSRQNLSAEQLLSAGK  367
            ++     +  +         
Sbjct  314  VVSEEVYEETTTSWEDEKRD  333


>WP_113962173.1 hypothetical protein [Roseimicrobium gellanilyticum]RBP35919.1 
hypothetical protein DES53_11985 [Roseimicrobium gellanilyticum]
Length=295

 Score = 66.7 bits (159),  Expect = 4e-09, Method: Composition-based stats.
 Identities = 33/208 (16%), Positives = 71/208 (34%), Gaps = 9/208 (4%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            +       L  +L+ + +A         +              +  +     +I +G + 
Sbjct  78   WDIFRQHWLLFFLINVTVALPVSILEGYVSTQVDPVEGGMRLLFFSISVNGIFISVGHAA  137

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +  +M             +  + +  +G+  L  I ++++V  G  L ++PG L  VW  
Sbjct  138  IFAAMSRIWMGKVPTYGNAWFVTMARLGNVVLASIFVLILVMAGFFLCLVPGFLASVWLA  197

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL----TARIPYVG  289
            F   ++ D+      A+E+S  L  G +W +F  F+ + +    L F+       +P + 
Sbjct  198  FTICIVMDEGRSAWPAVERSADLARGRFWLLFLYFIAVTLPLTVLVFIYQVVVQFVPTLS  257

Query  290  -----EAANLAFSLLLTPFSFLYYYLIY  312
                        S  L       + L  
Sbjct  258  HWLLDAVILTILSTPLLLVQVFTFVLYR  285


>OGG86161.1 hypothetical protein A2392_01720 [Candidatus Kaiserbacteria bacterium 
RIFOXYB1_FULL_46_14]
Length=219

 Score = 65.6 bits (156),  Expect = 4e-09, Method: Composition-based stats.
 Identities = 36/165 (22%), Positives = 73/165 (44%), Gaps = 7/165 (4%)

Query  157  WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG  216
            W +  A    + +       ++ +      + + ++          +    I+L L++  
Sbjct  46   WTLFGALAITLAVFNILSGIALIVAANDQTLSVRKAYGQASGFFWRYVGFTIVLSLILFV  105

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
              +L IIP ++  VW  F  +VL  +N   ++++++SR  V G WWA+FGR V ++ +++
Sbjct  106  SFILFIIPAIIVSVWVAFAAFVLVLENARIMESMKRSREYVRGRWWAVFGRSVFIMFVAV  165

Query  277  TLSFLTARIPYVGE-------AANLAFSLLLTPFSFLYYYLIYSD  314
             +  +   +  +                 L+TPF  LY YL+Y D
Sbjct  166  VVMAIVTSLGSLISDQKAVTDGLVSLVIALITPFLLLYVYLMYKD  210


>MBI5585589.1 DUF975 family protein [Deltaproteobacteria bacterium]
Length=261

 Score = 66.3 bits (158),  Expect = 4e-09, Method: Composition-based stats.
 Identities = 31/180 (17%), Positives = 66/180 (37%), Gaps = 12/180 (7%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
               A++                 I       +   L       F+ + +     +  +  
Sbjct  20   WGLAVVGNLIFLAATSVTQILPLIGWVANLIVGGPLVLGVTLFFLGLSRGQDVQYSRLFD  79

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSR  254
            G +      +  +LL+L++   SLLL++PG++  + +    +++AD   + G+ AL++SR
Sbjct  80   GFQRFVDALITYLLLVLLIFLWSLLLVVPGIMAALSYALTFFLMADRPELKGMDALKESR  139

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
             L+ G+ W +F  F+  +               +G        L + P+        Y D
Sbjct  140  RLMKGNRWKLFCLFLRFIGWF-----------LLGALTLGIGYLWVMPYLQTTAARFYDD  188


>NLK36502.1 hypothetical protein [Epulopiscium sp.]
Length=269

 Score = 66.3 bits (158),  Expect = 4e-09, Method: Composition-based stats.
 Identities = 45/255 (18%), Positives = 87/255 (34%), Gaps = 2/255 (1%)

Query  79   RRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFS  138
            +  NR   +               I  L   ++ +        +    +   L       
Sbjct  4    QMRNREMTIGEMFSLSFQLFAKNLIHILWITTFLIMPFELLRSIIAPNVYYNLNILEQDL  63

Query  139  ALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLR  198
            A+ L  A         +  A+ L  + +  +G+  +       +   +  L +++   + 
Sbjct  64   AISLAKALPTLKSISIYLLAVTLIDIFFTPIGIVSVAKLAQNDMIGKEATLKQAVLQTIE  123

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
               +  L   L  + +   S++ I P + F V + F  Y +   +  G++AL  SR LV+
Sbjct  124  KAPAIILGATLYHVSLLLWSMVFIFPAIYFAVLWIFYLYSIGLSDKKGMEALPYSRSLVA  183

Query  259  GHWWAIFGRFVLLLVISLTLSFLTARIPYV--GEAANLAFSLLLTPFSFLYYYLIYSDLK  316
            G WW  FGR  LL++      +    I             SL  T  S  Y Y+  +   
Sbjct  184  GKWWRTFGRAALLIIAVSLAQYGIEAIFLTEDESLILSIISLFFTSVSQCYTYIFVAIWY  243

Query  317  ANYRGPQHPPIKRQW  331
             N    ++P    ++
Sbjct  244  TNRDCMKNPEQYEKY  258


>OHB25146.1 hypothetical protein A2X84_09095 [Desulfuromonadaceae bacterium 
GWC2_58_13]
Length=310

 Score = 66.7 bits (159),  Expect = 4e-09, Method: Composition-based stats.
 Identities = 52/286 (18%), Positives = 99/286 (35%), Gaps = 23/286 (8%)

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            +F +WF F QY+LA D++ G++AL KSR  V GH W + GR +LL      +  L A IP
Sbjct  1    MFTIWFIFAQYILATDDVHGMEALLKSREYVRGHSWGVAGRVLLLA----AVGTLVALIP  56

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPG  346
             +G   NL       PF+ +YY+ I+ DL+       +       +   AA     LI  
Sbjct  57   IIGPLLNLLL----VPFTLIYYHEIFKDLREIKGSVSYIASPGAKIKWLAAGAAGYLIVP  112

Query  347  LLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQ  406
             + ++L   +L             +    Q   +     S   +     + +        
Sbjct  113  AIGLALLGPSLLQGWSHWTWNIRNEGGILQSSSSSVSTASDQSQLSVEVTGEE-------  165

Query  407  RKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDD  466
               ++      GP  L   +   +     +      + +  L+        + +  +   
Sbjct  166  ---SAGAEPDNGPHRLDLGKDRFEPGEGAVPEPSPATVYVPLTGDDPDQVMVYVFAINYR  222

Query  467  -----DARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQ  507
                 D +++Y  +   +    +   +     ++ F          
Sbjct  223  GSVRFDGKEIYPIEGERDMSYNYTGNLTLNRGSNTFEVDYQALPDP  268


>MSP93144.1 hypothetical protein [Myxococcales bacterium]
Length=331

 Score = 67.1 bits (160),  Expect = 4e-09, Method: Composition-based stats.
 Identities = 33/172 (19%), Positives = 63/172 (37%), Gaps = 4/172 (2%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                 + L  + +T               ++ +  L  +      +I   +++  G++L 
Sbjct  152  LHGLAMPLAKTALTLQAGDLAVGGRGDWKQAWRRALTRLPVLIGAVIPGGVLIAFGTVLF  211

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            ++PGL+    F F   V   +    L ALE+S  LV   W  +   FV L V S    F+
Sbjct  212  VVPGLIAAFLFSFIAPVALLEGKTWLAALERSARLVLADWVRVAVVFVGLAVASALFRFV  271

Query  282  TAR----IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
                   + + G  A+    L+  P   +   L+Y + +    G     ++ 
Sbjct  272  LGFGLSPLGFFGSLADDLILLVTVPVHAVAIVLLYVEAREAEDGFDEDGLRE  323


>RMF97190.1 DUF975 family protein [Candidatus Schekmanbacteria bacterium]
Length=415

 Score = 67.5 bits (161),  Expect = 5e-09, Method: Composition-based stats.
 Identities = 39/187 (21%), Positives = 79/187 (42%), Gaps = 11/187 (6%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
            +  IF A L+     +   N  +   I+      + L +   +  + +     +  ++  
Sbjct  107  YLIIFIAPLIPGFLGMFAGNNAFVGIIVFLLSFILSLLMQMGSLKIVLKFTYGEKPVYDD  166

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
            +      +  + L LI++ ++VG G +LLIIPG++  +   F  +V+ D  +G ++A++K
Sbjct  167  LINTYPLIVKYLLALIVVCIIVGAGMVLLIIPGIILMLALQFFAFVIVDKEVGAIEAVKK  226

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
            S  +  GHW  IF             + L + + ++G       +L+  P + L Y   Y
Sbjct  227  SYAITKGHWLNIF-----------LFNLLFSFVSFIGFLVIGIGALVSIPVAMLAYAHFY  275

Query  313  SDLKANY  319
              L    
Sbjct  276  RQLAGEE  282


>MXY43355.1 hypothetical protein [Dehalococcoidia bacterium]MYD51760.1 hypothetical 
protein [Dehalococcoidia bacterium]
Length=274

 Score = 66.3 bits (158),  Expect = 5e-09, Method: Composition-based stats.
 Identities = 38/271 (14%), Positives = 86/271 (32%), Gaps = 11/271 (4%)

Query  71   IQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIV  130
            +                        S   +     L      L  +    + GI  L  +
Sbjct  1    MTPYHTPSGHEPPPLPHGGLGHIADSTFSVFGAHYLPFILIALLPQIPLLVGGIISLTGI  60

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
                     +          Q   W+   ++ +    +L       ++  +   + V + 
Sbjct  61   GPIFTFDFPVEPSEEPVEPLQISLWEVLGVILSFVISILAGGATIYAVARHYLGSPVLVQ  120

Query  191  RSMKLGLRHVGSFTLLLILLILV-----VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG  245
            RS +             +L++L+     V    ++ I   +   V   F  + +A +   
Sbjct  121  RSYEYAWARFPKLLGSFLLVLLILIVPGVLSVFIIGIPLLVFAVVALLFVTHAVAIEQQD  180

Query  246  GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT------ARIPYVGEAANLAFSLL  299
               AL++S  +V G+WW  FG    + +  +  + +          P +    + AFS +
Sbjct  181  PTDALKRSWNVVQGNWWRTFGLLAGITLAIIGATLIIFLPAGRFLPPALVVLLSTAFSTV  240

Query  300  LTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
            +TP + +   ++Y D++       H  + R+
Sbjct  241  VTPIATIAITVLYFDIRVRKEQYTHNDLARE  271


>WP_145056105.1 hypothetical protein [Lignipirellula cremea]QDU97320.1 hypothetical 
protein Pla8534_51660 [Lignipirellula cremea]
Length=358

 Score = 67.1 bits (160),  Expect = 5e-09, Method: Composition-based stats.
 Identities = 40/359 (11%), Positives = 85/359 (24%), Gaps = 30/359 (8%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                CP C      P   +     + RCP C   ++             + +  P     
Sbjct  3    VDFHCPRCQRGIRAPDQAI---GQTMRCPNCNSGVMVPYPGMHLPGMGPDGSNPPAPAGG  59

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPE------------------------REFRASG  97
                    +      +    +     Q                                 
Sbjct  60   PVPKMPLRQKPVLFPDVFGTSGKLLAQKGGLIAGMLVVTALIEIVIMALLTGLTMLIIWS  119

Query  98   SGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQW  157
               R     L     L        L I ++   +++  +     +           +   
Sbjct  120  IASRLPVMRLDMRISLVVSIALIPLSIGVIVACMSYMIMGLIKAVTSTCRGQASFVDLFL  179

Query  158  AILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG  217
                     + + L  +T      +          +   +     +    I + L V  G
Sbjct  180  PGKCVLWGALYMFLYLLTLYAVTALIAFGAAALSWLVTSMLVGMKYDPASIAMYLRVIIG  239

Query  218  --SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
              SLL+    L+  + F    + +AD ++    A+  S   + G+    F   +   V++
Sbjct  240  AASLLIAAVNLMATLNFLLTLFFIADRDMNINDAMMHSVKYMGGNKMQTFLVLLATNVLT  299

Query  276  LTLSFLTARIPYVGEA-ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
                     I  VG     +   L+   F+ +   ++Y             P   +   
Sbjct  300  SIGMTAAGFIGGVGAVGLAVVLLLISYAFTMITMTVVYLRATGQLTCADEAPAPPRPQY  358


>MBI2036982.1 hypothetical protein [Candidatus Liptonbacteria bacterium]
Length=232

 Score = 65.6 bits (156),  Expect = 5e-09, Method: Composition-based stats.
 Identities = 40/169 (24%), Positives = 69/169 (41%), Gaps = 0/169 (0%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
                L  V+A        L   +           + +  A V        W   ++ + +
Sbjct  26   HWRRLMPVVAIYAFGGLALQLFSESGFRSMGKPFFPVAAAFVLAGAFIYVWGFAALLLAL  85

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                +   R+ +  L +V  + +  +L  L+V  G L L+IPG+   V F F  YVL  +
Sbjct  86   RDDRLDWQRAYRGALSYVVRYAVAWLLYALIVMAGVLALVIPGIYLAVQFSFVSYVLVFE  145

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
            N G  +AL +SR+ V G WWA+  R V L ++++        I  + + 
Sbjct  146  NTGAREALRRSRMYVRGRWWAVLWREVALGLMAMGAYSFVLLILGILQF  194


>NIA05393.1 hypothetical protein [Proteobacteria bacterium]
Length=216

 Score = 65.2 bits (155),  Expect = 5e-09, Method: Composition-based stats.
 Identities = 37/150 (25%), Positives = 70/150 (47%), Gaps = 4/150 (3%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
              + ++      +   + W   ++   I   ++GL  ++  G ++    +  + +L+ +V
Sbjct  67   ILFGVVPVLFLLVTALVFWSQTALLALIVNEEMGLIDALHAGWQYFWPMSKTISILLGIV  126

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              G  L I+PGL+F VWF F  ++L D++  GL  L  SR  V GH W  F +     V+
Sbjct  127  LIGLALGIVPGLIFIVWFSFGLFILIDEDRRGLDGLLASREYVRGHGWDTFFK----FVL  182

Query  275  SLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
               +S +   IP+ G+  +  F+  L  + 
Sbjct  183  IWAISLVAGIIPFAGQIFSFLFTPFLMLYL  212


>MAG82522.1 hypothetical protein [Candidatus Poribacteria bacterium]
Length=347

 Score = 66.7 bits (159),  Expect = 5e-09, Method: Composition-based stats.
 Identities = 35/175 (20%), Positives = 69/175 (39%), Gaps = 5/175 (3%)

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
             L   +A   +   + + P       +      ++      + L +      + + +   
Sbjct  160  NLVFFIAILIVSMLIGIAPDVIERLVSSIPIRIVVGIPFWVLNLVIWMGLIRIALRLHDN  219

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG  245
                F  +        ++    IL  L++  G +LLIIPG+++ + F F  Y + D+ +G
Sbjct  220  KDVGFTDLFSCFPLFFNYLFGSILYGLIIFVGMILLIIPGIIWAIKFQFYSYFIVDEGLG  279

Query  246  GLQALEKSRLLVSGHWWAIFGRFVL-----LLVISLTLSFLTARIPYVGEAANLA  295
             ++AL++S L+  G  W +F   +L     +L     L  L A IP    A +  
Sbjct  280  PIEALKRSSLITKGSKWNLFLFGLLLILINVLGALCLLVGLFATIPTAMIAVSFV  334


>WP_145201323.1 hypothetical protein [Thalassoglobus polymorphus]QDT34037.1 hypothetical 
protein Mal48_32960 [Thalassoglobus polymorphus]
Length=326

 Score = 66.7 bits (159),  Expect = 6e-09, Method: Composition-based stats.
 Identities = 45/335 (13%), Positives = 81/335 (24%), Gaps = 27/335 (8%)

Query  2    PTVRCPHCGAERNTPSSK------LPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATC  55
                CPHC A    P SK       P        P   +   F     + +   D     
Sbjct  3    IEFYCPHCEAYLRAPESKARLSIACPHCGGRCWVPLKSEQETFFDEVEEWSDEEDFDEAD  62

Query  56   PHCGLQRRIPSDRLEIQSKTVNCRRCNRSFC----------LQPEREFRASGSGLRSISQ  105
                 +       + + +      + +   C           +       +         
Sbjct  63   YEADYEPTPGPADVPVLAPKKKQLQTSCKVCDSVLSPSEKVCKVCSHRVGAPIFDEPADV  122

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
             +        R      G  L+  V        A +L                 L+  + 
Sbjct  123  DVGRILSTSWRLYTRHFGTCLIVTVTDAVLTIVACVLAIFIGGTAALAVGNRPGLVVFMF  182

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
             I  GL W        I      L            S      +  +  GG     +   
Sbjct  183  LIASGLGWGIAMSMFAIGHMRFYLDLCRTDRADFHKSMDFQGPIGHIFSGGVVYWSLFLF  242

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            LL  ++ +    V+ D N+ G + L  +  L   H    F    +L+ + +    +++  
Sbjct  243  LLPPIFLWPFGRVVVDQNVSGARGLWTALRLSVKH----FSVCFVLVFVKIGAILISSLF  298

Query  286  PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
            P  G           TP+  +   + Y        
Sbjct  299  PIGGAILM-------TPYFAILNTVAYLHFIGELE  326


>MBI4122325.1 hypothetical protein [Parcubacteria group bacterium]
Length=275

 Score = 65.9 bits (157),  Expect = 6e-09, Method: Composition-based stats.
 Identities = 50/229 (22%), Positives = 81/229 (35%), Gaps = 6/229 (3%)

Query  103  ISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLA  162
            ++     SW +F      L         L    +  A ++            W  A  + 
Sbjct  1    MNTNWRSSWTVFTAAFQTLWQALGSVAWLWLVELALAAVMITLVSFLSSFTAWFAAFGIT  60

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTL-----LLILLILVVGGG  217
             +    L  +    ++              ++  LR      L     L ILL + +   
Sbjct  61   VMFVYRLLATTSLIAIASGRPGQPGSPILRIRDVLRRPVLLKLPAAVALAILLGVGISIA  120

Query  218  SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
            SL L+IPG+L  +++     +L  ++     AL +S  LV G +W +  R V +  I   
Sbjct  121  SLFLLIPGILLSLYWSMSLVILIVEDTSVRDALRRSVTLVRGWFWPLLSRMVFVFTILTI  180

Query  278  LSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPP  326
             + L A +P +G       S LLTP    YY L Y +L  N R      
Sbjct  181  FT-LPAMVPGIGAMVTAILSFLLTPVLVFYYVLTYQELAENKRYKHLQQ  228


>MBF0930940.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Actinomyces graevenitzii]
Length=458

 Score = 67.5 bits (161),  Expect = 6e-09, Method: Composition-based stats.
 Identities = 29/189 (15%), Positives = 62/189 (33%), Gaps = 28/189 (15%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGL--FRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
             +  I L +     S  + I    V      S+    +        L + ++ +    ++
Sbjct  257  FLPLIGLNIITSIISGLMMIIGIAVFFVLLASVASTAKTETELFQGLGITLVGLLILMVI  316

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
              +      + F      +  +N+G   A+ +S  L  G++W +FG  +L  +I+  ++ 
Sbjct  317  SALVSSYLSIKFSVASPAMVLENLGVFAAIGRSWSLTRGNFWRLFGINILTAIITSMVAG  376

Query  281  LTARIP--------------------------YVGEAANLAFSLLLTPFSFLYYYLIYSD  314
            +   I                            +    +    LL+ PF+     L+Y D
Sbjct  377  IFGGIAGALGAIFVVVGSSSPEDVIASLNTTYILTMVMSTIAQLLILPFTSSVNALLYID  436

Query  315  LKANYRGPQ  323
            L+    G  
Sbjct  437  LRMRKEGLD  445


>WP_095981871.1 hypothetical protein [Melittangium boletus]ATB33902.1 hypothetical 
protein MEBOL_007403 [Melittangium boletus DSM 14713]
Length=294

 Score = 65.9 bits (157),  Expect = 6e-09, Method: Composition-based stats.
 Identities = 42/272 (15%), Positives = 73/272 (27%), Gaps = 8/272 (3%)

Query  48   TTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLL  107
                   CP+        +           CR    S          AS     S     
Sbjct  1    MQTASTPCPNHPTSPATFTCARCGSFACEACRSPQASTWCASCAARYASPGLPVSEVLGD  60

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA--  165
               + +       LL  Y +   LA  P   A+     +         +    +  +   
Sbjct  61   TFGFLMRHPGPIALLAGYHILFGLAQLPFNMAMQEAIKSGNLFPFMTDRLPSWIVLIVGG  120

Query  166  --YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
              +  +  +         +          +  GLR         +LL + +G G +L + 
Sbjct  121  SVFSSIAYALFIRFAGDALEGPPRPTGELVNAGLRRALPVLGTNLLLGIALGVGFILCLA  180

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            PG+   V              G + AL  S     GH   +F   V+L  I + +  + A
Sbjct  181  PGVFLSVTLALALPATVLHPSGPIDALSFSWTRTRGHRGNLFLLLVILGAIFMGVGIVNA  240

Query  284  RIPYV----GEAANLAFSLLLTPFSFLYYYLI  311
             +  V    G       S++    S     L+
Sbjct  241  GVNLVVTPMGLGGMAVGSVITQALSGTCVALM  272


>MYF31094.1 hypothetical protein [Gammaproteobacteria bacterium]MYK45371.1 
hypothetical protein [Gammaproteobacteria bacterium]
Length=245

 Score = 65.2 bits (155),  Expect = 6e-09, Method: Composition-based stats.
 Identities = 26/226 (12%), Positives = 72/226 (32%), Gaps = 8/226 (4%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            +  R   L+    L   +    + +  + +   +     +       L ++   +   + 
Sbjct  19   YYVRNLALILPVSLLAFVPAFVVIAVSVDEGFRFDLGPAKVEFRYRELTSLVCGVWLQAG  78

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            ++  +   +  +      ++   +  +     + ++  +V   G + L +PGL+     +
Sbjct  79   LSTIVIRQLEGSYQDFGETLVASIHSLFRCAHVALIGAVVFVVGLMALALPGLVLATMLW  138

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE---  290
                  A +  G + AL +S  L   +   I G F+  +V+ L + F    + +      
Sbjct  139  VAMPSAAVERCGLVGALRRSYELTREYRLRILGLFLAFVVVLLIVEFAIGLVLFPRVPEG  198

Query  291  -----AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
                        + +         + Y  L+    G     I + +
Sbjct  199  LSSQEIVKQIGGIFIGGLVGSIVAVSYYYLRTIKEGASSKAIAQAF  244


>MBR62226.1 hypothetical protein [Dehalococcoidia bacterium]
Length=262

 Score = 65.6 bits (156),  Expect = 7e-09, Method: Composition-based stats.
 Identities = 34/218 (16%), Positives = 70/218 (32%), Gaps = 15/218 (7%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              I  +  +L    I   +       +            LA      +    +  ++ + 
Sbjct  36   FLILTVTAMLPVLLISYFISGSVTGLVLLSFIEILIVTTLAGAFIYSVPYYCINRTVNLR  95

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                         LG+       L +++   VV    +   I  LL+ ++F   Q V+  
Sbjct  96   ESLKASQAVYFKLLGVTLGLQIILAILMTFGVVLPLLIFPFIGALLYMIYFMVSQPVVVI  155

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL-------  294
            + +  + A ++S  LV GHWW +FG   + ++ ++    L+    Y+             
Sbjct  156  ERLKPVDAFKRSLNLVRGHWWRVFGATAIFILCAIGTFVLSWLPFYLFSLVLSEGSELRG  215

Query  295  --------AFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
                        L  P  +++  +IY DL+        
Sbjct  216  IIQQIGVYVGGTLTIPPIYIFLTVIYYDLRYRKEDFSF  253


>NTU91881.1 hypothetical protein [Chlorobiaceae bacterium]
Length=244

 Score = 65.2 bits (155),  Expect = 7e-09, Method: Composition-based stats.
 Identities = 33/159 (21%), Positives = 64/159 (40%), Gaps = 0/159 (0%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L  ++L+ + L                  P    +    L     +  L    +   +  
Sbjct  31   LPLVFLISVPLEMVSWQFHDFTVVDQQNLPLLIRYLLLALFMWFVFGTLLQLSLIRMVES  90

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +  T + L  +++  L          +L +L+V   SLLL +PG ++ V++ F   V+A
Sbjct  91   SMLGTPITLQEALRHALSRWMPALGTGLLSLLIVAAWSLLLFVPGFIWSVYYTFGLMVVA  150

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
               + G  AL++S+ +V G WW + G  V+  +    +S
Sbjct  151  LRGVSGKAALDRSKAMVRGRWWRVLGYQVVFFLFPAVVS  189


>WP_098796000.1 hypothetical protein [Bacillus sp. AFS040349]PGT87617.1 hypothetical 
protein COD11_06760 [Bacillus sp. AFS040349]
Length=611

 Score = 67.5 bits (161),  Expect = 7e-09, Method: Composition-based stats.
 Identities = 64/512 (13%), Positives = 145/512 (28%), Gaps = 14/512 (3%)

Query  103  ISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLA  162
                + DS+E+                +         +           N      I  A
Sbjct  37   WGFTIFDSFEITIGNPIQPFQPTGTYPIWIPQISDLKVPAGFIHATLDYNVALSLIITAA  96

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
                    +S   GS+   + + +  +F   +   +      +L  +++LV+     L+ 
Sbjct  97   FFVIKSFFISGYLGSLKSVLTEEETMIFHEGRYYFKRFFLLHILFAIIMLVLTLLGSLIG  156

Query  223  IPGLLFCVW---FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
               ++  +    +    YV+  ++ G  +AL +S+ LV  H+W +    +LL+ I+  L+
Sbjct  157  PFVIVLFLLTIPYILTPYVVVLEDCGITEALGESKNLVQQHFWFLLRFALLLMFITFLLT  216

Query  280  FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIF  339
                 +P    A  +   ++ +  S      +   L           +  +         
Sbjct  217  LSLQTLP--KSAIYIIILIVYSFISSAAILTVMGRLYEKGEEEATKELNGKIHLPFLLAC  274

Query  340  GWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSA--  397
             ++L    LL +   +      L  + K   + L  +      L+ + P           
Sbjct  275  YFVLFSLPLLGANLAKGEYLMFLDFSEKQTFEGLSFRSANGYILDDTFPTYEWLNDEKIK  334

Query  398  ---DYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKG  454
                   L  +  K +  G + +   T        +    +   K  ++D      +Q+ 
Sbjct  335  IELSLPSLKDEPDKISGTGTVGIVAPTNSGKHEMKNVDFIYKLHKSTINDVTYYRNSQQE  394

Query  455  SARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQV  514
            S  +   +             +      F+        +          Y       +  
Sbjct  395  SISLLNWESDIPQPAVTIMVNNEGNDVFFYLAEPVHPQDGVFGFQQTETYFDVNETGQIF  454

Query  515  HSILGKLE-LTLPLAIESLQLTRNDIGKTLQIGG--KQLILQRLGSNAVTLRFLGDRTDL  571
              I       + P    S +LT+  +   L+         L   G    +L   G    +
Sbjct  455  LPIRNDTNMYSYPYYWFSTELTKEKVSSFLESKNNYSGFHLNDPGFQVASLIREGSYQTI  514

Query  572  LNVHASNSHAEPLREIGFTWQKSGDAFSLRQM  603
            + +   N   + ++     W    + F   Q 
Sbjct  515  IRLAP-NLTEQNIQHNLDEWSTRWEGFFEDQY  545


>HGV36320.1 hypothetical protein [Spirochaetes bacterium]
Length=196

 Score = 64.0 bits (152),  Expect = 7e-09, Method: Composition-based stats.
 Identities = 28/164 (17%), Positives = 59/164 (36%), Gaps = 2/164 (1%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL--GLRHVGSFTLLLILLIL  212
              W + +      +     +   +     K +      +      +++G F L   L+ +
Sbjct  30   LLWILTIWIPYLNIGTTIGLLLGVIAKASKNESIAMTEIFNPEYRKYMGDFFLTAGLMSI  89

Query  213  VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
             V  G+ L + PG++  + +     +  D      +AL  S  +  G+   +F   +++ 
Sbjct  90   GVSIGTALFVAPGIVLAIAWSQSLLLAVDKGKSPTEALNLSNKVTYGNKGTMFLVNLVVA  149

Query  273  VISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
            +    LS +  +IPY+         LLL   +      IY  L 
Sbjct  150  IAFAILSVIFMQIPYLDFILIFVAGLLLLFVNIGIQAHIYQKLC  193


>CAA9462944.1 hypothetical protein AVDCRST_MAG38-339 [uncultured Solirubrobacteraceae 
bacterium]
Length=318

 Score = 65.9 bits (157),  Expect = 8e-09, Method: Composition-based stats.
 Identities = 27/235 (11%), Positives = 68/235 (29%), Gaps = 14/235 (6%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             ++    +  +    ++        VG    +  GL  +G       +++++V  G + L
Sbjct  83   LSLVLYAVASAATVHTVAAAHDGRRVGWQEGLGAGLSRLGGVVAASFIVLVLVIAGLVAL  142

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            ++PG+   V        L  + +  L A+ +S  LV G WW       L  +++     +
Sbjct  143  VLPGIWIAVALALTTPALVLERLTALAAIRRSYTLVQGRWWRTAAVVALSFLVAFAAVVV  202

Query  282  TARI--------------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
             +                  +    N   S LL P +     +++ + +           
Sbjct  203  VSIPAGVLVATTEDRSLRALIAAVVNALSSGLLIPLTAGVMTVLFLERRGGAAALGAEEG  262

Query  328  KRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPD  382
              ++      +     +              A +  +  ++              
Sbjct  263  SGRYRGFEPPVAPSGGVRPAATRLGEPAGGDAREPQARPREPSDGAPQSDGPPRT  317


>WP_077913226.1 DUF975 family protein [Listeria floridensis]EUJ28218.1 hypothetical 
protein MFLO_12506 [Listeria floridensis FSL S10-1187]
Length=524

 Score = 67.1 bits (160),  Expect = 8e-09, Method: Composition-based stats.
 Identities = 33/164 (20%), Positives = 59/164 (36%), Gaps = 1/164 (1%)

Query  132  AFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFR  191
                I   LL             +   I        +  L+      ++ + + +     
Sbjct  19   WGIAIGVFLLSYLIITAGSSILGFIPVIGWLASFLFVGPLTVGVSWFYLALNRREDPDVG  78

Query  192  SMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQAL  250
             M  G    G   L  IL+ L     +LL IIPG++    +    Y+L ++ NI  L A+
Sbjct  79   YMFSGFNDFGRTLLAYILVTLFTFLWALLFIIPGIIKMYSYSQTFYILRNNPNISALDAI  138

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
             +SR +++GH   +FG  +  L+  L    +      V     +
Sbjct  139  TESRHMMNGHKGRLFGLSLTFLLWYLIPIAIFMIGGAVMGMGAI  182


>MBI4982381.1 hypothetical protein [Candidatus Omnitrophica bacterium]
Length=267

 Score = 65.2 bits (155),  Expect = 8e-09, Method: Composition-based stats.
 Identities = 27/210 (13%), Positives = 64/210 (30%), Gaps = 18/210 (9%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             +    +G     +   +   +        +    +           ++ ++     +  
Sbjct  66   FVITTCIGAWGMVSLFVAIDKITANFSATIKENITEAGRYFVPYLTGMIIVALFAFMILF  125

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +         ++   L +      + ++   +V           L   +          
Sbjct  126  VVRSVVPLHVLALTGELNNKWLLLSVDVIGTAIVFVA--------LYCLIRCSLYGVCCV  177

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY----------VGE  290
             +N G +QAL++SR LV  H   + G F L +++ + L+     I +          VG 
Sbjct  178  VENSGPIQALKRSRELVKSHVNPVVGTFFLSMLVVIILAIPLVAISFFLKNENILAVVGL  237

Query  291  AANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                   L++TPF      ++Y  L+    
Sbjct  238  VYQSLMGLVMTPFLGATLVVLYQKLREVTE  267


>KKS47526.1 hypothetical protein UV09_C0004G0015 [Candidatus Gottesmanbacteria 
bacterium GW2011_GWA2_42_18]
Length=324

 Score = 65.9 bits (157),  Expect = 8e-09, Method: Composition-based stats.
 Identities = 38/188 (20%), Positives = 74/188 (39%), Gaps = 6/188 (3%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
                     +          +     + LL   A   +  N   +    L  + +  +  
Sbjct  31   NWSFEDIDTVFRQIHFPTQNSAGIQINHLLPPSALLNSSWNIYKKTWKSLVKILFFSVYA  90

Query  172  SWMTGSMFIYIC------KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
            + +    +I +       +  V +       L    ++  L  L ++++  G +   IPG
Sbjct  91   AAVQAIQYISLISFIASGEEKVYVKTLFIESLAKAKAYWWLSFLQMVILFSGVMFFFIPG  150

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            +++ VWF F QY+L  + IGGL+A+  SR +V G +W I  R  ++L I    SF+ + +
Sbjct  151  IIYFVWFSFSQYILILEKIGGLKAMLISREIVRGRFWGILLRMGVMLAIFFVASFVLSYV  210

Query  286  PYVGEAAN  293
            P +     
Sbjct  211  PKIMMFIA  218


>WP_128545667.1 hypothetical protein [Larkinella soli]
Length=294

 Score = 65.6 bits (156),  Expect = 8e-09, Method: Composition-based stats.
 Identities = 23/155 (15%), Positives = 51/155 (33%), Gaps = 0/155 (0%)

Query  140  LLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH  199
                  ++       W           ++  + +     +       + +          
Sbjct  72   GGWGVLSFSYTAPSFWAILFFTLLNNLLVAMVVYRHLLSYEERPDEPITVKGIWSWLEVD  131

Query  200  VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
                 L  +   ++V  G+L L +PG+   V       V+  +++   +A+ +   LV G
Sbjct  132  FLQVFLTSVASFVLVLAGALFLFVPGIYLAVVLSLGTIVVMREDLSAGEAIRRCFELVRG  191

Query  260  HWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
            HWW   G  +++L+I + LS        +  A   
Sbjct  192  HWWETGGLLLVMLLIQIVLSNGIGIPLGLLTAGTA  226


>OHB18443.1 hypothetical protein A2666_05150 [Parcubacteria group bacterium 
RIFCSPHIGHO2_01_FULL_47_10b]
Length=395

 Score = 66.7 bits (159),  Expect = 9e-09, Method: Composition-based stats.
 Identities = 56/344 (16%), Positives = 94/344 (27%), Gaps = 23/344 (7%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL-L  220
                  L+       +  I I    + LFR+++   R      L +++L+  V  G+  L
Sbjct  23   LWTIGALVSFWVTFAAYQIVIADEKLSLFRALRSISRKQYFLGLWIVVLVNAVFLGAFAL  82

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
             IIP ++  V   F   V   +N  GL AL +S  LV G+WW I GR +L  +IS  L  
Sbjct  83   FIIPAVVLWVALLFPLVVFFAENRKGLDALARSGQLVKGYWWQIIGRTILFGIISWILPL  142

Query  281  LTARIPYVGEAANLA---------------------FSLLLTPFSFLYYYLIYSDLKANY  319
                    G A                          S +  P    ++Y++Y +L    
Sbjct  143  ALLFGLTFGFAFYGTTNLLLSTLILTIGPQIIFVLYASYIAGPLGLTWFYILYRELAQKL  202

Query  320  RGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQ  379
                + P  ++                LL+                              
Sbjct  203  PAQDYLPSAQKRNIYLLLAVLGGTATVLLMGYALFFAPPPTLEQEGIDPGTFINDKPVAT  262

Query  380  TPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPH-LWL  438
              ++      E  +      +  L   +          G   +      +  +    L L
Sbjct  263  MDEMGIVDIGETPQTRDLKRESDLKSLQLALELYHSRYGNYPVALQLTRSTSEFISALGL  322

Query  439  KLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPA  482
                     + L  K           D     L  R       +
Sbjct  323  LRIDGTIGAIPLDPKSDEDWYYGYESDGQQYRLTARLEEPYDRS  366


>OGG70203.1 hypothetical protein A2929_03920 [Candidatus Kaiserbacteria bacterium 
RIFCSPLOWO2_01_FULL_45_25]OGG81870.1 hypothetical protein 
A3I99_02175 [Candidatus Kaiserbacteria bacterium RIFCSPLOWO2_02_FULL_45_11b]OGG85374.1 
hypothetical protein A3G90_04980 
[Candidatus Kaiserbacteria bacterium RIFCSPLOWO2_12_FULL_45_26]
Length=306

 Score = 65.6 bits (156),  Expect = 9e-09, Method: Composition-based stats.
 Identities = 45/269 (17%), Positives = 95/269 (35%), Gaps = 16/269 (6%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            + L I+LA + I  ++      +          A+ +     + + +     ++  ++  
Sbjct  37   WPLFILLAGSGIVLSVTAYLIEFGTTLGVWQVLALTVVMFIALYINVLGTVVAVKYFLND  96

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
                +            S+  +L L  LV+  G ++ IIPG++   +  F  +  A ++ 
Sbjct  97   GTEKISTYFSEVSSSAWSYLWVLSLSTLVLFSGFIMFIIPGIILSTYLMFALHTRAAESA  156

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR-IPYVGEAANL---------  294
             G+ +L +S  LV G+WWA+  R + L +  + + F     +  +     L         
Sbjct  157  TGMDSLVRSTELVRGYWWAVTARLIFLTLAMVVIFFSVGFALGILASIVGLTQDTADLIS  216

Query  295  ------AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLL  348
                   FS L +        ++Y  L A   G    P   +   +  AI   +L+  +L
Sbjct  217  LLAVEPIFSALGSIICLYAITVMYKALVAMKAGVVAEPNAVRGWYIGLAILTPLLMVAVL  276

Query  349  LVSLSRQNLSAEQLLSAGKDIQQRLGTQP  377
             + L       + +  A +          
Sbjct  277  ALELLGFMNEFDFVDPAQEGPGLMETEAT  305


>TSC61666.1 hypothetical protein G01um1014106_720 [Parcubacteria group bacterium 
Gr01-1014_106]
Length=296

 Score = 65.6 bits (156),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 32/184 (17%), Positives = 70/184 (38%), Gaps = 1/184 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +L + +  +++  A   +A +   A                  +  I LG+  +      
Sbjct  87   VLLLTIPIVIVLSAVSGAAQVFIAANEQRVTFGEAFRWGFSRVLPLIGLGVLQILILCAA  146

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             I    V    +  LG  + G+      + +++     +L++I  +L  +   F   ++A
Sbjct  147  LIPGLVVTAIIAFALGF-NFGTLDGAGAVFMILFFLMLVLVLILLVLIGIRLLFAMPLVA  205

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
                  L++  KS  L  G +W + GR ++  ++ L ++ +   IP VG  A +  S   
Sbjct  206  LKESAVLESFRKSWRLTGGRFWPLLGRVLVAFLVGLVINMILGFIPIVGSLAQVVISAYF  265

Query  301  TPFS  304
              + 
Sbjct  266  MVYF  269


>MBI1317494.1 hypothetical protein [Candidatus Hydrogenedens sp.]
Length=243

 Score = 64.8 bits (154),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 23/192 (12%), Positives = 60/192 (31%), Gaps = 2/192 (1%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            +LL  +        A+L         ++    +     ++    L  + +  ++  Y+  
Sbjct  17   WLLVAIALALGFPMAVLAWAMGESVAESAVASFLFGGLSLFTGALANAAVIVAVNTYLKG  76

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
                L  + +              L +  +  G+L  I+PGL   +   F    +  + +
Sbjct  77   GYATLPDAYRTAFAVFPLVLPAYALSMGAIAIGTLFFILPGLYLGLRLAFVLPAVLLEGL  136

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLL--VISLTLSFLTARIPYVGEAANLAFSLLLTP  302
               +A+E+S  L   +   +F  ++L     +             +     L  S +++ 
Sbjct  137  PPFRAIERSWNLSRDNLIELFMYYLLPFATGLFTVALTYAGADNALSVFGQLFVSTVVSV  196

Query  303  FSFLYYYLIYSD  314
                     + +
Sbjct  197  LYLTIMVDFFRE  208


>WP_091743029.1 hypothetical protein [Marininema mesophilum]SDX57401.1 hypothetical 
protein SAMN05444487_1279 [Marininema mesophilum]
Length=378

 Score = 66.3 bits (158),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 33/310 (11%), Positives = 70/310 (23%), Gaps = 2/310 (1%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
             F  +    L +  L  V+         L       N                       
Sbjct  67   FFSEKAIIALILLCLFYVVTVFIFTPFQLAGLTGMANEGIMGGSTRFSSYFRMGARYFWR  126

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
             +   + +Y+    V +  S+   +           +  L+V   SLL +IP  L  + F
Sbjct  127  MLGHLLLMYLIIAVVSIPSSILDRISKY-LIDSGNEVEGLIVALVSLLFMIPVFLTAMVF  185

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
             F  ++L  +N G  Q+++ S  L        F   V +++ S+ +      +       
Sbjct  186  MFSPFILTAENKGPWQSIKLSFTLFRKAPGKYFATLVFIMIYSIFIGIFITILVIFFFIF  245

Query  293  NLAFSLL-LTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVS  351
            +    +   T         +   L  +            +L                   
Sbjct  246  SSVGGVSGTTLLVLSIIAFLVFSLIGSLMFSLIAVRYHNYLRRWITPEAGPNSFMPPHYG  305

Query  352  LSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTS  411
                +                  T P      +       +   +   +     +     
Sbjct  306  GQFPDNGPSPEGFNSNSNNLGWQTPPPPNQGPSHGDNPYNETKPATHEENNKEAKNPNPQ  365

Query  412  EGGLSLGPVT  421
                      
Sbjct  366  SYPHFPSDPH  375


>NLO94611.1 hypothetical protein [Firmicutes bacterium]
Length=309

 Score = 65.6 bits (156),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 35/317 (11%), Positives = 75/317 (24%), Gaps = 31/317 (10%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            V CP CGA                          +       +                R
Sbjct  23   VVCPSCGAVL--------------------HEGCWRGHGGCSSCGWRESPEPTKRCPYCR  62

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
                   +  K             Q +    A+                +        + 
Sbjct  63   ELIHGEAVICKHCRSSLQAAPGSGQGQEGLVAAALRAGWRGFTSRAGLHVGIFLFGVAMV  122

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            +    +       +    L+       +       +    V   L  L      +   + 
Sbjct  123  LLAALLAYGLFGSYRYYPLRRGVGYFLRMAPGSALLFGLGVYLALQWLLVGLVKIHSALA  182

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
                  F  +  G   +  F    +   L+      L    G+   ++  F  Y++    
Sbjct  183  AGQPATFGQLFSGGDRLLPFVGASLFTGLLADLAWDLWPGVGIAVMIFLVFVPYLVVAHG  242

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
            +G L+A+  S  LV  ++ ++ G  V+  V+              G       +++  P 
Sbjct  243  LGPLRAMNVSVQLVLNNFGSVLGFVVVGAVLIAV-----------GVFLAGLGTIVTGPI  291

Query  304  SFLYYYLIYSDLKANYR  320
              +    +  +L A + 
Sbjct  292  VGIGTAHLCRNLLARWD  308


>WP_133164692.1 hypothetical protein [Candidatus Sulfotelmatomonas gaucii]SPE24946.1 
conserved membrane hypothetical protein [Candidatus 
Sulfotelmatomonas gaucii]
Length=344

 Score = 65.9 bits (157),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 32/239 (13%), Positives = 68/239 (28%), Gaps = 20/239 (8%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            +  + +        +A  +                   A +  + + L     ++     
Sbjct  92   VAWIYLGETATVRAAAGSVLLRLRRYLWLMAVTVFRAWAPLYVLYVVLFAALFAILPSGF  151

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
              +  + +             L+ +L+   +  G+ +    G++  + +         + 
Sbjct  152  LFNPQVAQHPPAMNPSTAIGFLVGMLIFAPLFLGATIY---GVIMWLRYSLAMPACVVEG  208

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA------------  291
            +   QA+ +S  L  G    IF  ++L+ VI L L  L      +               
Sbjct  209  LSTRQAIRRSIELSQGSRGRIFVLWLLVYVIRLLLGILFGFPFIILGLKHPGHALPLALL  268

Query  292  -----ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIP  345
                 A    + L+ P       L Y DL+    G     + +    L  A  G    P
Sbjct  269  AVSEVAAFVTNTLIGPIYSTGLTLFYYDLRIRKEGFDIEWMMQAAGLLPQARTGLAGFP  327


>KKW24533.1 hypothetical protein UY66_C0001G0034 [Parcubacteria group bacterium 
GW2011_GWC1_51_35]KKW26042.1 hypothetical protein UY68_C0001G0035 
[Parcubacteria group bacterium GW2011_GWF2_52_12]KKW27024.1 
hypothetical protein UY69_C0017G0003 [Parcubacteria 
group bacterium GW2011_GWF1_52_5]KKW34834.1 hypothetical 
protein UY80_C0005G0022 [Parcubacteria group bacterium GW2011_GWB1_53_43]
Length=284

 Score = 65.2 bits (155),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 51/228 (22%), Positives = 76/228 (33%), Gaps = 4/228 (2%)

Query  129  IVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG  188
            +  +               L    +N    +    +   +  L+       I    +   
Sbjct  1    MFFSALMRGIFFGEGFFHSLALPGENTALLVASVLLMLAVHILTAGAVLRVITEGDSAAS  60

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
               ++       G    L  L  L+  G   L IIPG++F VWF     VL  +   G +
Sbjct  61   PRNALAYAWGRRGDLFSLFFLNFLLG-GAYALFIIPGIIFQVWFSLSVIVLIAEGRSGTE  119

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA---ANLAFSLLLTPFSF  305
            AL  SR  V GH WA+F R   L+  +L +S     +P             S L  PF  
Sbjct  120  ALLASREYVRGHDWAVFSRIGFLVFFALLISSAADLLPLPSPVRQGLTFVLSALAGPFIA  179

Query  306  LYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLS  353
            +Y Y IY  LK + +G    P  R+  P  A          +      
Sbjct  180  IYLYRIYHSLKHHGKGVNKYPAHRRIAPYLALGMLGWAFILIFGSFFI  227


>MBV70577.1 hypothetical protein [Myxococcales bacterium]
Length=468

 Score = 66.3 bits (158),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 47/448 (10%), Positives = 107/448 (24%), Gaps = 23/448 (5%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
               CP C      P +++P +     C  C   +  D   +     T+           R
Sbjct  2    KFNCPKCNTPHMFPDAEIPEEGLVVECTACGHHIGLDSFAADGDDKTNVQFEQQPADDPR  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
             + S R+++     +    +R        +  +        S+               L 
Sbjct  62   AVLSARIKLDDADWSEMDDSRIMPALESGQPASRPRPEGVGSRNPDYQPSPVSEALISLQ  121

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
            G          +    +     A      +  +     L         +      M    
Sbjct  122  GAVGFNDPALSSSHTPSPNADSAEVSARLSGPYFSWRDLVQALATPFEMRRFFAVMGSIW  181

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                  +  S    L    +     +   L +   S+ +++  L+   +     Y    +
Sbjct  182  TGL---IIHSFFSWLSLKVALKSGALGGTLALVTWSISVVLGCLICA-FCAHQTYRQCVE  237

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG-------------  289
                  ++  S   V G   +I G  V+ + +   L  L + I ++G             
Sbjct  238  KQS--TSIRSSIDWVRGWLSSILGAPVVAVGVIAILVVLESLIGFMGRIPYAGPAIWGVL  295

Query  290  ----EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIP  345
                   ++A  L L   ++     I   +                    ++    +   
Sbjct  296  SLAVVLISIAGGLALVAMTYGLLLYIPLIIAERTGPMDTLKRILSLFKHHSSQIILLGTT  355

Query  346  GLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSK  405
             ++ + +      A  L+   +   +           +       P    S  +      
Sbjct  356  SVIGIGVFLALTLAPALIIGREFTNRIAQISMGGAFQMTIDQTPSPLAAMSRLFFRAGYS  415

Query  406  QRKTTSEGGLSLGPVTLFADRFWADDQN  433
               T    G  +G          A   +
Sbjct  416  DVPTGPNIGHDIGGFFAGMASLIAPSIS  443


>NUN53930.1 hypothetical protein [Planctomycetaceae bacterium]
Length=386

 Score = 65.9 bits (157),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 38/232 (16%), Positives = 67/232 (29%), Gaps = 37/232 (16%)

Query  425  DRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFH  484
                   +     L++ L   P   L    +  + +D  +D+  R    R       +F 
Sbjct  159  SAEAEAAEPGKEVLRVGLRFVPLAGLDLAEAGSLRVDAAVDEKGRA--FRTSGDGMSSFG  216

Query  485  WVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQ  544
                   D   +F          G   E++    G LE+ LP A E+ +L    +  T +
Sbjct  217  SDAGIAGDRWAVFDAS-------GGTGERLAEFAGALEVLLPAAEETAELDLTTLPATAR  269

Query  545  IGGKQLILQRLGSNAVTLRFLG---------------------------DRTDLLNVHAS  577
            IGG    +  +G  ++ +   G                            + D L V   
Sbjct  270  IGGTTFTVLEVGDASMKVTVAGALLGGSDASPPGPQVVTGEEGGGGMFGPKPDALRVLLR  329

Query  578  NSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFELT  629
            ++  + L    +     GD  +      G     TV        +  PF   
Sbjct  330  DAAGKVLTSSSWG-GGGGDVMTYDFDLSGRPVKATVSARTKVEKRDVPFRFK  380


>NQU99098.1 hypothetical protein [Parcubacteria group bacterium]
Length=250

 Score = 64.4 bits (153),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 41/199 (21%), Positives = 72/199 (36%), Gaps = 4/199 (2%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L  +  +GI    A       L     L           ++A     LLG + +  +   
Sbjct  50   LGFVGGIGIFSFIATAPFESELLFNAILLTSAIILTLLYIIAITFLPLLGQAALIKATDD  109

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                                     +  L  L+   G + LIIPG++      F  Y+L 
Sbjct  110  IAKGQKKTTKEYFIFAWNLKWKIWGVYFLSALLAMVGFIFLIIPGIIIGFCMMFAPYILI  169

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
             +N    + L++S +LV  ++  +  + +L+      +   ++ IP     AN+  S L 
Sbjct  170  LENKKITECLKESFILVKDNFMNLLWKHILVFFAFAAIIMFSSFIP----LANMVISFLS  225

Query  301  TPFSFLYYYLIYSDLKANY  319
              FS +Y YL+Y D+K   
Sbjct  226  GIFSIIYVYLLYVDIKKVK  244


>TAN57580.1 hypothetical protein EPN15_03890 [Patescibacteria group bacterium]
Length=479

 Score = 66.3 bits (158),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 49/344 (14%), Positives = 92/344 (27%), Gaps = 15/344 (4%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             V  +   LS           +  +GL  +       + SF  + IL I+++GGGS L I
Sbjct  131  LVIAVAHILSQAALIYAAVHREKIIGLGEAFGFASGKLFSFLWVAILSIIIIGGGSFLFI  190

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            IPG++F +WF    +VL  ++  G+ AL KS+  + G+  A+F R     +I     F  
Sbjct  191  IPGIIFAIWFILAPFVLMGEDARGMAALLKSKEYIRGNGLAVFWRLFAFGLIVGLAFFAV  250

Query  283  ARIPYVGE-------------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
                 +                                   + +   A            
Sbjct  251  YLGVSIIGGIIIAAIKSSMIKLIAGLIFFFTPFIIQPIVAALNAIYAALIYEHLRAARPA  310

Query  330  QWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPE  389
                 + A     L  G+L ++L    L+           +             N  +  
Sbjct  311  SAYTPSVAQKAIFLGIGILGLALPVLGLAGAVGSLFSSYGKF--KNSNPSLDLKNFQINA  368

Query  390  EPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLS  449
                          +       +   +       +   + D     L   LE     N  
Sbjct  369  NFNSGIYFGNNKNSNNANSANGQSFNNNASNNNASGNKFPDTDKDGLPDNLEAFYKTNPD  428

Query  450  LAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDE  493
                    +     ++    +  +     +         +  D 
Sbjct  429  AVDTDFDGLTDSAEVNSWKTNPINPDTDGDGFRDGDEVKSGYDP  472


>WP_071067494.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Arthrobacter sp. FB24]
Length=424

 Score = 65.9 bits (157),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 25/222 (11%), Positives = 56/222 (25%), Gaps = 31/222 (14%)

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            +  +     + I          +        S      L+I+ +  G +      LL  +
Sbjct  116  IGALLALGGLQILAGIAFAVVLVGGTFLLADSMGATSALIIVPLFLGGIAT---ILLVSI  172

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR------  284
                    +  +  G L  L +S  L   +WW IFG  +++ ++   +S +         
Sbjct  173  RLMVTPAAIVVEEQGVLDGLRRSWQLTRHNWWRIFGIVLVISLLIGIISQIVQIPIGLAT  232

Query  285  ----------------------IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGP  322
                                  +             +   F      L+Y DL+    G 
Sbjct  233  GGFSSVIAPHGGDDGQRVLSAVVGIASIIVTAVIGAVGYAFQTSVMGLLYMDLRMRKEGL  292

Query  323  QHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLS  364
                 ++      +       +      +    + S  +   
Sbjct  293  DLALQRQLESGEDSDGVPGRGVAPEGNFAGGPTSGSWTRSPY  334


>HEC97179.1 hypothetical protein [Nitrospirae bacterium]
Length=239

 Score = 64.0 bits (152),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 27/203 (13%), Positives = 64/203 (32%), Gaps = 2/203 (1%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
            + ++          ++       P     +  + +    +       +   M     K  
Sbjct  36   IVVIGMRMTAMEPDVMGKLGEGMPDLIYVKSVLSIVGWVFYGFAQGMVASMMVELEDKGQ  95

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGS--LLLIIPGLLFCVWFFFCQYVLADDNI  244
              +          + S  +   ++  ++  G   +L IIPGL+    F F   V+  +  
Sbjct  96   TSIGFGFGRANEMIISLMVSGFIIGALLLLGFTFMLFIIPGLIVAFIFIFTFVVIMFEKR  155

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
            G + A+++S  +V  +    F  F  ++ I   L F +  +  +     L+   L   + 
Sbjct  156  GPIDAMKRSVQIVRSNLSDTFKLFAAIIGIGFFLMFFSVILSKLSLIGILSSMALTGAYM  215

Query  305  FLYYYLIYSDLKANYRGPQHPPI  327
               Y +     +      +  P 
Sbjct  216  GYTYVVFVKAYQKFKEEGRGFPR  238


>MBI4145956.1 hypothetical protein [Candidatus Woesearchaeota archaeon]
Length=277

 Score = 64.8 bits (154),  Expect = 1e-08, Method: Composition-based stats.
 Identities = 29/176 (16%), Positives = 68/176 (39%), Gaps = 0/176 (0%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             I            F   +      L+  + N +       V    +G++     + + +
Sbjct  90   AIDFGWQTTKKHLGFFIGVGLFILVLSLISGNAKGIATKILVGLFQVGVTLGYLKLALDL  149

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                   F+ +    + + ++ +  ++  L V  G +LL+IPG+++ + F    +V+  +
Sbjct  150  VDNKPAAFKELFSAFKQLPAYIITFVIYGLTVLLGLVLLVIPGIIWAIAFGLYPFVILLE  209

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
            N G + AL KS  L  G    +   ++ LL +++  +       +V    +L  + 
Sbjct  210  NAGPMTALRKSAHLTEGVKPKLLVFYLALLGLNILGTIPIGLGLFVTMPISLIAAA  265


>MBI3158699.1 hypothetical protein [Chloroflexi bacterium]
Length=254

 Score = 64.4 bits (153),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 44/221 (20%), Positives = 84/221 (38%), Gaps = 8/221 (4%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
                          I  L  ++        +   P     P+     +  ++ +  + LL
Sbjct  22   MAISLQIYLTRFPVIAGLAALVNLPVNMYFVFYGPPAPATPEETLAYYPYIIVSSLFTLL  81

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             +  +   + + I     G+  ++   L          +L +L +  GSLL I+PGL   
Sbjct  82   AVLAIAYVVELTIFAERPGINAAINHALTRWLPGIGTFLLALLAILSGSLLFILPGLALG  141

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR-----  284
            ++FFF  YV++  +   + AL  SR +V GHWW   GR + ++ +    S   A      
Sbjct  142  IFFFFDVYVVSLRDFSLMGALNYSRTVVQGHWWQTLGRVLFVMFLVTAPSLFIAFSMGSA  201

Query  285  ---IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGP  322
               I  VGE        ++  F+ + + +++ +L      P
Sbjct  202  GPRIGQVGEILYDTLLDVIFAFALVAFTILFLNLDYRKNSP  242


>NJN27555.1 hypothetical protein [Cyclobacteriaceae bacterium]
Length=228

 Score = 64.0 bits (152),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 24/172 (14%), Positives = 61/172 (35%), Gaps = 0/172 (0%)

Query  142  LKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVG  201
            ++             +  L      I   ++     M+      ++ +   +    R+V 
Sbjct  1    MQNPFSFISNPSYIGFIFLSTFSTLISFAVTVNYLKMYQSSYPKEITVTEVLNASWRYVL  60

Query  202  SFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW  261
               +L I+ ++++  G L  ++PG    V       V+  ++ G  +++ ++  L++G W
Sbjct  61   PLLILAIVAMIIIVLGMLAFVLPGFYLMVVLALAFPVVLIEDKGIFESIGRAFKLINGKW  120

Query  262  WAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
            W+ FG   +  ++   +S +     YV     +      T          + 
Sbjct  121  WSTFGLLFISSILMYAISLIFIIPFYVFYFLQIFSLTEQTGVGVETSAWWFQ  172


>OHA80890.1 hypothetical protein A2675_02245 [Candidatus Yonathbacteria bacterium 
RIFCSPHIGHO2_01_FULL_51_10]
Length=339

 Score = 65.2 bits (155),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 57/271 (21%), Positives = 108/271 (40%), Gaps = 15/271 (6%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWA-----  158
             +L + +WE+F +RGW L G+ +LG +       +A      T       ++        
Sbjct  67   HELFSRAWEIFKKRGWVLFGVLVLGGIPLAILFAAAFAFGVVTIFQIAAWSFVTKQLLAL  126

Query  159  -ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG  217
              + A V  I+           +     D G+F S + G   + +   + IL    V GG
Sbjct  127  VGIGAVVLIIIALWIGAAMLYAVRDDMADNGVFGSYRYGWSKIPALFWVSILSAFAVLGG  186

Query  218  SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
              L IIPG++  +W    ++V+  +   G+ AL KS+  V+G WWA+FGR +  + +   
Sbjct  187  IFLAIIPGIIVGIWLVAARFVVVVEGDRGIYALAKSKAYVAGRWWAVFGRMLAAMAVVFA  246

Query  278  LSFLTARI---------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
            ++   + I           V +A +L  +  +  + F Y Y +Y+ L+ +          
Sbjct  247  VNAAMSTIVKIIVSPNALIVAQAVSLIVTTTVNIYMFAYGYALYTSLRESRLEVSTRTDH  306

Query  329  RQWLPLTAAIFGWMLIPGLLLVSLSRQNLSA  359
             +           +++  L+   +     S 
Sbjct  307  PRGALNLFIAIPLLVVGALVGYVIMHSMKSF  337


>NCU42827.1 hypothetical protein [Candidatus Falkowbacteria bacterium]
Length=252

 Score = 64.0 bits (152),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 39/227 (17%), Positives = 92/227 (41%), Gaps = 17/227 (7%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W+ + +  +  + + + G++ +   +    +   A     +       + + ++  +   
Sbjct  26   WQEYKKYAFKFIELLIYGLIGSLPFLILIFVFFNALVKFEEGLIGFGVLGILSIFLVAAF  85

Query  171  LSWMTGSMFIYI-------CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
            +  +  ++   I                S K   +++  F  +  LL +++    +L ++
Sbjct  86   VLMIIWNLRAQIGSIILLKEDYQPSPRESFKKANKYLVPFLGVTALLTVLIFAWGMLFLL  145

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            P L+F +++ F QYV   ++     ++E+S  LV  +WW +FGRF LL++I   +  L +
Sbjct  146  PALIFGIYYGFAQYVQIAEDKRPFSSIERSYDLVLNYWWPVFGRFCLLILIGFLVYGLLS  205

Query  284  RIPYVGE----------AANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
               +                    +LL+PF  +Y Y +Y +L     
Sbjct  206  VPFFWIGDSGWLFESYNLLINLIWVLLSPFFTIYPYKVYKNLVKIKD  252


>WP_088618176.1 hypothetical protein [Methylovulum psychrotolerans]ASF45294.1 
hypothetical protein CEK71_04005 [Methylovulum psychrotolerans]
Length=251

 Score = 64.0 bits (152),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 41/212 (19%), Positives = 77/212 (36%), Gaps = 10/212 (5%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
              G   +  +  +     I   +          +        ++A      +  + +   
Sbjct  29   WIGFSTVLGVLSLGFNWFIKQYIGPGADASALAEGGTVLLLAIVAVTLCSSVSYAAIVYR  88

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
            +     +       +++LG++   S  L   L +L V  G+LLL++PG++  +   F   
Sbjct  89   LDNTAHQRQDSFMEALQLGIKKTPSLLLAGFLYMLAVSVGTLLLLVPGIILSLSLLFHIN  148

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV--------ISLTLSFL--TARIPY  287
             +  D      +L+ S  LV G WW   G   + +V        + + L F+   A I  
Sbjct  149  FIVLDAKKAYGSLKASYALVKGSWWRTAGVLTVPMVLLLLFYFILIMLLGFIVKFAGIDG  208

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
            + E     FS+  TPF     Y+ + DLK   
Sbjct  209  IAELIINLFSIASTPFFLTVLYVQFHDLKLRK  240


>WP_080064285.1 hypothetical protein [Ruminiclostridium hungatei]OPX44292.1 hypothetical 
protein CLHUN_18460 [Ruminiclostridium hungatei]
Length=258

 Score = 64.0 bits (152),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 31/161 (19%), Positives = 52/161 (32%), Gaps = 1/161 (1%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                   L LS    S    +     G+  S K   R+ G   +  IL  L+V  G +  
Sbjct  95   VIRLITNLFLSVYMYSYISELKGKTSGIAASFKGAFRNFGRLVVYNILFGLLVLMGLIFF  154

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS-LTLSF  280
            +IPG++  + F F    + D  +    A   S  L       I   F+   ++  L +  
Sbjct  155  VIPGIIAYIIFIFGFCYILDLKLNVADAFTASSELTKRRKMQIVSVFMGFYLMFELPIVL  214

Query  281  LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
            L +            FS +           +Y D++     
Sbjct  215  LISGSTLGSAYLASFFSTIAGIILQRLITQLYMDMEYKKER  255


>MBI4039457.1 hypothetical protein [Candidatus Daviesbacteria bacterium]
Length=253

 Score = 64.0 bits (152),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 48/227 (21%), Positives = 89/227 (39%), Gaps = 10/227 (4%)

Query  98   SGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQW  157
              L   +  L     LF +    L+ I  L  +  F   +         +          
Sbjct  12   WQLYIANFNLFLGMGLFGQLLSALVEILWLPALPLFTEKYIGDQFNQVNFARLGFGLLFL  71

Query  158  AILLATVAYILLGLSWMTGSMFIYICK-TDVGLFRSMKLGLRHVGSFTLLLILLILVVGG  216
               +  +  I    +         +C+   +G+  S + GL  +  +    IL+ L+   
Sbjct  72   VFAVVGLLVIEAIRAIALIIAIDRVCRHQSIGVLESFRSGLSRLWVYLWTNILVGLMTII  131

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
            G +LL++PG+   + F F  Y++  +N  G+QA++ SR +V G+WW IF R +  + + L
Sbjct  132  GLILLVVPGIYLWISFSFVSYLIILENQKGVQAIKLSREMVKGYWWKIFVRILFSICLLL  191

Query  277  TLSFLTARIPYVGEA---------ANLAFSLLLTPFSFLYYYLIYSD  314
             LS     + +V              L F+++  P    Y Y +Y +
Sbjct  192  ILSASQYLVWFVLSFTDQVLIDFAIELLFAIVFLPIFIAYGYFLYQE  238


>VTU00964.1 Uncharacterized protein OS=Haliangium ochraceum (strain DSM 14365 
/ JCM 11303 / SMP-2) GN=Hoch_3130 PE=4 SV=1 [Gemmataceae 
bacterium]
Length=278

 Score = 64.4 bits (153),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 32/186 (17%), Positives = 67/186 (36%), Gaps = 25/186 (13%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             V    +G + +   +          +  ++       G+     IL+ ++V  G +  I
Sbjct  73   VVVLQPIGTAAILHIIMQEYRGKSASIGDALSFAFTRFGALLGTSILVGVLVVVGFICCI  132

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            IPG+   V + F   V+  + + G  AL + + L+ GH   +FG  +L+L+    +    
Sbjct  133  IPGVYLYVSYIFVAQVVVLERLSGGAALGRCQKLIEGHRGRVFGVVLLVLIGGKLVETGV  192

Query  283  -----ARIP--------------------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKA  317
                 A +P                     +   A     +L + ++ +   L+Y D++ 
Sbjct  193  DKGLEAVLPPVKVVAAADGPRPEVSVPNYLINTLAGGLVQILFSTYAAVCTTLLYLDVRI  252

Query  318  NYRGPQ  323
               G  
Sbjct  253  RKEGFD  258


>OHA00147.1 hypothetical protein A3C07_00290 [Candidatus Sungbacteria bacterium 
RIFCSPHIGHO2_02_FULL_47_11]
Length=318

 Score = 64.8 bits (154),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 34/199 (17%), Positives = 65/199 (33%), Gaps = 6/199 (3%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
              L  +             + + +  A       +      +L  +   ++G+       
Sbjct  21   MFLGALLFYAEHWHVILGIALIPILIAGPNILLGKYTPSLAILIAMLATVVGVLARLAMF  80

Query  179  FIYICKTDVG--LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             +     +    +  + K G + +  F  +  L+ L   GG  L I+PG+L  +W     
Sbjct  81   DVVSENGEPSGGIIGAYKKGWQILIPFVWVSALVTLTTLGGFFLFIVPGVLLSIWLSMSL  140

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR----IPYVGEAA  292
            Y    +   G+ AL  S   V G+W+ +F RFV L +I    + L         +     
Sbjct  141  YAFIVEGHKGISALTTSWHYVKGYWFPVFWRFVFLGIIIGAANLLIGLATTSPVFFAALK  200

Query  293  NLAFSLLLTPFSFLYYYLI  311
                +   +P       L 
Sbjct  201  TGGATTDPSPLWQFVSLLF  219


>PIS23345.1 hypothetical protein COT49_00665 [candidate division WWE3 bacterium 
CG08_land_8_20_14_0_20_40_13]
Length=177

 Score = 62.5 bits (148),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 28/175 (16%), Positives = 65/175 (37%), Gaps = 13/175 (7%)

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGL--RHVGSFTLLLI  208
                W +  +L ++    +       ++   +       ++++   L  +++  +  +  
Sbjct  12   LFGFWSFGFVLISIYLSCIIQIGFMRTLLSLVRGEKKLSWKTIFPQLTVKYLLRYFAVSF  71

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
            L  L+V  G +LL++PG+ + + + F + +  D  IG  +A   S  +  G  W +    
Sbjct  72   LYGLLVAFGFILLVVPGIYWAIKYSFSELLFIDKEIGIKEAFNLSGKMTQGIKWQL----  127

Query  269  VLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                   +        I  +G  A     ++  P S L   L+Y+ L +      
Sbjct  128  -------ILFGLAILGINILGVLALGVGLIVTVPLSVLSNLLLYTHLLSRLPKKD  175


>RYX82517.1 hypothetical protein EON83_18975 [bacterium]
Length=549

 Score = 65.9 bits (157),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 39/367 (11%), Positives = 91/367 (25%), Gaps = 25/367 (7%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             +Y+  + L F  +   +            Q W +          +L L+ +   +  ++
Sbjct  59   LLYIFTMPLMFGAVSCVVAAAVRGQNVSFKQVWGFTKPRYGALLGVLILAMILMGIVAFV  118

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                +GL   +         +    +  I+ V  G +       +   WF     +   +
Sbjct  119  LLFVIGLVVILVGAAASNYGWFATGVAWIVGVIAGLIGGSFLLAIVMGWFNLAPVIACLE  178

Query  243  NI-GGLQALEKSRLLVSGHWWAIFGRFVLLLVI-SLTLSFLTARIPYV------------  288
            +   G  AL ++  L++G+W    G   +L +  +     +   + +             
Sbjct  179  DANRGSNALSRAWSLMAGNWRKACGVATILTLAGTAAFLIIFGFLMFFFYGGWDKFVGAN  238

Query  289  --------GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFG  340
                           F ++ TP   +   + Y DL+                P T     
Sbjct  239  TDVFSGLAFSGFATLFFVVWTPLQAVVGAVFYLDLRTRKEALDLEWTNYAAKPETLTTTE  298

Query  341  WMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYK  400
             +        +                   Q  G  P               +   A   
Sbjct  299  QVAPTAGYASAPPPNFAGGP---PYAPASTQSAGPPPPPNFPPPPQGTGFSGQAPVAPPP  355

Query  401  LLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEI  460
            + ++    + +     +               N            P +   +K +  +E+
Sbjct  356  VGVAPADDSFAAFAAQIESAPQNIGTVPPSGINIEKAASPAPVTPPPIVNIEKAAETVEV  415

Query  461  DKVLDDD  467
              V    
Sbjct  416  QAVEFSP  422


>WP_179512173.1 MULTISPECIES: hypothetical protein [unclassified Sphingomonas]MBB4879519.1 
hypothetical protein [Sphingomonas sp. R1S3D]NYG87863.1 
hypothetical protein [Sphingomonas sp. R3G8D]NYG93858.1 
hypothetical protein [Sphingomonas sp. R1S6C]
Length=223

 Score = 63.3 bits (150),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 28/178 (16%), Positives = 59/178 (33%), Gaps = 10/178 (6%)

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG  215
                 +  + +  + ++ +  +   Y+      L   +  G         + +L  +  G
Sbjct  34   MMMGWVVQLFFWAVAVAAVIQAGLAYVEGRRPDLGEILYTGGMRSLPMLAVYLLYAIATG  93

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
             G  LL++PG+   V +      +  +  G   A ++S  L +G  W I G  +L + I+
Sbjct  94   IGFTLLLVPGIFLAVIWSMAMVAMVAEEPGVFGAFKRSAALTNGARWKILGILLLTVAIA  153

Query  276  LTLSFLTARIPYVGEA----------ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
            L ++ L   +                A +  SLL T        +  +          
Sbjct  154  LIVNLLGGFVVLTAGLRPETQPAALPAQIVMSLLNTIVLTWTTTMFTALFVELREWKD  211


>WP_184452173.1 hypothetical protein [Schaalia hyovaginalis]MBB6334294.1 hypothetical 
protein [Schaalia hyovaginalis]
Length=373

 Score = 65.2 bits (155),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 22/176 (13%), Positives = 59/176 (34%), Gaps = 23/176 (13%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +  +      +   + +      + L    +  +        L+ L+I+ V   +L+ 
Sbjct  183  LLICLLSFAFVALMTVLILTAVFGVLALAPEFEGSVFAWMIMLALVCLVIIAVSFAALVA  242

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS-LTLSF  280
            +       V  FF       +  G ++++ +S  L  G +  + GR++L++++    +  
Sbjct  243  L------AVRLFFAPIACVLEERGPVESISRSWALTRGAFARLLGRYLLMILVVNTLVGV  296

Query  281  LTARIPYVGEAANLAFSLLL----------------TPFSFLYYYLIYSDLKANYR  320
            L   +  V  +  +  +  +                 P       L+Y+D +    
Sbjct  297  LVGAVTGVMTSIMMLLNSPVFDGAASGLTVVLAGIAIPVQVACTVLMYTDERMRTE  352


>MBE3596173.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Hydrogenibacillus sp.]
Length=205

 Score = 62.9 bits (149),  Expect = 2e-08, Method: Composition-based stats.
 Identities = 37/184 (20%), Positives = 63/184 (34%), Gaps = 1/184 (1%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
             G   L  +L     F    L          +N          A +   L      +++ 
Sbjct  17   FGTLFLAWLLYSILTFLPTFLVSRYETAAYGENALPVYSNLVSALVSSILLGGFLYVYLQ  76

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
              + +      +     +   + L   +  L++  G LLL+IPG+   V   F  Y +AD
Sbjct  77   ASRGNPVHATEVLSFPLYFWKYLLAGTIYTLIISLGLLLLLIPGIFLAVRLGFWPYAIAD  136

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
               G L++L  S  L  GH+W++ G ++L +VI L    L             A +    
Sbjct  137  -GRGVLESLSYSWNLTKGHFWSLLGLYLLAVVIILIGVLLLVIGVIPATILVYAMTAQYY  195

Query  302  PFSF  305
                
Sbjct  196  VLLQ  199


>PKL92426.1 hypothetical protein CVV21_03705 [Candidatus Goldbacteria bacterium 
HGW-Goldbacteria-1]
Length=295

 Score = 64.4 bits (153),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 44/233 (19%), Positives = 82/233 (35%), Gaps = 12/233 (5%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
              + + LL   ++ + +        + L      NP      +A + A     + G +  
Sbjct  20   YFKNFPLLFTIMIIMEIPIFGYMFQMGLMTKNPENPVYAIMFFATMAAATLLAVAGWNAS  79

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
              +          GL  + K+G +         IL IL+  GG++LLIIPGL+F + + F
Sbjct  80   VSAFSSLYKGEKTGLGIAYKVGFKRYFRSLAAGILYILIAMGGTVLLIIPGLIFIITYMF  139

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE----  290
               V   +++     L+ S  L   + W +F  F+LL ++    +FL + +         
Sbjct  140  AIPVAVLEDVKV-SPLKMSAKLSRNNKWQLFAIFLLLYIVVCGPAFLISYLSGAYADPVK  198

Query  291  -------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTA  336
                     +    +LL         + Y  LK   +    P   +       
Sbjct  199  SMDITYLLLSSIPGILLGQLFTGALVIAYEKLKEVKKDEIKPEELKGLSTPIG  251


>TML36175.1 hypothetical protein E6G29_06075 [Actinobacteria bacterium]
Length=361

 Score = 64.8 bits (154),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 34/224 (15%), Positives = 76/224 (34%), Gaps = 12/224 (5%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI--L  168
            +  + R     L +  + +V     +    L +     +         + +     I   
Sbjct  23   FGTYFRNFGTFLALAAVVVVPVQLILSGIGLKQLWAHYDKTPSAAATFLPIVVNWLITTP  82

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
            L  +    ++             ++  GL    +  + ++L  L +  G + LI+PG+  
Sbjct  83   LVTAMTIYALLDLGEGRRPNAREAILRGLEIFTAVLVPVVLAALGIIAGLIALIVPGIYL  142

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP--  286
             V ++F    +  +   G  AL++S  LVSG  W +FG  ++  +   + +         
Sbjct  143  AVRWYFVAQAVVVEERRGPAALQRSWELVSGSGWRVFGILIIASLAIGSATRALQTPFTA  202

Query  287  --------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGP  322
                    ++     +   +L  P   +   L+Y DL+      
Sbjct  203  GASAANMAFIQLIGVMVTQILAAPAGAVIGALLYFDLRTRKELM  246


>WP_110520753.1 hypothetical protein [Bacillus lacisalsi]PYZ96814.1 hypothetical 
protein CR205_14130 [Bacillus lacisalsi]
Length=199

 Score = 62.5 bits (148),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 36/166 (22%), Positives = 70/166 (42%), Gaps = 5/166 (3%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             +   +L        +            ++ +  LR      +  +L  L+ GGG+LL I
Sbjct  17   ILYSFILFAYVTLLVLIPEDTANKDRRRKARRPLLRLFIPVVIATVLFFLIFGGGTLLFI  76

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            IPG++  V+F    +V+  +N    +AL++S +L+ G ++   G F++ L     +    
Sbjct  77   IPGIIAFVFFLLFSHVITIENKTPYEALKRSAVLIKGSFFKAAGLFLIFLGAQALVVVTA  136

Query  283  ARI-----PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                     Y+  AA  A  +L+ PF      L+Y + +A++    
Sbjct  137  GFYLPLDTLYLNAAAGAAAHILIVPFQAGVMTLLYMESRADHEAFD  182


>MAG37794.1 hypothetical protein [Candidatus Pacearchaeota archaeon]
Length=250

 Score = 63.6 bits (151),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 42/208 (20%), Positives = 81/208 (39%), Gaps = 0/208 (0%)

Query  90   EREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLN  149
                   G       +       L        + +  L     F                
Sbjct  1    MGFGEIFGKSWNEYWKNFLVILTLMFLFYIIPMFLLNLYNSNYFPGETLLGQENVGVDFI  60

Query  150  PQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLIL  209
             QN N+   +++ ++   L+ L      + I + K       ++KLGL++   + LL+I+
Sbjct  61   IQNVNYFGTLIIFSIVAFLIALIASLSILSISLKKDKYTFSEALKLGLKNYFGYLLLVIV  120

Query  210  LILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV  269
            + + + G  +LLIIPG++F V++    YV  ++  G + +L++S  LV G WW   G  +
Sbjct  121  IWIFLAGLFILLIIPGIIFMVYWTLASYVFINEKKGIIASLKESMRLVKGRWWKTLGYGI  180

Query  270  LLLVISLTLSFLTARIPYVGEAANLAFS  297
            L+ +I + +      I  V +       
Sbjct  181  LIFLIVIAIGIGIGLIGMVIQLILGIPL  208


>WP_072854441.1 glycerophosphodiester phosphodiesterase [Lactonifactor longoviformis]POP31960.1 
hypothetical protein C3B58_14115 [Lactonifactor 
longoviformis]SHF48019.1 glycerophosphoryl diester phosphodiesterase 
[Lactonifactor longoviformis DSM 17459]
Length=599

 Score = 65.6 bits (156),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 46/508 (9%), Positives = 111/508 (22%), Gaps = 48/508 (9%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               +++    +  A I                     ++  A   +    L      + +
Sbjct  82   FFILFIYVSFVEIAAIILYCDCASEGRKTGVRWLLLHSLKRALAIFQPKNLGMSIVVLLL  141

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                  V +   +      +  F L  I    ++G     + +  L + + + F      
Sbjct  142  MPLTGIVFVSGPLGS--LKIPGFILEFIKGQPLLGFAYFGITLLFLFYFLRWIFSIPEFV  199

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA--------  292
                   +A  KS  L       IF   ++  +    ++  +  I  +            
Sbjct  200  LHPCSFKEARIKSVKLGKRKRIRIFLFVLIGSLFIHVMALGSKAIYVLSLLFWTKFTAGP  259

Query  293  ------------------NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
                               + F+ +   F        Y   +             +    
Sbjct  260  GEAEYAFWFYYYRQSAVGTILFASVQAVFLISVVMAAYYKYQGITIDTVKRKGTWRKKAY  319

Query  335  TAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRL  394
                         L   +     +        + +  R G        L           
Sbjct  320  KCLQVLASFFCVTLYAEMIYPLQNDIFKNENIQIVAHRAGAVFAPENTLAALKEAIRSGA  379

Query  395  SSADYKLLLS--------KQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFP  446
             +A+  +  +                 G+S     +  +     D   HL    E    P
Sbjct  380  DAAEIDVQQTKDGVLVVMHDTDFKRIAGVSQKIWDVTYEECRRYDIGKHLMQGFEGEYLP  439

Query  447  NLSLA--QKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIY  504
             L     +       + ++         + +       +H+         D     R   
Sbjct  440  TLEDMLKEADGKINLMIELKSSGHEQKLEEKTVELIRRYHFESQCTVASMDYSILERVKE  499

Query  505  LRQGTQAEQVHSIL----GKLELTLPLAIESLQLTR--NDIGKTLQIGGKQLILQRLGSN  558
            L    +   + ++      +L      +IE   +T         L        +    S 
Sbjct  500  LDPHLKTVYIAALAYGDMTELNAADAFSIEETFITPQLLAQASVLDKEVYAWTINDEKSM  559

Query  559  --AVTLRFLG--DRTDLLNVHASNSHAE  582
               + ++  G          +  N+  +
Sbjct  560  KRMIKMQVNGIITDNTYYTSYILNTKGQ  587


>PZU50759.1 hypothetical protein DI568_03145 [Sphingomonas sp.]
Length=248

 Score = 63.3 bits (150),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 37/200 (19%), Positives = 67/200 (34%), Gaps = 0/200 (0%)

Query  98   SGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQW  157
                S  Q+   +     R G  L  +    + L        +  +P             
Sbjct  1    MDPISFGQIWPRTVASVRRHGDMLWSLAAAFLFLPQLLFARQMNDRPPEQWFKGELAIGD  60

Query  158  AILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG  217
             + +A V    +    M   + +        L + +K   R   +   +L+L  L  G G
Sbjct  61   GVAVALVVLCSILSQIMMARLLVRDGTGGQPLGQDLKAAFRLFPAALAVLMLQGLATGFG  120

Query  218  SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
              LLI+PGL           ++A D    + AL+ S  L SG  + +FG   ++++  + 
Sbjct  121  LFLLILPGLWIFTRLSLAMPLVATDQPDPINALKTSWALTSGRTFKVFGMIFVIILGFML  180

Query  278  LSFLTARIPYVGEAANLAFS  297
            LS     I       +   +
Sbjct  181  LSVGIMGIGAALGVISTIAA  200


>MBI4668874.1 hypothetical protein [Elusimicrobia bacterium]
Length=135

 Score = 60.6 bits (143),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 30/100 (30%), Positives = 50/100 (50%), Gaps = 0/100 (0%)

Query  197  LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLL  256
            +  +    ++ +    +   GSLLL+IPG+   V + F   +L  ++     AL +S  L
Sbjct  1    MGLMFPLIVVNVCYGALTLLGSLLLVIPGIWLSVKYSFSPILLVVEDQTAFGALGRSSDL  60

Query  257  VSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
            V   W+ +FGR  LL++I     FL +RIP+VG   +   
Sbjct  61   VKDFWFGVFGRLFLLIIIVYLGGFLFSRIPFVGPVISGLI  100


>MBE6284254.1 hypothetical protein [Mediterranea massiliensis]
Length=236

 Score = 62.9 bits (149),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 31/210 (15%), Positives = 63/210 (30%), Gaps = 12/210 (6%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
            + +      +     W L     L  + +       + +  A             + +  
Sbjct  16   NFIKYCFGHISVCLLWPLFRTLALSNIDSNITEEEWMNMMVANGEVWNWVGRISIVGIIV  75

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
                   +      +   +    V +    K        +    I+ +++V  GS L I+
Sbjct  76   FLVSNYLVVVGGRMLHAAVYNERVDMTAEFKNARHTFLFYLGTWIVYLVIVTIGSFLCIL  135

Query  224  PGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            PG+   V F F   +  +   +   +A  +S  +  GH+W + G           L  L 
Sbjct  136  PGIFLAVRFMFAPMIAINHPELTFSEAFTRSWQMTKGHFWKLLG-----------LGILA  184

Query  283  ARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
              I  +G        LL    +++ Y   Y
Sbjct  185  VLINILGIICCCVGYLLTIVITYMMYGYAY  214


>TMD13252.1 hypothetical protein E6J07_08815 [Chloroflexi bacterium]
Length=157

 Score = 61.3 bits (145),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 31/151 (21%), Positives = 58/151 (38%), Gaps = 16/151 (11%)

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
              ++ ++ ++++    L  +  G+ F V +      +  +NIG ++ L +S  LV G WW
Sbjct  6    LVIVFVIGLVLLVLPGLAALCGGVYFAVRWSVSIAAMMAENIGPIRGLGRSWNLVKGMWW  65

Query  263  AIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS----------------LLLTPFSFL  306
              FG  +L ++  + +      +  V  A   A S                 L+ P   +
Sbjct  66   RTFGIILLAVIAYIVIYLALLALFTVVAAIMPAISTDTRSGVATAATTLVDALIAPMFPI  125

Query  307  YYYLIYSDLKANYRGPQHPPIKRQWLPLTAA  337
               L+Y DL+    G     +  Q  P  A 
Sbjct  126  LLTLLYFDLRVRKEGLDLDQLAEQTSPGPAP  156


>HDI11102.1 hypothetical protein [Candidatus Acetothermia bacterium]
Length=219

 Score = 62.9 bits (149),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 36/218 (17%), Positives = 66/218 (30%), Gaps = 14/218 (6%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            + G YLL   +     +    L               A+++  +   ++           
Sbjct  1    MAGFYLLLAAMGGGITYVLGPLGIPEVGEELPAWAIAALVVLNLTVGVMLYLGFYYYTLK  60

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +      +   +    R   +   + +L  L V GG  LLI+PG++  + FF+    L 
Sbjct  61   LVRGRTPSILGDLLCPFRRPLAVIGVAVLYALAVAGGLALLIVPGIVVALAFFYAGVALI  120

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGR--------------FVLLLVISLTLSFLTARIP  286
            D N+G L+AL +S  L  GH   +                  +L   +            
Sbjct  121  DRNLGVLEALRESSRLTRGHRPTLLLMLLFFTLVDAALKLSVLLFFALVPLTPPPDTTTG  180

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
             V +  +L         SF+  Y    +          
Sbjct  181  LVADLLSLFVITPWEAASFMCAYEALLEEFHRPEVHAP  218


>MBA3531544.1 hypothetical protein [Ardenticatenales bacterium]
Length=287

 Score = 63.6 bits (151),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 31/230 (13%), Positives = 73/230 (32%), Gaps = 27/230 (12%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L+ +  L  +   + +  +  L  +++      +  +++ +      LL           
Sbjct  56   LISLPSLSFMADPSILEESPGLFFSSFGLSWVGSVIFSLGITYFIMPLLAAGIGVAMQGF  115

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV------WFFF  234
             +    VGL   +     ++       ++ ++      +   +  +   +       F+ 
Sbjct  116  LLEGRRVGLLEMITGMRNNIRPMATTGLVAMIFSFASFITFPLVPVWAAIVTLFTYAFYL  175

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI---------  285
              +V   +   GL+AL +  +L+ G  W   G  VL    SL    +   +         
Sbjct  176  SLFVTLYEKRAGLEALRRGWILIRGSVWRALGYLVLFYFFSLVFGGIVGGVMGASLLALA  235

Query  286  --------PYVGEAANLAF----SLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                    P++            ++L  P  F    L+Y DL+  + G  
Sbjct  236  PLIDTADSPFLSLILQSLAQGFTNVLTLPLLFGTTALLYFDLRIRHEGLD  285


>MBI5170267.1 hypothetical protein [Candidatus Eisenbacteria bacterium]
Length=244

 Score = 63.3 bits (150),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 30/173 (17%), Positives = 69/173 (40%), Gaps = 4/173 (2%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             +   +   +    ++ +   +  V     ++  L  +       +L+ L+   G+LLLI
Sbjct  71   GMLLSISTYAAFARAVELRTREETVDSGEVLRSALETLFPVAATSLLVGLIAFVGTLLLI  130

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            +PG++     +        + +   +AL++SR L  G+   +FG  ++  +  L +  +T
Sbjct  131  VPGIMATCALYVAIPACVLERLSAPEALQRSRSLTKGYRMTLFGMILVAGLAMLPIIGIT  190

Query  283  ARIPYVGEAANLAF----SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
              +     A         S +LTPF+     +I+  L++         I  ++
Sbjct  191  IAVGAAAPALAFPLLVLQSTVLTPFTSAMAAVIFLRLRSAKGLDPVGVIAEKF  243


>MBE6030738.1 DUF975 family protein [Clostridiales bacterium]
Length=342

 Score = 64.4 bits (153),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 33/292 (11%), Positives = 75/292 (26%), Gaps = 6/292 (2%)

Query  129  IVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG  188
             +           +                        +   LS+   S  +   +    
Sbjct  26   WMFVIIGSIIYTAIAELPIYFLNFAFSSDFGSSLYTLLVSGALSFGYASFLLSFFRKTGA  85

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN-IGGL  247
             +  +  G         L++ + +     SLL IIPG++    +     ++ D+  +   
Sbjct  86   DYGQLFSGFERFAKLLGLMLYISIFTALWSLLFIIPGVIASYRYSKAFMIMVDNPSLSAR  145

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV--GEAANLAFSLLLTPFSF  305
            +AL +S+ L+ G+    F   +  +         ++ I  +      +     +      
Sbjct  146  EALNESKRLMRGNVMKNFCLDLSFIGWQFLALIPSSIIASLLGASIVSSVGGAMGDVAGM  205

Query  306  LYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSA  365
                LI + L  +        +    L L A           L   ++ +   A      
Sbjct  206  GAEALIQTSLTGSKLLISELVLFVANLGLVAVSAYIQTANVALYDMMTGRLKEAPASGEW  265

Query  366  GKDIQQRLGTQP---QQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGG  414
              D                 +        +     D K    + +  T+E  
Sbjct  266  PLDPVSPPPAPTPGIFSQEAVYTRPENPEEYNYIPDMKEPTKQAQGETAEPA  317


>MBA3630351.1 hypothetical protein [Actinobacteria bacterium]
Length=113

 Score = 60.2 bits (142),  Expect = 3e-08, Method: Composition-based stats.
 Identities = 26/104 (25%), Positives = 45/104 (43%), Gaps = 9/104 (9%)

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
             G +LLIIPGL+   W+     V+  + +G   +  +SR LV G    +FG  VL  ++ 
Sbjct  2    IGMVLLIIPGLVLLTWWAVIVPVIVLERVGAFDSFSRSRELVRGWDLKVFGVIVLEALLI  61

Query  276  LTLSFLTARIP---------YVGEAANLAFSLLLTPFSFLYYYL  310
            + +S +   I          +V    + + +  L+       YL
Sbjct  62   IAVSLILGLIMAPLPDGPQTFVSNVLSGSLTGPLSALITTLLYL  105


>RLF61093.1 hypothetical protein DRN25_01195 [Thermoplasmata archaeon]
Length=282

 Score = 63.6 bits (151),  Expect = 4e-08, Method: Composition-based stats.
 Identities = 30/189 (16%), Positives = 62/189 (33%), Gaps = 4/189 (2%)

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
               F       L               +   +A     L+         +  +      L
Sbjct  34   FSYFMKGIRPSLFFLNPAHLGAFFGAVFLFSIAIGILGLIASGVTITMCYDVLSNGKTSL  93

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQA  249
             R  +  +  +    +  IL+ ++V  G +L IIPGLL  ++  F   ++  D+     A
Sbjct  94   GRGFEKVMEKLLDIVVAAILMGIIVVVGFILFIIPGLLAMLFLMFTLVIVIVDDASASDA  153

Query  250  LEKSRLLVSGHWWAIFGRFVL---LLVISLTLSFLTARIPYVGEAANL-AFSLLLTPFSF  305
            + KS + V  +   +    ++   + +I   +  +  +IP +G        S   T +  
Sbjct  154  IRKSYMKVKENIGNVLIFIIVAFIVFLIVGIIGKIIEKIPLIGMILLSPIISGATTAYLN  213

Query  306  LYYYLIYSD  314
                + Y  
Sbjct  214  AALTIFYLH  222


>HHH55202.1 hypothetical protein [Bacteroidetes bacterium]
Length=287

 Score = 63.6 bits (151),  Expect = 4e-08, Method: Composition-based stats.
 Identities = 35/200 (18%), Positives = 69/200 (35%), Gaps = 0/200 (0%)

Query  97   GSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQ  156
                +   + L           + LL I         +     L     + L+       
Sbjct  21   FIFFKMNFKPLFKRIWKLILPYFILLFISYSFFSFTISRTGVNLAAVFGSNLDFSYFLSF  80

Query  157  WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG  216
            +  +  T+ Y+ +    +   +  Y+    +      K   R   S T L  L++L+   
Sbjct  81   FIYIFITIIYVSILQLSVLNYIKEYVNNVSIEESELKKRVYRRFFSMTGLNFLIVLISFL  140

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
            G LL  IPG+   V   F   +   +N     ++ +S  L+  +WW  F   V+L++I+ 
Sbjct  141  GFLLFFIPGVYMIVVLVFAPVIFIFENKDLRSSISESFTLIKNNWWTTFTSLVILVLITF  200

Query  277  TLSFLTARIPYVGEAANLAF  296
             ++ +   I Y+        
Sbjct  201  VINMVLGIIFYIYSMIKAFV  220


>WP_073559828.1 hypothetical protein [Archangium sp. Cb G35]OJT25795.1 hypothetical 
protein BO221_08055 [Archangium sp. Cb G35]
Length=235

 Score = 62.9 bits (149),  Expect = 4e-08, Method: Composition-based stats.
 Identities = 29/148 (20%), Positives = 63/148 (43%), Gaps = 0/148 (0%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
                  +   +++ +    + ++ + L  S+++GL       L+ +L  + V  G+L  I
Sbjct  64   ACLSSFVTAVFLSWTAQNLLPRSGLTLRSSLQVGLIRFLPLLLMSLLFGITVVSGTLCFI  123

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            +PGL F V       ++  +  G L++L+ S  L  GH   +F  F ++ ++ + L+   
Sbjct  124  LPGLYFVVCLSLAPALVVIEGYGPLESLQHSYRLTRGHRLTLFVVFAVMFLLQVGLALAG  183

Query  283  ARIPYVGEAANLAFSLLLTPFSFLYYYL  310
              +     +      L +  F  L+  L
Sbjct  184  QLVLGFLPSLGAPSWLSVELFFPLWQAL  211


>WP_132698640.1 hypothetical protein [Reinekea marinisedimentorum]TCS43687.1 
hypothetical protein BCF53_10129 [Reinekea marinisedimentorum]
Length=221

 Score = 62.5 bits (148),  Expect = 4e-08, Method: Composition-based stats.
 Identities = 28/206 (14%), Positives = 72/206 (35%), Gaps = 7/206 (3%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                  +    + +         +LL       NP++    +  +     +  +    + 
Sbjct  12   FYLNHFIRFAQIVLPFTIPLGLFSLLYDKFLLTNPESALQVYFPMAVAFLFRPVYQLALL  71

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
             S+   +      +    ++G        ++ +L    V  G ++ +IPGL   + F F 
Sbjct  72   QSISQSLQHNYPTVGSLWQMGWSKWAPMFIVSLLYTAAVFSGMMMFVIPGLYLAIKFCFA  131

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF-------LTARIPYV  288
            +  +A +    ++A+++S    +G    + G F+L+ ++ +  S              ++
Sbjct  132  EIFVAVEGCDPVEAMKRSWKATTGRLLPLTGGFILISLLLIVPSMQIANYVTTAGIPDFI  191

Query  289  GEAANLAFSLLLTPFSFLYYYLIYSD  314
            G         LL  F  ++ Y ++  
Sbjct  192  GRFIPSVIFSLLGVFYTVFTYRVFDQ  217


>MBA2301696.1 hypothetical protein [Acidobacteria bacterium]
Length=243

 Score = 62.9 bits (149),  Expect = 4e-08, Method: Composition-based stats.
 Identities = 31/229 (14%), Positives = 64/229 (28%), Gaps = 23/229 (10%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
                   G + +Y +   +A      A+              +           +LL L 
Sbjct  4    WLFAGMMGAMAVYWVVYTVALGATTFAVSEIYVGRTVTIPYVYGRMRGRVGALVLLLLLI  63

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
             +       +    +GL   +       G    +     L      L L+   +L  + +
Sbjct  64   ALRLGALCLLGAMAIGLSMGLGRLGGIAGPIVAV-----LATLLIGLALVGVVMLMMLRY  118

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR--------  284
                  L  + +   +A+++S  L  G    +F   +   +++     L           
Sbjct  119  GVAVPALVLEGLSPGRAIQRSVDLTRGRLGRVFLLVLCSTLVTYAALMLFQGPFIGLALY  178

Query  285  ----------IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                      +  +G  +    + L TPF  +   LIY D +    G  
Sbjct  179  VGFETAQGSWLNIIGAVSGTIGATLTTPFMIIGLALIYYDARIREEGFD  227


>WP_055953470.1 hypothetical protein [Curtobacterium sp. Leaf261]
Length=479

 Score = 64.8 bits (154),  Expect = 4e-08, Method: Composition-based stats.
 Identities = 35/275 (13%), Positives = 64/275 (23%), Gaps = 42/275 (15%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            + G  LL  +      V F      +A + +G + A   S  LV G +W  FG  +L+ V
Sbjct  212  LFGVGLLFGVLAAWLAVKFSLVVPGIALEGLGPIAAARNSWRLVRGAYWRTFGIELLVRV  271

Query  274  ISLTLSFLTAR-IPYVGEAANLAFS-------------------------------LLLT  301
            +      + +  +  V   A+  FS                                +  
Sbjct  272  MFNVAVSIASIPLSIVLAFASGVFSPLGARGSGATGFSPVSVIVIVVGGALAVAVTAITD  331

Query  302  PFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQ  361
              +     L+Y D +    G               A      IP     +++      + 
Sbjct  332  VVAAATTTLLYIDRRFRTEGLDGR----------IASSLETGIPADPFATVAPAERRPDP  381

Query  362  LLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVT  421
               A     +                P  P +  +        +Q            P  
Sbjct  382  WGGASGPRGRTPAGPGAGPQGQWGPGPWGPGQQQAPGPWGPAQQQAPGPWGPAQQQAPGP  441

Query  422  LFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSA  456
             +         +            P  +     S 
Sbjct  442  QWGAAPQWGPGSQTRPGSPRDDGDPRQTPPGPPSE  476


>WP_060992333.1 hypothetical protein [Aliivibrio sifiae]PQJ84747.1 hypothetical 
protein BTO22_14690 [Aliivibrio sifiae]PQJ87224.1 hypothetical 
protein BTO23_13945 [Aliivibrio sifiae]
Length=226

 Score = 62.5 bits (148),  Expect = 4e-08, Method: Composition-based stats.
 Identities = 26/201 (13%), Positives = 61/201 (30%), Gaps = 4/201 (2%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
               +     + +   +   P          ++            +L  +         + 
Sbjct  9    MDSFNYFKKHFIAFCILVLPFALITNGIALSFNEEDGSGKFLIYMLFILTIYPFYKGAIL  68

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
              +        V   +  ++      SF L+ I+L   V  G + L++PGL     F F 
Sbjct  69   YYIAYSFDGRRVPFSQLYQIPASTWFSFVLMNIILGAAVLTGFIALVLPGLYLMARFSFT  128

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL----VISLTLSFLTARIPYVGEA  291
            +          L A++        ++W +F   V++      +   + +  + I     A
Sbjct  129  EIYCVLYKEKTLDAIKLGWHETKDNYWILFKGLVIIFGFTTGLMWLIEYAFSLIGLKSAA  188

Query  292  ANLAFSLLLTPFSFLYYYLIY  312
             +  FS+     S +    ++
Sbjct  189  LSFFFSVCEVVLSMMNTIFMF  209


>WP_114585485.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Haloplanus rubicundus]AXG06346.1 hypothetical 
protein DU500_07845 [Haloplanus rubicundus]
Length=120

 Score = 59.8 bits (141),  Expect = 5e-08, Method: Composition-based stats.
 Identities = 20/92 (22%), Positives = 37/92 (40%), Gaps = 0/92 (0%)

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
              +  +L  L +  G  LL +PGL       F  + +  ++ G ++AL++S  L  G+  
Sbjct  1    MLISGVLTFLAIMIGFALLFLPGLFLAACLLFVIFTVEVEDRGVIEALKRSWTLSKGNRL  60

Query  263  AIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
             +     L  VI   +  + +     G  A  
Sbjct  61   RLMVIVFLTGVIGAIVGAVPSLFQLAGATAMG  92


>WP_086784362.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Streptomyces scabiei]
Length=299

 Score = 63.6 bits (151),  Expect = 5e-08, Method: Composition-based stats.
 Identities = 21/143 (15%), Positives = 43/143 (30%), Gaps = 28/143 (20%)

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
            ++  +   G L   +  +   + F      L  +  G  +++ +S  LV G WW + G  
Sbjct  135  VVPAMAFFGGLGAFVLTVWLMIRFSLASPALMLEKQGVRKSMSRSVKLVRGSWWRVLGIQ  194

Query  269  VLLLVISLTLSFLTARIP----------------------------YVGEAANLAFSLLL  300
            +L  +I+  ++ +                                  +     +  S L 
Sbjct  195  LLATIIAWIVASIVVIPFSFAGAALDGDGLSGFLATGGTALGWTYLVISGVGAVIGSTLT  254

Query  301  TPFSFLYYYLIYSDLKANYRGPQ  323
             P S     L+Y D +       
Sbjct  255  FPISAGVTVLLYIDQRIRREALD  277


>MBI2900576.1 protein kinase [Planctomycetes bacterium]
Length=533

 Score = 64.8 bits (154),  Expect = 5e-08, Method: Composition-based stats.
 Identities = 30/216 (14%), Positives = 67/216 (31%), Gaps = 4/216 (2%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                L+ +     +         +       L            +  +         ++ 
Sbjct  318  MNQILVPVGGRFDIGTCLSHALTVWKNNWFMLAVAALISSMLSFMTLMILAGPMTGGLSV  377

Query  177  SMFIYICKTDVGLFRSM-KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
                 + +    +   +   G R  GS   L  L + ++    +LLI+PG+   + + + 
Sbjct  378  LYLDALQRPGQKIRMDLLFAGFRRFGSLVGLFFLQLAILIPAFVLLIVPGIYLSIRYMYA  437

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
             Y++ D  +G + AL+ S  +       + GR   L ++ +        IPY G      
Sbjct  438  YYLVVDRGLGPVAALKGSWRMT---ASPLLGRHFGLQLMQILFDNGPTVIPYAGFVIAFF  494

Query  296  FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
               +        Y ++ S+           P++R  
Sbjct  495  VGAIGRLMVVHAYTVVRSERAKEIEPLGGVPVRRAM  530


>HCR55653.1 hypothetical protein [Candidatus Saccharibacteria bacterium]
Length=242

 Score = 62.5 bits (148),  Expect = 5e-08, Method: Composition-based stats.
 Identities = 30/155 (19%), Positives = 53/155 (34%), Gaps = 0/155 (0%)

Query  137  FSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLG  196
            F +   + +             IL   +   L+       +           L  + K  
Sbjct  66   FESFFNEMSVSDWVVISLIGGTILFVLILIGLVIGGIADYTASKLASDEGTTLGEAFKEV  125

Query  197  LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLL  256
            ++HV S+  L +L+ + V   +LL IIPG+   + +         + + G  A+++S  L
Sbjct  126  MKHVASYLWLSVLIFVKVFLWTLLFIIPGIYMAIRYSLAGTAFFKEGLRGNAAIKRSLAL  185

Query  257  VSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
              G W   F    L  VI+  L          G  
Sbjct  186  TKGAWLTTFASHTLWNVITFGLMDAVLSPGTNGVL  220


>HAE32562.1 hypothetical protein [Dehalococcoidia bacterium]
Length=189

 Score = 61.7 bits (146),  Expect = 5e-08, Method: Composition-based stats.
 Identities = 29/177 (16%), Positives = 67/177 (38%), Gaps = 15/177 (8%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            +++      +L  + +      +I    V     +   L        + I L +++    
Sbjct  2    LIILVWITSILSTAAIIFGAAQFIQDGKVSQVLCIDYALSCSIKLIGVSIALPILLIIPL  61

Query  219  LLL-----IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            LL      I   +   + + F    +  ++ G + +L++S LLV+G WW  FG  + +LV
Sbjct  62   LLSFILIGIPLLVFLLIRWNFAVCAVVLEDKGVINSLKRSWLLVTGKWWITFGTVLAVLV  121

Query  274  ISLTLSFLTARIP----------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
            + +  S +   I            V        ++++ PF+ +   + +  L+ +  
Sbjct  122  LIVIPSAVLGLINTVISSFFENFLVSHILEGITTVIIIPFASIATGIYFLGLRMSKE  178


>NLL64538.1 hypothetical protein [Clostridiaceae bacterium]
Length=359

 Score = 64.0 bits (152),  Expect = 5e-08, Method: Composition-based stats.
 Identities = 29/281 (10%), Positives = 84/281 (30%), Gaps = 4/281 (1%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             +            F++          P  +     + +  + ++      +  +     
Sbjct  69   ALLQFLNDQIQLIGFNSTTESLTPSDIPLTKGNLIFLSVQILLFLFNIFMGLLYAGAYTS  128

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
             +        +K  LR +    LL+++LI+     S L+ +P ++      F  ++ ++ 
Sbjct  129  ERLKQPAALGIKAMLRSLPKLILLVLMLIIPTILSSFLMWLPIIVILCGLSFMPFLFSEK  188

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI----PYVGEAANLAFSL  298
             +   +A+++S  L   H  +IF  F  L  I   ++ +   I     ++ +      + 
Sbjct  189  RMSFFEAVKRSWTLTRTHKLSIFLSFFFLNSIKRLITMVVGYIAPDQMFILQVLMAFLTT  248

Query  299  LLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLS  358
            +          L+Y          +   +    +          L   + +  +   +  
Sbjct  249  IFALMKGRLLGLLYVFYTRQVTVLKSGFMVMTDMKDIFNDTLTPLPDDVFIPEIRNLSRY  308

Query  359  AEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADY  399
             E    +  D   +L          ++    +    +  + 
Sbjct  309  KENSKPSQSDSPFQLNDDHSIRGGSSQQDRPDASLPNEQEP  349


>WP_053241145.1 hypothetical protein [Clostridium sp. DMHC 10]KOF56028.1 hypothetical 
protein AGR56_03345 [Clostridium sp. DMHC 10]
Length=235

 Score = 62.5 bits (148),  Expect = 6e-08, Method: Composition-based stats.
 Identities = 38/191 (20%), Positives = 74/191 (39%), Gaps = 4/191 (2%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
            +  ++          +   +  +         + L  +  I + LS  T  +      T 
Sbjct  36   VFFIICLFSGLITGSVTILSGFSIGEVLLCIIVYLVMIFAICIPLSVGTYYIARNYQDTH  95

Query  187  VGLFRSMKLGLRHVGSF--TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-N  243
              +F  + +  +         L +L+ +++  G +LL+IPG++    F F   V+ D+  
Sbjct  96   EIVFSDILVCFKKNMVLKSIGLSLLMTIILIVGYILLVIPGVILTYMFIFAFIVMIDNPK  155

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
            +G L+ +  S  L  G+   IFG  +LL +I   +  L      VG        L+++ F
Sbjct  156  LGILEVISLSAKLSKGYKLKIFGYNILLGIIPALVYVLLRG-SIVGILIYFIICLVISAF  214

Query  304  SFLYYYLIYSD  314
            + L     Y D
Sbjct  215  NLLGLGFFYMD  225


>NIO16973.1 hypothetical protein [Deltaproteobacteria bacterium]NIS77687.1 
hypothetical protein [Deltaproteobacteria bacterium]
Length=314

 Score = 63.3 bits (150),  Expect = 6e-08, Method: Composition-based stats.
 Identities = 51/316 (16%), Positives = 83/316 (26%), Gaps = 15/316 (5%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIAT----CPHC  58
             +RCP C AE  T +       +          +      +    +              
Sbjct  1    MIRCPGCQAEFETGAKFCIHCGADLSGEFLRNPVCPKCGRTYPEGSNYCDDDGSRLVEEE  60

Query  59   GLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRG  118
             L  R          +T  C            R+   S   +            +     
Sbjct  61   MLVPRCVVCGRTFSEETRFCPDDGGVVIAGALRKKSPSPFTIGKKVSDHEWEGLVGREYR  120

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
                     G  L        +     T        +   +     A I   L      M
Sbjct  121  VKRHEYLSRGWELFKENAGGFIGFSALTIFVNLALQFLPMLGFLVRAAIYAPLIGGFYIM  180

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
               + K     F    LG        L  +L       G +LL++PG+   V + F   +
Sbjct  181  AFKMIKRQRTEFADFFLGFGFFLPLLLGGLLTGFFTIVGLVLLVVPGIYLAVSYVFTIPL  240

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
            + D  I   QA+E SR  V+ +W+++F    +L             +   G        L
Sbjct  241  IVDREIDFWQAMEISRKFVTKNWFSLFLFLFVLF-----------LVNIGGALLFGVGLL  289

Query  299  LLTPFSFLYYYLIYSD  314
            +  PF+F      Y D
Sbjct  290  VTIPFTFCSLAAAYDD  305


>MBI5135168.1 hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=300

 Score = 63.3 bits (150),  Expect = 6e-08, Method: Composition-based stats.
 Identities = 43/185 (23%), Positives = 77/185 (42%), Gaps = 4/185 (2%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W  F  +   LLGI L   V+          ++  +  +    +    I       I L 
Sbjct  70   WGQFMMQWKTLLGISLFPTVVGIVLGIPLFAIEFISGFSESPDSGMQTISGFIELAIRLI  129

Query  171  LSWMTGSMFIYI----CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
            ++ +     I +        +G  +++K   R + ++  + +L+ L    G+LL IIPG+
Sbjct  130  VAGIGIWAQIALIMASVNPAMGARQALKQSYRLLPAYAWIALLVSLGSIAGTLLFIIPGV  189

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            +  VW+ F  YVL  +++ G +AL +SR  V G WW + GR +   V+ +    L     
Sbjct  190  IVSVWWMFSHYVLVGEHLRGTKALGRSRAYVRGRWWKVVGRMIGFGVLMVAPILLVIAFG  249

Query  287  YVGEA  291
                 
Sbjct  250  GWLYF  254


>MAO63522.1 hypothetical protein [Balneola sp.]
Length=274

 Score = 62.9 bits (149),  Expect = 6e-08, Method: Composition-based stats.
 Identities = 33/199 (17%), Positives = 69/199 (35%), Gaps = 2/199 (1%)

Query  100  LRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAI  159
             + I        +           I  L +  +++   +A++  P               
Sbjct  20   FQYIRVHWKSLGKALLLLVLPFYLISGLLVGDSYSNFLTAVMENPNADPGTLFTGNFLFG  79

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
            LL         L+     + I     +V L +       +     L+ +L+IL +   + 
Sbjct  80   LLLLAFSSGALLTASLTHVQIARDHGEVQLSQITDRFGGNFLKLFLIYVLIILAIFFSAF  139

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            L IIP +   V  F        +++  + A+ +S  L +GHWW  F  ++++ +IS  +S
Sbjct  140  LFIIPAIYIGVKLFLAPATSILEDLNPIDAITRSWSLTTGHWWFTFAVYLVMNIISSFMS  199

Query  280  FLTARIPYVGEAANLAFSL  298
            ++       G   +     
Sbjct  200  YILIIPM--GIVISFVGMA  216


>WP_182141828.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Schaalia sp. JY-X169]
Length=208

 Score = 61.7 bits (146),  Expect = 6e-08, Method: Composition-based stats.
 Identities = 29/180 (16%), Positives = 61/180 (34%), Gaps = 17/180 (9%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
                   +  L  + +   + + +    + +   + + +    S +   +  +  +    
Sbjct  12   WEALKSHFWRLVGTAVLVWLIVGVVAAVIFIVPIVLIAVVAFASSSGDSLWFLSFLLLII  71

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS---  275
             L I   +   V  +F   +   +      AL +S  L  G +W I GR +L+ +I    
Sbjct  72   PLGIAVTVWVSVRLYFATLIAVVEGATPPTALRRSWALTKGAFWRILGRMLLMSIIVSIV  131

Query  276  ---------LTLSFLTARIP-----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
                       + F T+ +P     ++    +   S L  PFS  Y  L+Y D +     
Sbjct  132  VGLLGGTISAVIMFATSVLPWAVTAFLLALVSALISGLAMPFSASYTSLMYVDERVRKEN  191


>MBI4366411.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=303

 Score = 63.3 bits (150),  Expect = 6e-08, Method: Composition-based stats.
 Identities = 38/250 (15%), Positives = 72/250 (29%), Gaps = 6/250 (2%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
             +       +  ++  I            L+ +  +        +  ++     +     
Sbjct  49   PYLFWMVHFIRTHVPRIPAHATLPQIWTALQGSVGIAVVGFVGLFFGIIFVGNLVQCWGE  108

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
                    +                R         I   LV  GG +  ++PG++  +  
Sbjct  109  ATLALASAHGTARRADACTVFLQAWRKAWPLFWTRIATSLVAFGGMVWCVVPGIVCAIAL  168

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP------  286
                Y+   +++   QAL  S     GH W I GR++LLL +   L+     +       
Sbjct  169  SMAWYIRVLEDVPLSQALAASWERSRGHRWGILGRYLLLLCVVFALAIALQILNLFPLAF  228

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPG  346
             V    +L F   +     +  YLIY DL+        P      + +    F       
Sbjct  229  LVLLPVSLLFQFSIPVMYGVAGYLIYLDLRPADTSAAPPVATPARISIWMPFFAGSAFLP  288

Query  347  LLLVSLSRQN  356
            +L V      
Sbjct  289  ILAVVFFLFW  298


>MBE6812653.1 DUF975 family protein [Ruminococcaceae bacterium]
Length=258

 Score = 62.5 bits (148),  Expect = 6e-08, Method: Composition-based stats.
 Identities = 45/227 (20%), Positives = 76/227 (33%), Gaps = 8/227 (4%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT---LLLILLILVVGGGS  218
                +   G       +F+ I +   G    + +G + + ++T   LL I+  L +  G+
Sbjct  37   LLSPFAQGGFLIGKNKVFLDIARQGSGKLEDIFVGFKSINTYTKGLLLHIIKSLYLAVGT  96

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
             LLI+PG++  + F F + ++AD+NI  L+ALEKSR L       +F   +     S   
Sbjct  97   ALLIVPGIILRLRFLFAELIMADENITALEALEKSRYLTKNRMDEVFTFEL-----SFLP  151

Query  279  SFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAI  338
             FL   IP +G      + +     +   YY    D        +         P    +
Sbjct  152  WFLLGMIPILGWVLMAVYIIPYYKIALAAYYSTLKDEAIRTAERKANFRYPGGTPTRKRL  211

Query  339  FGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNR  385
                                            Q    Q  Q  D  +
Sbjct  212  DAGAAAAQQNFAQQQAYAQQQAYQQQQAYYGYQTAQQQSVQPQDEQQ  258


>RKY65317.1 hypothetical protein DRQ08_06045 [Candidatus Latescibacteria 
bacterium]
Length=218

 Score = 62.1 bits (147),  Expect = 6e-08, Method: Composition-based stats.
 Identities = 35/193 (18%), Positives = 69/193 (36%), Gaps = 2/193 (1%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              ++        A I+    +  +   N         +         L  + +  +    
Sbjct  19   YRLHFSAFFGPVAMIYLPYYIVLSLLRNLPPGPAIVWMGTLGTLAQALATASVVWTTAQV  78

Query  182  ICKTDVGLFRSMKLG-LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                 +G+  ++    +  VG     L+  + ++  G ++ ++PGLLF VWF     V  
Sbjct  79   REGKQIGIGEALGAISVGVVGRLLGALVPALFLITLGIMIFVVPGLLFLVWFALAGQVAV  138

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI-SLTLSFLTARIPYVGEAANLAFSLL  299
             +      AL +SR LV G  W +F   +L + +  L ++     +P           LL
Sbjct  139  LEGRTFWSALRRSRELVKGRGWRVFYLLILFICLNFLLVALPLTLLPRFAAPLGQVLGLL  198

Query  300  LTPFSFLYYYLIY  312
              P+  +   L+Y
Sbjct  199  FLPYPLIVMTLLY  211


>HGT34381.1 hypothetical protein [bacterium]
Length=401

 Score = 64.0 bits (152),  Expect = 6e-08, Method: Composition-based stats.
 Identities = 49/341 (14%), Positives = 113/341 (33%), Gaps = 20/341 (6%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
            ++    + L+G+  + ++         LLL    +   +N   +  + +      L+ + 
Sbjct  36   IYKDWFFKLIGMQAVALLGVLPLTIVLLLLLVPVFTFQENAPVRMIMFVFLGLSGLISII  95

Query  173  WMTGSMFIYICKTDVGLFRSM----------KLGLRHVGSFTLLLILLILVVGGGSLLLI  222
            +M            + +   M           +  R      L+ + + L V   +LLLI
Sbjct  96   FMIYISITAQAGIMITIKNIMAGNAKSIKDNFIEARTYTIKYLVNLCVFLFVLLWALLLI  155

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            +PG++F + +    + L  +  G   AL++SR L++G+ + +F +++ L  + L ++ + 
Sbjct  156  VPGIIFAILYSLAGWALIVEGYGSTSALKRSRELINGYGFEVFLKYLALFFMWLVIAIIF  215

Query  283  ARIPYVGEAANLAFSL----------LLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWL  332
            A    +G        L          +        Y+L  +        P     K    
Sbjct  216  AIPGILGVNEAALVGLRILERIISFIIAPIPIIFTYFLFLNLQSIKADIPSKIKRKEGGG  275

Query  333  PLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQ  392
                A    + I  +++ +L+  +L++ ++ S    I   +            +    P+
Sbjct  276  GAVVAAVAVIFIILMIIPTLAIVSLNSARVKSRDAKISATVAQIQTALEIHYNNFGSYPE  335

Query  393  RLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQN  433
             L S +              G         +       D  
Sbjct  336  NLYSVESLQPTDLVYPQPVNGDCPKDSKYDYRQTADGQDYE  376


>NOX30874.1 hypothetical protein [Actinobacteria bacterium]
Length=290

 Score = 62.9 bits (149),  Expect = 6e-08, Method: Composition-based stats.
 Identities = 23/176 (13%), Positives = 57/176 (32%), Gaps = 2/176 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +L + +         +F+   L      + ++       L+     + + +  +T  +  
Sbjct  65   VLSVVMSMRHSTDGSLFTDPQLIFQLGQSDRSTTQAILALVLGSLSLAILIHALTRLVLN  124

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +       +R +    R    +    +L  + +G G LL  +  L     F+    + A
Sbjct  125  DLVGRVESPWRILLRTFRSFHQWVGTWLLGRIALGLGLLLCGVGILWPWAAFWVLIPIAA  184

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
             + +  + A+ +   L  G    I G  +LL + +          P   +  +   
Sbjct  185  SERLTPIAAMRRCLELTKGARGRILGVALLLEITTALAQVAFFLFP--AQIVSGLV  238


>MBA3901750.1 hypothetical protein [Bacteroidetes bacterium]
Length=287

 Score = 62.9 bits (149),  Expect = 7e-08, Method: Composition-based stats.
 Identities = 32/192 (17%), Positives = 70/192 (36%), Gaps = 2/192 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            LL    + ++       + L    +   N          L A        ++     ++ 
Sbjct  38   LLFAAPIFVLSGVLGAVNQLDGMASFSGNVMFTLLITYFLYAVGTVFSAIIAGKHILIYQ  97

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
               + D+     M           +   +  L++  GS+LL IPG+   +    C  +  
Sbjct  98   KEGRIDISTEEMMTFFKNDFFKIFITSFIYYLIILVGSVLLFIPGIYLAIALSLCMMIRI  157

Query  241  DD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
             +       + ++S  L+ G+WWA FG + ++ +IS   + +   IPY+   A +  +  
Sbjct  158  LNPEAELSFSFKESMRLIKGNWWASFGFYFIIGIISYVFALVF-MIPYLASLAFIGITSP  216

Query  300  LTPFSFLYYYLI  311
                +  + +L+
Sbjct  217  AGVANEGFRFLV  228


>WP_187590111.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Gordonia sp. OPL2]
Length=338

 Score = 63.3 bits (150),  Expect = 8e-08, Method: Composition-based stats.
 Identities = 30/275 (11%), Positives = 71/275 (26%), Gaps = 49/275 (18%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            W    +    I L    + SA         +    +    +   +               
Sbjct  63   WAAFLVVAGAIGLCGLAVISATDGWGTDAGDAILISGVLLLSTLSGVVTAALCGMFAYPA  122

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC---------  229
                      +  + +     +     L +L++LV    ++ L +  +            
Sbjct  123  NQQAVGRTPTVGETWRRTRSRLAPLFGLYVLVLLVGAAAAIPLFVLSVWSFTLGGGLILV  182

Query  230  ----------------VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
                            V   FC   +  +  G  +A+ +S  L +G +W  F    L++V
Sbjct  183  GLLLLAVIAAGATWLSVRLAFCLPAVVVEGCGVTEAIRRSFGLTAGRFWRTFAVLFLVIV  242

Query  274  ISLTLSFLTAR------------------------IPYVGEAANLAFSLLLTPFSFLYYY  309
            ++   S + +                         +  +    ++  +L+  PF  L   
Sbjct  243  LTSIASAVISGAFQIAGGVVGMAGSTGDIGLLTPVVLVLPLLGSVLATLVTQPFIALTVA  302

Query  310  LIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLI  344
            L++ D +    G      +              + 
Sbjct  303  LLHVDARIRSEGYDLVLAQGASEVAQGGHRDGWIP  337


>HIG36735.1 hypothetical protein [Oceanospirillaceae bacterium]
Length=193

 Score = 60.9 bits (144),  Expect = 8e-08, Method: Composition-based stats.
 Identities = 27/179 (15%), Positives = 60/179 (34%), Gaps = 0/179 (0%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
            +     +   R          L +V               T  +P         +   + 
Sbjct  1    MFTICMKDTVRFFLTHWWKLALIVVPLNLVGELIRGAFSPTATDPLQTGSFGLYITVVIL  60

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
              ++        +   I    + + +S  L +     + +L +L  L V  G L+ IIPG
Sbjct  61   VSIIASVATIHYIDSSIKAAPLSVTQSWLLAVTKFSGYLVLSLLSFLAVATGLLVFIIPG  120

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            ++         Y+   +N    +++++S  L  GH+  +F   +L  +    + ++ A 
Sbjct  121  IVVMARISLASYIYLLENRSANESIKESWALTKGHFGDLFFGTILFSIPGGVVGYMVAG  179


>MBI4492985.1 hypothetical protein [Chloroflexi bacterium]
Length=262

 Score = 62.5 bits (148),  Expect = 8e-08, Method: Composition-based stats.
 Identities = 25/123 (20%), Positives = 44/123 (36%), Gaps = 18/123 (15%)

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV---------------L  270
            +   V +      LA + IG + +L +S  L    WW  F   +               +
Sbjct  135  IFLGVRWSVSWIALALEGIGPIASLRRSWSLTRAAWWHTFVVILAAGAIGSILGIVAAAV  194

Query  271  LLVISLTLSFLTARIPYV---GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
              ++   LSF+      +       +    +LLTPF    + ++Y +L+A   G      
Sbjct  195  FGIVGGILSFVAGSPAILELFSTLGSAVVGVLLTPFEMAIFVVLYYELRARSEGFDLEQR  254

Query  328  KRQ  330
             RQ
Sbjct  255  ARQ  257


>MYC65673.1 hypothetical protein [Acidobacteriia bacterium]
Length=211

 Score = 61.3 bits (145),  Expect = 8e-08, Method: Composition-based stats.
 Identities = 29/190 (15%), Positives = 61/190 (32%), Gaps = 6/190 (3%)

Query  143  KPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLR-HVG  201
                       +  W +         L    +       +    +G+  +++        
Sbjct  1    MKPVDWEKPGPSINWIVSALRFVGGTLFQGTVILVTARGVLGVKIGILNALRQMRPALFL  60

Query  202  SFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW  261
                  ++ I+ +    L L++PGL++ V +    +V+  +      AL +S  L+S ++
Sbjct  61   RMLGTELVKIIFLIFLYLALVVPGLIYTVRWALSVHVVVLERRVYRDALRRSTELMSYNF  120

Query  262  WAIFGRFVLLLVISLTLSFL--TARIPY---VGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
            W   G  + L+V+            I     V          LL P   +   L+Y D++
Sbjct  121  WRWVGLTLPLIVLLGFELLADRFQIIDLEEGVTFLTATFIHSLLAPVVDVAITLLYFDIR  180

Query  317  ANYRGPQHPP  326
                G     
Sbjct  181  VRNEGLDIEM  190


>WP_016444535.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Gleimia europaea]EPD31424.1 hypothetical 
protein HMPREF9238_01195 [Gleimia europaea ACS-120-V-Col10b]
Length=395

 Score = 63.6 bits (151),  Expect = 9e-08, Method: Composition-based stats.
 Identities = 24/134 (18%), Positives = 48/134 (36%), Gaps = 17/134 (13%)

Query  207  LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG  266
             ++ ++     S++L I   L  + F F   ++  +      AL +S  L    +W + G
Sbjct  250  GLVSLVWFFAISVVLQIAFYLIAIRFMFAFPIILVEGQKVRPALSRSWKLTRERFWPLLG  309

Query  267  RFVLLLVISLTLSFLTARIP-----------------YVGEAANLAFSLLLTPFSFLYYY  309
              +L+ +  L ++   + I                   +   +     LL+TP S     
Sbjct  310  TSLLMGLAMLAIAIAFSAIFGVISGLVIFFESTTLLALITAISTALIVLLITPLSVYALT  369

Query  310  LIYSDLKANYRGPQ  323
            L+Y D +    G  
Sbjct  370  LMYVDTRIRKEGYD  383


>GED96985.1 hypothetical protein nbrc107697_10240 [Gordonia crocea]
Length=213

 Score = 61.3 bits (145),  Expect = 9e-08, Method: Composition-based stats.
 Identities = 32/193 (17%), Positives = 75/193 (39%), Gaps = 7/193 (4%)

Query  108  ADSWELFCRRGWGLL-GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
              +W  F     GL  G+ +L  +   A +F+ +           ++    A+ +  +  
Sbjct  10   RWAWRKFTDNVAGLFVGVLVLMALSTVAMLFTIVGFVVLMGSASDSEGAGIAVGIVLLVL  69

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSM------KLGLRHVGSFTLLLILLILVVGGGSLL  220
             +L    ++           + +   +       L  R+ G+   L++L  L V  GS++
Sbjct  70   GILAYVGVSAYFASAYTGGLIDIANGVPTTTSSFLRPRYFGTVFALVLLQTLAVLAGSVV  129

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
              + G++   +  +   V+ D  +G ++A++ S  LV GH    F  F+++  +++ +  
Sbjct  130  FCVGGIIVSFFLAYAVMVVVDKGVGAIEAMKTSVSLVRGHLGDAFVLFLVVTALTMVIGV  189

Query  281  LTARIPYVGEAAN  293
            L      +     
Sbjct  190  LATPFVQLLTVFA  202


>MBA3264083.1 hypothetical protein [Thermoleophilaceae bacterium]
Length=197

 Score = 60.9 bits (144),  Expect = 9e-08, Method: Composition-based stats.
 Identities = 26/155 (17%), Positives = 50/155 (32%), Gaps = 1/155 (1%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD-VGLFR  191
               +F  + +     L               V         +  ++   +       +  
Sbjct  29   ALVLFLPVAILNGIVLTSGGVLAALLSAAIAVIATYWYQGMVVEAVRDILDGRRDHTVGS  88

Query  192  SMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALE  251
                    +       +L  + V  G +LLI+PGL     +      +  D  G + +  
Sbjct  89   LFSSATPFIWPLFGAGVLAGIGVLIGFILLIVPGLFLLTIWAVIVPAIVIDRTGVMGSFG  148

Query  252  KSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            +SR LV G  W +FG  V+L ++ L +  +   I 
Sbjct  149  RSRELVRGSGWQVFGVIVVLFLLQLVIGGILNAIA  183


>WP_054970279.1 hypothetical protein [Alicyclobacillus ferrooxydans]KPV42593.1 
hypothetical protein AN477_16520 [Alicyclobacillus ferrooxydans]
Length=303

 Score = 62.9 bits (149),  Expect = 9e-08, Method: Composition-based stats.
 Identities = 29/215 (13%), Positives = 64/215 (30%), Gaps = 11/215 (5%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L    ++ ++          L          N            A +   +         
Sbjct  92   LSAFVVIPLLYGSVLHIVVSLQYDRQERPCGNWAAFLHAWRRLPAVLGTNVLRWIIYFVA  151

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
            +     +     +       G   L    + + V   SL   +  +   V F F      
Sbjct  152  FAVSAAI-----IIAVGALFGGMGLTSAAVTVAVMILSLTAAVFLIWLAVKFAFVPSATL  206

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP------YVGEAANL  294
            ++ +    +  +S  L  G+ W I G ++++ ++   +S     +        +      
Sbjct  207  EEKVSFGASFRRSFELTRGNMWRIIGYYIVVKLLIFVVSLGFGLLLSLVKAVVLQSILTD  266

Query  295  AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
              ++++TPFS L   ++Y DL+     P      R
Sbjct  267  LVAVVVTPFSILAMAILYLDLRIRTEAPDLAAWLR  301


>WP_026942390.1 hypothetical protein [Hellea balneolensis]
Length=277

 Score = 62.5 bits (148),  Expect = 9e-08, Method: Composition-based stats.
 Identities = 24/214 (11%), Positives = 69/214 (32%), Gaps = 0/214 (0%)

Query  77   NCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPI  136
               +      L   ++ + +    R +++  +  +         +L   ++  +++    
Sbjct  1    MVDQTPNPQELYEWQKPKPAFQFGRVMNRSFSGLFRNIKPIMIVILISLVVSSLVSAPMY  60

Query  137  FSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLG  196
            F       A   +P          +    + +    +     F +     V         
Sbjct  61   FLDSTDPTALMQSPSYWIITAMSSVFGFLFFMFICVFTDHFAFAHFTNRPVKFRYVALRS  120

Query  197  LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLL  256
            L+       +++L  +    G + LI+PG++  V +         +      +  +S  L
Sbjct  121  LKLTIPILFIVVLYFIASYIGMIFLIVPGVIISVGWAIIGPSYLHEETSLFGSFGRSWEL  180

Query  257  VSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
              G+ W ++   +++ +I + +  L A +  V  
Sbjct  181  TRGYKWWVWLATIVMGIIMMIVFSLAAVLLSVTA  214


>KKS94333.1 hypothetical protein UV70_C0002G0042 [Parcubacteria group bacterium 
GW2011_GWA2_43_13]OGY68799.1 hypothetical protein A3B94_01565 
[Candidatus Jacksonbacteria bacterium RIFCSPHIGHO2_02_FULL_43_10]OGY70514.1 
hypothetical protein A2986_02200 [Candidatus 
Jacksonbacteria bacterium RIFCSPLOWO2_01_FULL_44_13]HAZ16451.1 
hypothetical protein [Candidatus Jacksonbacteria 
bacterium]
Length=345

 Score = 62.9 bits (149),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 46/272 (17%), Positives = 88/272 (32%), Gaps = 19/272 (7%)

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
            G       +   I + +       + GLR+   F  + ++   ++ GG +   IPG+   
Sbjct  77   GWWAGPAIVDGAIREDNAPFSSVARNGLRYAFPFIGIALVGSWMILGGFIAFGIPGIAIM  136

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP---  286
                F   +        +++L  SRLLV G WWA+FGR  L+ ++      +   I    
Sbjct  137  TQLIFASAIYVHKKTKIMESLRLSRLLVKGRWWAVFGRTALIGLVIYGFLLVLLLIAWLL  196

Query  287  -----------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLT  335
                        +       F L+  P+   Y   +Y DL    + P    I++    L 
Sbjct  197  GTLIGENTGTKIMSAIIMGIFLLVYLPYVMCYIAELYHDLDKTKQQPDEKQIQKNKTFL-  255

Query  336  AAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLS  395
                   +I G++ + L    ++   + S     +      P      +      P   S
Sbjct  256  ----KVFMILGIVGLPLIIGIVTTIAIYSVNNFRKASTNDIPASWQIDDSDTTIIPDDAS  311

Query  396  SADYKLLLSKQRKTTSEGGLSLGPVTLFADRF  427
              +  + L+   +       +L          
Sbjct  312  LENADIPLNGTTQLDEADIEALLQQYQEQSAP  343


>PKL90739.1 hypothetical protein CVV21_11340 [Candidatus Goldbacteria bacterium 
HGW-Goldbacteria-1]
Length=443

 Score = 63.3 bits (150),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 40/408 (10%), Positives = 107/408 (26%), Gaps = 3/408 (1%)

Query  138  SALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGL  197
               +          + ++    +L     +L+    +            +    +   G+
Sbjct  37   PYAVTHYLAAAGKASSSFIIPAMLINAFLMLVNNGALYTLFHKSYNGVKISFSEAFMAGV  96

Query  198  RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLV  257
            +  G   L+ +L+   +  GS LLIIPG++    ++F    +  ++   +  L+ S  L 
Sbjct  97   KVSGKLLLVGLLITAAIMAGSFLLIIPGIIIAFKYWFAVIAVIVEDKE-IGPLKLSSHLS  155

Query  258  SGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKA  317
             G+   IF   +L +++S+   F+    P           +     + +      +  K 
Sbjct  156  KGNAGIIFLTMLLFVLVSVFTPFIYRLAPVNAFLFFPLMIIFNLLSTIVQSIQYITYAKI  215

Query  318  NYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQP  377
                       +     + A         + L +                 I + +    
Sbjct  216  RASKADEISPDKIKAIDSGAGCAITAALIIFLTAAGIIAAVFVTKSIGFNKILKAIYGNT  275

Query  378  QQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLW  437
                +       E                 +    G   +    + +      ++     
Sbjct  276  AVLSEDINLQMNENWYFIKLPPGHYSYTVIRHNETGKQGIYAAMIRSIELTEVEKTLMGD  335

Query  438  LKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLF  497
              +  S    L  A +       +        +L ++                   N   
Sbjct  336  SGVINSPL-KLIHALEARINNGNESNSIPAKWNLDEKIKILPVMVNGTKWSKAVVPNADE  394

Query  498  SGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQI  545
            S      +   T+ +++  +  +        I++      ++    +I
Sbjct  395  STGTKWNVFFTTKKDRLIYLFYRTAF-DKTDIKTYTEDEKELFSLFEI  441


>RLD98598.1 hypothetical protein DRI91_02770, partial [Aquificae bacterium]
Length=212

 Score = 61.3 bits (145),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 36/203 (18%), Positives = 78/203 (38%), Gaps = 0/203 (0%)

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI  167
             +    +       L   ++ ++ +F  I SA     +              +  T+   
Sbjct  9    FNYLMGYPVFFVPPLIPAIVSLLASFIFITSATAAIHSGKGLMLAFIMLILGVTITLVVQ  68

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
            L+  + +     +        L  S++  +  +G   +  +++ + VG G LLL++PGL+
Sbjct  69   LVTSAMLIHMAQVVEEGYAPSLKDSLQASMDRLGDIVVASLIVSVAVGIGLLLLVLPGLV  128

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
                F F    +   N   + A+  S  +V G++    G F++L++I L ++ +   +P 
Sbjct  129  LAFLFIFTLQEVVVGNKSAVDAIRGSFEMVKGNFGNTLGFFIMLIIIILIIAGVLGLVPV  188

Query  288  VGEAANLAFSLLLTPFSFLYYYL  310
            +G+                 YYL
Sbjct  189  MGDFVANLVITPYYYIVTTIYYL  211


>MBA2735288.1 protein kinase [Pyrinomonadaceae bacterium]
Length=455

 Score = 63.6 bits (151),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 25/168 (15%), Positives = 58/168 (35%), Gaps = 0/168 (0%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
             +  L I      +  A   S  +   A              +    A  + G +    +
Sbjct  130  FYLPLIIVTFIEPVLDAFYVSGQISNSANNAISILFFIANLFIGFLCASFISGTTTWIVA  189

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
              + +    + L  +++   +         +L  ++   G  + IIPGL+  V +     
Sbjct  190  QTLAVPLRPIRLRPALEATRKKWKRLVGTGLLTGVLSFIGYAMCIIPGLILSVLWALVSP  249

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            ++  +N+ G  A+++S  LV           ++++ + L    +T  I
Sbjct  250  IVMMENLRGRAAMKRSTALVKRSLRTTTAASLIMIFVPLFFGLITGAI  297


>WP_030890737.1 MULTISPECIES: hypothetical protein, partial [Streptomyces]NNG87569.1 
hypothetical protein [Streptomyces cacaoi]GEB53728.1 
membrane protein [Streptomyces cacaoi subsp. cacaoi]
Length=475

 Score = 63.6 bits (151),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 27/133 (20%), Positives = 42/133 (32%), Gaps = 24/133 (18%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              G L   +  +   V F      L  +  G L A+ +S  LV G WW +FG  +L L+I
Sbjct  319  TLGLLGGCVAAVWIWVSFCLAPPALMLEKQGVLAAMRRSAKLVRGSWWRVFGVQLLALLI  378

Query  275  SLTLSFLTAR-IPYVGEAANL-----------------------AFSLLLTPFSFLYYYL  310
               +S +    +  +G                              S +  P S     L
Sbjct  379  VFLVSTVIQLPVTGIGSLLTGSGDVLATSGTDWPDLLVDGIGSVLASTVSLPLSAGITAL  438

Query  311  IYSDLKANYRGPQ  323
            +Y D +       
Sbjct  439  LYMDQRIRREALD  451


>MBI2442806.1 DUF975 family protein [Candidatus Levybacteria bacterium]
Length=204

 Score = 60.9 bits (144),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 40/188 (21%), Positives = 70/188 (37%), Gaps = 11/188 (6%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
                +V+                   ++      +L   V  + + +S     + + I  
Sbjct  21   LPFFVVVILITGLVNFAPNFLLQGLREDTPMLSGLLSLAVWVLSMLVSLGAIKISLKIID  80

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
                    +  G   + +F L  I+  ++VG G LLLIIPG++F + + F  Y++ D N+
Sbjct  81   NKKAEIVDLFNGYPLLLNFILSSIIYAILVGVGLLLLIIPGIIFGIKYHFYSYLIVDKNM  140

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
            G L+AL+KS  +  G  W +                    I  VG  A      +  P +
Sbjct  141  GPLEALKKSGEITKGVKWDLL-----------LFGLACGLINIVGALALGIGLFITVPIT  189

Query  305  FLYYYLIY  312
             L Y  +Y
Sbjct  190  MLAYAYVY  197


>RZM09416.1 hypothetical protein EOP67_70085, partial [Sphingomonas sp.]
Length=197

 Score = 60.6 bits (143),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 27/132 (20%), Positives = 46/132 (35%), Gaps = 0/132 (0%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            I LA+     L    +  +   +     +     +  G R       L I++   +  G 
Sbjct  6    IYLASFVLTQLAQGGLVRATIAHSDHRTISFAACVATGARFALPLLALGIVMGFALLLGF  65

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
               IIPG+L  + +      L  D  G   A  +S  L  G  W +FG  +++LV+   L
Sbjct  66   AFFIIPGVLLYLAWAVAVPALVIDGTGVFGAFGRSAFLTRGARWNVFGIGLVVLVLYYIL  125

Query  279  SFLTARIPYVGE  290
                  +     
Sbjct  126  LAGVGIVTVAIT  137


>MBI5948430.1 hypothetical protein [Chloroflexi bacterium]
Length=241

 Score = 61.7 bits (146),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 22/166 (13%), Positives = 46/166 (28%), Gaps = 15/166 (9%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG-GGSLL  220
                +  +  +    S+   +    + L  ++      +G   ++ IL  LVV    S+ 
Sbjct  63   FEALFGQVARAATIFSVSRAVKGERIRLVNALDPAFTRMGGLLVVAILYGLVVAPVLSIF  122

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            L    L F + F         D +    A+  S  ++ G+         L + + +    
Sbjct  123  LFPIALYFALRFGLSFEAFVIDGVSPTAAMRTSWRVMRGNLLRFICLLALFVAVLVGPLI  182

Query  281  LTARIPY--------------VGEAANLAFSLLLTPFSFLYYYLIY  312
            L + +                V         +           L Y
Sbjct  183  LLSSLALVDAGSRGGNIAVTAVLTVLQGVVLIPFLSLFTAATTLFY  228


>TLZ73523.1 hypothetical protein E6K14_05275 [Euryarchaeota archaeon]
Length=337

 Score = 62.9 bits (149),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 21/116 (18%), Positives = 47/116 (41%), Gaps = 6/116 (5%)

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
            F+   I L+  + G  +L  I  +   V        +  +  G +  L++S  L  GH  
Sbjct  179  FSGAGIALVCGLLGALVLGSILAIYVFVAMSLYAPAIMIEGAGAVDGLKRSWALTKGHRL  238

Query  263  AIFGRFVLLLVISLTLSFLTAR------IPYVGEAANLAFSLLLTPFSFLYYYLIY  312
            ++FG   ++ ++S  ++           +  V   A+   S ++ P++ +   + Y
Sbjct  239  SLFGALFVVFLLSGVVTTAVTFPAGLADLWVVSLVASALASAIVAPWAVILAAVAY  294


>MBJ25907.1 hypothetical protein [Flavobacteriaceae bacterium]
Length=209

 Score = 60.9 bits (144),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 33/188 (18%), Positives = 61/188 (32%), Gaps = 12/188 (6%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
             I   F       +      L       Q  I       ++  +S       + +     
Sbjct  22   VIPTFFFYGILTQVFSEFLKLFLLFGPLQRGIFDLAYFALVAPMSVGIIIYSLAVVNKKD  81

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGG  246
              F  + +          L+ +L +++ GG +L IIPG++  + F    Y+ A+D  I  
Sbjct  82   FSFNQLFIPYDDYLKTIGLIFVLSIIIIGGFVLFIIPGIILSLIFSMSLYIFAEDPQITI  141

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFL  306
             +AL KS  L  G+   +F            L  +   +  +         + L P   +
Sbjct  142  GEALRKSSELTKGYRTKLF-----------LLGLIYTGLLILSVFTLFIGLIWLIPLFGI  190

Query  307  YYYLIYSD  314
                 Y+D
Sbjct  191  TSANFYND  198


>OIO48197.1 hypothetical protein AUJ33_00165 [Parcubacteria group bacterium 
CG1_02_40_25]PIZ71481.1 hypothetical protein COY09_00630 
[Candidatus Portnoybacteria bacterium CG_4_10_14_0_2_um_filter_39_11]
Length=334

 Score = 62.5 bits (148),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 55/244 (23%), Positives = 105/244 (43%), Gaps = 17/244 (7%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLA-TVAYILL  169
            ++ + +     L I++  ++L  A +    L       N  +        +   +  +++
Sbjct  52   FDFYKKHWKLFLQIFVWPLILNCAWLVLTSLSDLIIKQNEASLFVSVVGWITPLIWLLIM  111

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             +S  +G+  I+ CK       + +L    +G +  L IL  L+V GG +L ++PG++F 
Sbjct  112  VISIWSGAALIFACKEPTTWQLAYRLAWHKLGDYLWLAILTGLMVVGGIMLFVVPGIIFS  171

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI----  285
            +WFF   +VL  +++ G  AL KS+  V G+WWAIFGR ++  ++   ++     +    
Sbjct  172  LWFFLGAWVLIFEDLKGRDALLKSKFYVQGYWWAIFGRQLVFSLLIAAIAGGAMLLGLLL  231

Query  286  ------------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
                          +        SLL++P +    Y++Y DLK          + R    
Sbjct  232  SFLVKTISSTTSSILIIVWQCLVSLLISPLTVGLSYVLYQDLKNKKSAVAFEIMPRAKKH  291

Query  334  LTAA  337
            LT  
Sbjct  292  LTWW  295


>MBB71806.1 hypothetical protein [Legionellales bacterium]
Length=262

 Score = 61.7 bits (146),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 35/241 (15%), Positives = 81/241 (34%), Gaps = 13/241 (5%)

Query  93   FRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFS--ALLLKPATWLNP  150
                     + S +   +W+L+      ++ + LLG ++A     +              
Sbjct  1    MTHYAKQPLAYSDIFRHAWQLYRGTFKHVVPVALLGAIIASILEVASGYGFGDAINNTPW  60

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
                  W + + ++ + LL  + +      ++ +T     ++   GL  + S  L  +L 
Sbjct  61   DYMVVLWYLHIPSLIFALLAYNAIYIRYQAHLDQTACSYGQAWWRGLLRLPSTFLATLLY  120

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
            +++   G  L ++PG++  V  FF   +    +    +AL  S  LV  HWW      ++
Sbjct  121  VVIAVIGLFLFVVPGIIVLVSLFFFYPMTVVRHENSFKALCHSHRLVWPHWWRSLSVIIV  180

Query  271  LLVISLTLSFLTARIPYVGEAA-----------NLAFSLLLTPFSFLYYYLIYSDLKANY  319
             ++  L   F    +                   +   +L   F   +   ++     + 
Sbjct  181  PILSLLIAIFAIQGVSLGMGVLTHGSLLIMRTTGMVLGILFGTFVCPWMIAVFMQQLMDL  240

Query  320  R  320
             
Sbjct  241  E  241


>RZA05935.1 hypothetical protein EOP11_11585 [Proteobacteria bacterium]
Length=213

 Score = 60.9 bits (144),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 28/169 (17%), Positives = 58/169 (34%), Gaps = 0/169 (0%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
                        + L    +           + L     +          +   I +   
Sbjct  16   AWETFLKAPEIFIALMLGQFALSFVLPRIPGLGLLIWVLVASFTIPSFVLVAEAIRRDGR  75

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
              F  ++  L+       + +L  ++VG GS+LL +PG+     + F + +   +     
Sbjct  76   AKFDCLRPLLQLAPQLIAVFLLKSILVGIGSILLFVPGVYLFTIWAFAEIIATVEKKTFW  135

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
            +++E SRL+V G+W+ +FG   + L+I  +   L      V        
Sbjct  136  ESMESSRLMVRGNWFPVFGLVAVALIIVFSGVMLLGVGMLVSVPMGTLL  184


>PSQ42289.1 hypothetical protein BRD17_09020 [Halobacteriales archaeon SW_7_68_16]
Length=250

 Score = 61.3 bits (145),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 25/159 (16%), Positives = 59/159 (37%), Gaps = 0/159 (0%)

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
              A     ++          + I +         +   R +  +T++ ++L+  +  GS+
Sbjct  69   WQAAAVVAIVSFVATFVVAVVAIRRFGADGGTGERSTGRALLQWTVVTVVLVATIALGSV  128

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            LL++PGL+  V F F   V   ++   ++A  +S   V+ HW  +   ++ L VI++   
Sbjct  129  LLVLPGLVAAVVFSFAPVVAVVEDASVVEAFRRSWETVTEHWKTVLPLYLGLAVIAVAWL  188

Query  280  FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
             +                 +  P     + +    ++  
Sbjct  189  IVGGLGTVALSTILPVLGDVFAPVVPATFIVFVLAIQTR  227


>NCT55999.1 hypothetical protein [bacterium]
Length=283

 Score = 61.7 bits (146),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 27/135 (20%), Positives = 47/135 (35%), Gaps = 1/135 (1%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSM-KLGLRHVGSFTLLLILLILVVGGGSLL  220
               A  L+                 +     +  + ++    F    IL  L+VG G + 
Sbjct  132  ILAAVQLIFSMGYINLTIDAARGNKLDYKTLLNHVSIKKAFRFLGASILAGLLVGFGLVF  191

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
             I+PG+ F + + F  Y++ D N    +A   S  L  G    +F   +L L++ +   F
Sbjct  192  FIVPGIYFAIRYMFVGYLIVDKNASIGEAFSGSAALTKGVKLPLFVLGLLFLMLGVLGVF  251

Query  281  LTARIPYVGEAANLA  295
                  YV       
Sbjct  252  ALFIGIYVVAIVATL  266


>WP_193327074.1 hypothetical protein [Trueperella sp. 19M2397]QOQ38335.1 hypothetical 
protein HLG82_01985 [Trueperella sp. 19M2397]
Length=398

 Score = 62.9 bits (149),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 21/249 (8%), Positives = 58/249 (23%), Gaps = 23/249 (9%)

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
               +++      +     +      +          +L +      +             
Sbjct  150  LLGLYLIFFALAIVFVIVLVGIAAMIIQMIGTSSFPVLPLLAAPFAVFTLLFFAFYRLII  209

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG-----  289
                +  ++IG L A+ +S  L  G     FG +V +++I   L+ + + +  +      
Sbjct  210  APSAMVAEDIGPLAAMSRSLRLTKGSLGYFFGLYVAVMIIFTVLTTVMSILLALTIGTTL  269

Query  290  ------------------EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
                                  L  S++++P +     LIY +++          +    
Sbjct  270  ASPDPFSGTGNLIGATSVILMTLVISVIISPIAMSLINLIYINMRMKRENFHQEFLYASG  329

Query  332  LPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEP  391
              + A                                 + +                +  
Sbjct  330  SSVAAVNPQRGPFGVEPTGQGQNGRDQGSPYGQTPYGQESQPRYGQYGQAGEASDEWQGR  389

Query  392  QRLSSADYK  400
             +      K
Sbjct  390  PQWYGDADK  398


>ESS70677.1 hypothetical protein MGMO_120c00640 [Methyloglobulus morosus 
KoM1]
Length=164

 Score = 59.8 bits (141),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 34/157 (22%), Positives = 62/157 (39%), Gaps = 8/157 (5%)

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
               +   + + +      + +G + +    +  IL  + V GG +LL+IPG++  +    
Sbjct  1    MHRVDSLVHQQEDSFSDGLNVGFKTILPVFIGFILYTIAVIGGFILLVIPGIIVSLSMVL  60

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP--------  286
              Y +  D++GG  +++ S  LV GHWW     F +  +I   L  +   +         
Sbjct  61   SIYFIVLDSLGGYASIKASHKLVWGHWWKTATVFTIPTIIICILYGIFGALAAYMGTDKK  120

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
               +      S   TPF     Y+ + DLK    G  
Sbjct  121  LAIDITIQIISAFTTPFLVSVGYVQFHDLKLRKSGSD  157


>OGZ24322.1 hypothetical protein A2896_01665 [Candidatus Nealsonbacteria 
bacterium RIFCSPLOWO2_01_FULL_43_32]
Length=231

 Score = 60.9 bits (144),  Expect = 1e-07, Method: Composition-based stats.
 Identities = 37/167 (22%), Positives = 71/167 (43%), Gaps = 4/167 (2%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWL----NPQNQNWQWAI  159
             Q+  D +            +  +G  + +      ++ K +               + +
Sbjct  62   FQVYKDRFWTLVGIMLPPFLLGWIGYGIWWFLSLVGVITKMSLEDTGGLILFLFLILFGL  121

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
            +   V  I    S +     I   + D+G+  + ++G   + S+  + IL  L+V G  L
Sbjct  122  IFFVVLIIAGLWSQIALLCAIKEREQDIGIKEAFRMGWHKIISYYWVSILSTLLVLGAFL  181

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG  266
            L  +PG++  +WF    Y+L  ++  G+ AL +S+ LVSG WW +FG
Sbjct  182  LFFVPGIILAIWFSLALYILIAEDKKGMNALSRSKQLVSGKWWTVFG  228


>ANM29506.1 hypothetical protein ABI59_07795 [Acidobacteria bacterium Mor1]
Length=2021

 Score = 64.0 bits (152),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 16/231 (7%), Positives = 32/231 (14%), Gaps = 6/231 (3%)

Query  28    RCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCL  87
             R  E               +        P              +    +N     + +  
Sbjct  1674  RLYENRGAKYAPRFGRHMQRFIPGQLIGPMGSAYNVPMFFPCPLNWFGMNPFWWGKGWWW  1733

Query  88    QPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATW  147
              P        +            W  +    W           +    +           
Sbjct  1734  HPGAWNMGWWAWRTCWWFGGPWGWWQWRPWVWFGGPFA--SWWVWKPWVTWWYAPFRWWG  1791

Query  148   LNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLL  207
                    + W  +                   +         + +        G F    
Sbjct  1792  WRAWAPWYGWWGIWPYWGAFGWCTWLPVWGWNLINGWWPWWGWNAWVGWWPGSGWFGWCG  1851

Query  208   ILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
                              G      F++  Y        GL         V 
Sbjct  1852  WWPFWNTWWAWHPYHWNGWWTWSPFWWHGYWGW----SGLGFWWNWWGWVK  1898


>WP_187994411.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Schaalia sp. JY-X159]
Length=166

 Score = 59.8 bits (141),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 27/119 (23%), Positives = 44/119 (37%), Gaps = 17/119 (14%)

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS----  275
            L I   +   V  +F   +   +      AL +S  L  G +W I GR +L+ +I     
Sbjct  31   LGIAVTVWVSVRLYFATLIAVVEGATPPTALRRSWALTKGAFWRILGRMLLMSIIVSIVV  90

Query  276  --------LTLSFLTARIP-----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
                      + F T+ +P     ++    +   S L  PFS  Y  L+Y D +     
Sbjct  91   GLLGGTISAVIMFATSVLPWAVTAFLLALVSALISGLAMPFSASYTSLMYVDERVRKEN  149


>MBI2568879.1 hypothetical protein [Candidatus Schekmanbacteria bacterium]
Length=274

 Score = 61.7 bits (146),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 26/133 (20%), Positives = 49/133 (37%), Gaps = 0/133 (0%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
                  L  +     +          L   ++L  R +     L   L L++ GG+LLL+
Sbjct  78   GFLIWPLQQAAAILVVAGSYTGELPSLGECLRLAARKLLPVLALSTALGLILIGGALLLV  137

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
             P  +    ++     L  +N+   QA  +S  L  G  W IF   +L+++++  +S   
Sbjct  138  FPAFIALCTYYVAVPALVLENLTWTQAFGRSAELTKGERWPIFWFVLLIILMTAVVSGAG  197

Query  283  ARIPYVGEAANLA  295
                 +      A
Sbjct  198  QVPAAIATLLLGA  210


>WP_003393144.1 hypothetical protein [Brevibacillus borstelensis]EMT49965.1 hypothetical 
protein I532_24849 [Brevibacillus borstelensis AK1]KKX52892.1 
hypothetical protein X546_22295 [Brevibacillus 
borstelensis cifa_chp40]MBE5393893.1 hypothetical protein [Brevibacillus 
borstelensis]NOU54132.1 hypothetical protein [Brevibacillus 
borstelensis]RNB64104.1 hypothetical protein EDM54_07950 
[Brevibacillus borstelensis]
Length=250

 Score = 61.3 bits (145),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 45/226 (20%), Positives = 81/226 (36%), Gaps = 12/226 (5%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
             ++  +   L  I  L      A    A   +     +P             +   L  +
Sbjct  26   RIWLSQFPFLFLIVSLLFWSEQALTGWAQSQQLKEAFSPLIMTPVN-----GILLSLYFV  80

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
                     ++        +++      V    L  +L    V  G+LLLIIPG+L  +W
Sbjct  81   YMGIVLKRSHLGTQWDQQLQALTAFGSLVPVVILASLLEFAGVALGTLLLIIPGILLMIW  140

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI------  285
            F     V+A + +G + AL++S  LV G    + G  ++  V+   L +L   +      
Sbjct  141  FTVFPQVIAFEGMGAIAALKRSLFLVKGSTLRVLGIILIFAVVRAVLQYLPGFLFPAIAS  200

Query  286  -PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
             P +     L   +L+ PF    YYL+Y +L+           + +
Sbjct  201  QPAIAFLLFLIREMLILPFEGAAYYLLYLELRTRKEAFDFDVYQEE  246


>MQG02979.1 hypothetical protein [SAR202 cluster bacterium]
Length=254

 Score = 61.3 bits (145),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 35/201 (17%), Positives = 74/201 (37%), Gaps = 15/201 (7%)

Query  135  PIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMK  194
                 LL      LN QN     ++++      +L  + +      +I    V     + 
Sbjct  43   FGEPFLLQNVMDQLNWQNILILASLIILVWVTSILSTAAIIFGAAQFIQDGKVSQVLCID  102

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLL-----IIPGLLFCVWFFFCQYVLADDNIGGLQA  249
              L        + I L +++    LL      I   +   + + F    +  ++ G + +
Sbjct  103  YALSCSIKLIGVSIALPILLIIPLLLSFILIGIPLLVFLLIRWNFAVCAVVLEDKGVINS  162

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP----------YVGEAANLAFSLL  299
            L++S LLV+G WW  FG  + +LV+ +  S +   I            V        +++
Sbjct  163  LKRSWLLVTGKWWITFGTVLAVLVLIVIPSAVLGLINTVISSFFENFLVSHILEGITTVI  222

Query  300  LTPFSFLYYYLIYSDLKANYR  320
            + PF+ +   + +  L+ +  
Sbjct  223  IIPFASIATGIYFLGLRMSKE  243


>WP_038672995.1 hypothetical protein [Pelosinus sp. UFO1]AIF53084.1 hypothetical 
protein UFO1_3541 [Pelosinus sp. UFO1]
Length=246

 Score = 61.3 bits (145),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 29/194 (15%), Positives = 69/194 (36%), Gaps = 5/194 (3%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
            ++FA      +                 + +      ++G   +   +   +    +   
Sbjct  37   ISFAMSLPYSISHGTVKDYHSFNFLDVFLFVVGFISHIIGSCALIQFIADIVYNKAINWT  96

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
              M        +  +   + +L +    LL +IPG+++ + + F    +    + G++AL
Sbjct  97   IIMSFARAKSSTAAITYAIFMLRILLLGLLFVIPGIIYTILYIFTLEAVVLRGLRGMKAL  156

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
            + S+ LV G+WW +F   +L+  I +  +++            L+F  LL P    +  +
Sbjct  157  KYSKELVKGYWWRVFSSLLLIKGIEILFAYIFK-----TYFTYLSFYFLLGPVFKGFEVV  211

Query  311  IYSDLKANYRGPQH  324
                L        +
Sbjct  212  FTLLLFVRLEQLNN  225


>MAF18131.1 hypothetical protein [Oceanospirillaceae bacterium]
Length=225

 Score = 60.9 bits (144),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 35/216 (16%), Positives = 68/216 (31%), Gaps = 2/216 (1%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
                      FCR  W  +   ++ + +    + + L             +     +   
Sbjct  1    MSEWLRDSLSFCRTYWWAIVAIVVPVAIMREFLLANLGWYEMGTGASTELSTLLTAIGVI  60

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
            V +       +   +   + K ++   +     L  +  F  + IL+   V  G L  I+
Sbjct  61   VIFESFIQIKLILLVQGVLAKQNLSFSQRSSRALVALLPFIFMQILISFGVMAGLLAFIL  120

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            PG+   V +    Y L     G L +L+ S     G+ W +F   ++  ++S   +    
Sbjct  121  PGIYVFVRWVLAPYFLLIQGQGALASLKSSWKFSRGYTWDLFFGLLITALVSAIPTLFIL  180

Query  284  RIPYVGEAA--NLAFSLLLTPFSFLYYYLIYSDLKA  317
                 G     N   S+L +  S     L Y     
Sbjct  181  NGASGGNVILSNAVLSILSSILSAWSVVLFYRAYDY  216


>MBC19125.1 hypothetical protein [Planctomycetaceae bacterium]
Length=363

 Score = 62.5 bits (148),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 40/354 (11%), Positives = 88/354 (25%), Gaps = 57/354 (16%)

Query  2    PTVRCPHCGAERNTPSSK------LPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATC  55
                CP C A               P     +  P          A S  +    +    
Sbjct  3    IEFVCPECRATLRVGDDARGKKAQCPQCGKISELPSQIAPPSPSQAASTLSVEQSDTTFL  62

Query  56   PHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFC  115
                + R             ++      S               L     ++A++   + 
Sbjct  63   GQSHITRGPMLPVRIDFGSVLSRGWRIASRRYGLALLGSTLFLLLNIAGGIIANAISEWG  122

Query  116  RRGWGLLGI------------YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
            R     L I              L   +    +                  + W IL   
Sbjct  123  RLAVDPLSISFASTVGRILFDTWLFGGVTLFFLKLTRGQTARVADLFAGGKFLWRILGVN  182

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLI---------------  208
            +  +L   + +     I      +    +        G+  ++                 
Sbjct  183  ILIVLSLSAILIIGCGIPALIGYLSAPDAAVRDRAEDGATKVVATETPSGKTNNGLTTDE  242

Query  209  -------------LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
                           +  +G G +L+ +P ++  + F     ++ D  +G L+A+ +S  
Sbjct  243  SADQEMRQNPRTAAAMAGIGIGLMLVFLPVIILGIMFSQAVLLVVDRGMGSLEAMRQSIR  302

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
            +  G+ + +FG           L  + + I ++G  A       + P  ++   
Sbjct  303  ITRGNRFTLFG-----------LGLVLSFISFLGVLAFFIGLFFVIPVCWIIGT  345


>MYE53787.1 hypothetical protein [Chloroflexi bacterium]
Length=309

 Score = 61.7 bits (146),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 24/211 (11%), Positives = 61/211 (29%), Gaps = 24/211 (11%)

Query  144  PATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVG--  201
                           + + T+         +  ++ +      + +          V   
Sbjct  92   LTGDEIVGFATSMAILFIVTLILQTFASGVIVAAVAMQYATGRIDVGACYGRAWWRVISL  151

Query  202  ----SFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLV  257
                     LI+L++      ++  I  L   +++      +  +    + AL +S  LV
Sbjct  152  VILGLLLFGLIVLMIAGFALFIVPGIVILALIIYWSVDVPAVVIEGCKPISALRRSFELV  211

Query  258  SGHWWAIFGRFVLLLVISLTLSFLTARIPYV------------------GEAANLAFSLL  299
             G+WW  F    L+ +  +  + +   +                          +  + +
Sbjct  212  RGNWWRTFATITLMTLTLIGFTVVLTLLLSAPLALLGDDGLSETMTQVSSSLLGMLTNAI  271

Query  300  LTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
            + P +     LIY DL+A         + ++
Sbjct  272  ILPIAATVGALIYLDLRARNEDYDTGALSQE  302


>MBI4249967.1 hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=301

 Score = 61.7 bits (146),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 30/192 (16%), Positives = 57/192 (30%), Gaps = 0/192 (0%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
                   + +       +    ++          L          + +W   + L     
Sbjct  93   YEVMHNSWLQTKKHFWVLASFVLIQIIVLNLPTFLFPKMFGSLYASSSWLRNLALVVQFA  152

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
            +LL       ++   I       F    +G      F    IL + V   G  LL++PG+
Sbjct  153  LLLYTQAGFVNVSFRIIDGKTLSFVHFFVGGFKYVKFASAGILHVFVSTVGLFLLLLPGV  212

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
             +   FF   Y++ D N   + AL +S +L       +F   V +  ++L          
Sbjct  213  AWYTRFFLWPYIVIDKNASPIAALRESSVLTFHKRKDVFLFIVFISAVNLLGVLALLVGA  272

Query  287  YVGEAANLAFSL  298
            +V          
Sbjct  273  FVAIPLTSIALA  284


>HHM02727.1 hypothetical protein [Caldithrix abyssi]
Length=285

 Score = 61.7 bits (146),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 27/190 (14%), Positives = 65/190 (34%), Gaps = 2/190 (1%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             +        F  + S       T+    +  +   +    +   ++ L      +    
Sbjct  48   LLNYFFFSKIFGAMVSNPSDPATTFNLMFSPQYLLLMFTQVLVTTVIALIVNFYVLDYVE  107

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                  + R  +L L     F +  I++  ++  G +  ++PG+   +           +
Sbjct  108  GTGPTSVARMWQLVLNKTPIFLVYNIVIFFLMMLGMMFFVLPGIYLAIALVLFIPAAIQE  167

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            N+G   A+++S  L+  +WW   G  +L ++I   +  +      +      A     +P
Sbjct  168  NLGLGAAIKRSIHLIKNYWWFTLGLMLLSIIILYIVYVVIELPFMLLGV--GAVFSAGSP  225

Query  303  FSFLYYYLIY  312
            FS   +  +Y
Sbjct  226  FSMEEFGNMY  235


>HGN33993.1 zinc ribbon domain-containing protein [Candidatus Bathyarchaeota 
archaeon]
Length=511

 Score = 62.9 bits (149),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 35/151 (23%), Positives = 65/151 (43%), Gaps = 5/151 (3%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             T   +++    +   + + + +    L  S+++G R     T   I++ L+VG G LLL
Sbjct  275  VTFLLLVMVAGIVIAFVHVIVEQGRPALLESIEIGFRRYLKLTATTIIVGLIVGVGLLLL  334

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS--  279
            IIPG+ F   +      +  +N G  ++L +S+ L +G     F  F++  +I L +S  
Sbjct  335  IIPGIYFATVYALAPQAVIVENAGVRESLRRSKELTNGAKLKTFLLFLMFGIIYLIISQL  394

Query  280  ---FLTARIPYVGEAANLAFSLLLTPFSFLY  307
                L+  IP   + A       L+      
Sbjct  395  INYILSTMIPIQLQLAGSLVFPFLSILLPYP  425


>MBC7405857.1 hypothetical protein [Candidatus Parcubacteria bacterium]
Length=287

 Score = 61.3 bits (145),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 23/131 (18%), Positives = 52/131 (40%), Gaps = 10/131 (8%)

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
               V +    +    +  ++ L  +L+ L V  G +L I+PG+   V   F   ++ D  
Sbjct  133  NKVVPIVDLFRFNQGYYLNYALASVLVGLSVFAGIILFILPGIYLAVRMQFVFNLIVDGK  192

Query  244  IGGLQALEKSRLLVSG-HWWAIFGRFVLLLVISLTLSFLTARIPYV---------GEAAN  293
               L+A+ +S  L +G  +W+I    ++ + + +  S +   +  +             N
Sbjct  193  HSALEAISESFRLTAGDRFWSILLINIIFVGLYIGFSLILGIVLTIIESVTNTNASTPIN  252

Query  294  LAFSLLLTPFS  304
            +   + + P  
Sbjct  253  ILIGIFIAPLF  263


>WP_092816523.1 hypothetical protein [Afifella marina]RAI17562.1 hypothetical 
protein CH311_17965 [Afifella marina DSM 2698]SCZ46337.1 hypothetical 
protein SAMN03080610_03634 [Afifella marina DSM 2698]
Length=200

 Score = 60.2 bits (142),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 21/135 (16%), Positives = 52/135 (39%), Gaps = 2/135 (1%)

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
            +    +    +  + +       +     +   L+ +    ++ IL+ +++  G  LLI+
Sbjct  27   LIIYQVMNGMLVLAAYDAKVGRPIRPGTYVTSALKRLLPLIVMAILVYVLIMTGFALLIV  86

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            PGL            +  +  G   A+ +S  L  G+   +    V++ +I   ++++  
Sbjct  87   PGLWVLGVTAVFVPAVMIEGAG-FGAIGRSARLTKGYRRPVVLFIVVVYIIESVVAWILG  145

Query  284  -RIPYVGEAANLAFS  297
              +   G AA L  +
Sbjct  146  DLVAIFGGAAFLLIA  160


>MBD3231235.1 hypothetical protein [Candidatus Dependentiae bacterium]
Length=235

 Score = 60.6 bits (143),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 35/209 (17%), Positives = 70/209 (33%), Gaps = 2/209 (1%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
                   + L  +  L   +      +   L         +  +  ++  + +    L  
Sbjct  26   FFAWSICFILSFVSSLLWFIPGILSLTKSNLFTGWSSIFLSIWFLLSLCFSIILIFGLWA  85

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
             ++   + ++   T           +  +       IL+IL+V GG +L +IPGL     
Sbjct  86   GFLRFCLKLHDFGTSSLKLLFKNFKIIQLLRLIGANILIILLVFGGIILFVIPGLYIAAR  145

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
                 Y++ D N+  + A++ S  L   +   +     + L+I+  +  L   I      
Sbjct  146  LILVNYLIIDKNMRIINAMKNSIRLTKNNVLVLLMITSIYLLINQIIFGLFNFISVDYSV  205

Query  292  ANLAFSLLLT--PFSFLYYYLIYSDLKAN  318
              L F  +    P SF  +  IY  L   
Sbjct  206  KGLLFLFIKIYNPLSFFAFAKIYRQLSWR  234


>WP_095133100.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Anaeromicrobium sediminis]PAB59671.1 hypothetical 
protein CCE28_08885 [Anaeromicrobium sediminis]
Length=335

 Score = 62.1 bits (147),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 19/136 (14%), Positives = 49/136 (36%), Gaps = 4/136 (3%)

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
             I + ++ +   +   +    VG          ++ +        ++ +   +++ +   
Sbjct  129  LIGVTIAGIIWFIPFLVGIGVVGFIYGKSANSIYMSNLINTGPASMVGIVISTIIFVAII  188

Query  226  LLFCVWFF----FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            +L    +     F  +    +++   +AL+KSR LV G +W IFG  +   +I   + F 
Sbjct  189  MLIIYTYITLHRFAIHTAILEDLSFFKALKKSRQLVKGEFWKIFGILMAFYIIVGGIKFS  248

Query  282  TARIPYVGEAANLAFS  297
               +           +
Sbjct  249  IYALMGEISLIGGLIT  264


>MYA18471.1 hypothetical protein [Gammaproteobacteria bacterium]
Length=246

 Score = 60.9 bits (144),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 27/169 (16%), Positives = 59/169 (35%), Gaps = 0/169 (0%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
             ++   +     F+ ++           +  +        A  LLG   +          
Sbjct  37   VMVTWPIGIFLAFAVIIQMDGVANGSTIRWHRALGQAWRRALPLLGCLGVYALAVALTFG  96

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
            T +     +           LL ++  L     SL+ ++P ++  +++     ++A + +
Sbjct  97   TVLTFAGLVLRQFILDLPDALLTVVAGLAGMAVSLVALVPLVVLFIYWCLALPLVASEGL  156

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
            G ++AL KS  LV G+WW       +   I   ++ L      +  AA 
Sbjct  157  GAIRALRKSWRLVRGNWWRTLVIVSVAGFIVFAVASLAGIAGMLLVAAC  205


>WP_167605441.1 hypothetical protein [Maribellus sp. Y2-1-60]
Length=284

 Score = 61.3 bits (145),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 26/216 (12%), Positives = 63/216 (29%), Gaps = 2/216 (1%)

Query  77   NCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPI  136
              +                S   +R   + L  +  ++    +   GI L    +     
Sbjct  1    MEQHFEFRKIRDFGTVMNDSFDFIRLEFKRLGKTILIYILPFFVFTGILLAYSQIRMYGS  60

Query  137  FSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLG  196
                    +           + ++   + Y +L              K +          
Sbjct  61   ILDGSAMDSVSSLYIRMIPSYIVM--LLNYTVLTTVLYQYINLHRTTKGNFEPEDLWPGL  118

Query  197  LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLL  256
            L        +  +   +V   S+ L++PG+   + F F   V+  + +    A+ +   +
Sbjct  119  LTVGLKMLAVYFVTFFIVVIASIFLLVPGIYLGIVFSFFPAVIIFEELSFGTAMNRCFAV  178

Query  257  VSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
            +  +WW  FG  ++  +I    S + +    +  A 
Sbjct  179  IKENWWQTFGIIIVGGLIVYIFSIIMSVPLIIATAL  214


>WP_145122374.1 hypothetical protein [Rosistilla oblonga]
Length=222

 Score = 60.6 bits (143),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 36/192 (19%), Positives = 67/192 (35%), Gaps = 11/192 (6%)

Query  135  PIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMK  194
               +  ++K     N    +            I L +      + I + +        + 
Sbjct  40   MALTVAMVKVTGDPNSPVNSVVNIATSLISQLIQLWIGIGAIRLGIAVARGQAVELGMLF  99

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR  254
             G   V  +  + IL  L +  G LLLI+PG+ F + + +C Y L D + G ++A   S 
Sbjct  100  SGGPFVLRYIGVSILFTLGMYLGLLLLIVPGIYFSLTYCWCFYFLVDRDCGVMEAFRLSG  159

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
                G+    F            L  +++ +  +G    +A +L+  P   L   + Y  
Sbjct  160  QHAKGNRLNTF-----------VLGIVSSVLNILGFLMCIAGALVTIPVGMLAMSICYLM  208

Query  315  LKANYRGPQHPP  326
            +        HP 
Sbjct  209  MTGQRYWQPHPS  220


>MBI5354740.1 hypothetical protein [Chloroflexi bacterium]
Length=300

 Score = 61.3 bits (145),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 39/236 (17%), Positives = 78/236 (33%), Gaps = 14/236 (6%)

Query  99   GLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWA  158
                                + LL + L+        I +       T  N         
Sbjct  10   MDFFDFISEGFKIFFTRVGDFSLLALGLIIPTSLLLVITANGFTSQNTKTNVIKIILLCL  69

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            IL   V   L+        +   +    + L +++ L L   G   +  +   L++ G +
Sbjct  70   ILALEVVVGLIASMSSKLIVESIVKNKPISLAKAINLALSKWGRAFVTQLFTSLIILGLT  129

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            LLL IPG+++ +++ F    +A  +  G +AL+ S+ L+ G WW IF   + + +I    
Sbjct  130  LLLFIPGVIYSIYYLFMLDAVALRDKDGREALKYSKTLIEGQWWRIFWISMGIGIIFSIF  189

Query  279  SFLTARI--------------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
            + L   +                +       F ++ T       ++ +  L     
Sbjct  190  NGLITFLLSKVSANPYFAIVPNAITLYIASIFGVVSTVLFLNIDFVYHRRLAKRKE  245


>WP_091526934.1 hypothetical protein [Microlunatus soli]SDS97780.1 hypothetical 
protein SAMN04489812_3703 [Microlunatus soli]
Length=429

 Score = 62.1 bits (147),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 28/290 (10%), Positives = 79/290 (27%), Gaps = 13/290 (4%)

Query  154  NWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV  213
                   +  +    L L  ++    +      V +   +             ++ LI +
Sbjct  153  WNGTRGFVLRILPSYLLLLAVSFVAILLYAGLSVVMIAQIIGSADTPERIYGPVVGLIGI  212

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
                 +L ++  ++     +    V+  + +GGL+AL ++  L  G+    FG  ++  +
Sbjct  213  TLAFEVLYLVGYVIIGTRLYPLIPVIVIERVGGLEALRRAWGLTKGYGLRTFGYSLVASL  272

Query  274  ISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
            I + + +  + +       + A +  ++P +                    P I    + 
Sbjct  273  IPVGVIYAVSFVSV--MIGSGALTATISPMADDPSSF-----------SPGPLIGSVLVM  319

Query  334  LTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQR  393
                I   +++   + +  +   +   +    G    Q             +    +P +
Sbjct  320  YLPLIVVSVIVMPFVGIFQTVYTIDLLRRERLGLRPGQPTRPYQPYPAAPQQPYGNQPPQ  379

Query  394  LSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELS  443
             S  +        +     G                 D       +    
Sbjct  380  QSYGNQPQPPYGNQPPQPYGQPPQQFPQYGPPGSQHPDPEQRGPQQTGSQ  429


>SEV91228.1 Uncharacterized membrane protein [[Clostridium] fimetarium]
Length=361

 Score = 61.7 bits (146),  Expect = 2e-07, Method: Composition-based stats.
 Identities = 24/156 (15%), Positives = 60/156 (38%), Gaps = 1/156 (1%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
            +   +++     +          + L     I+  +      M + I + +      +  
Sbjct  23   MALTIVVYFVLCVAAGLVAIIPIVGLIATILIVPSIELGLIMMVLKIARLESVQVSDLFS  82

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSR  254
            G   +     +  ++ L     + LLIIPG++    +    Y+LAD+  IG ++A+  S+
Sbjct  83   GFNFIFKAFAVTFMVGLFTFLWTCLLIIPGIIASYRYSMAMYILADNPEIGVMEAIALSK  142

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
             +  G+ W++F   +  +  ++  +F    +     
Sbjct  143  KMTKGYKWSLFVLQLSFIGWAILANFTFGILYLWLT  178


>NES04179.1 hypothetical protein [Okeania sp. SIO2F4]
Length=244

 Score = 60.6 bits (143),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 24/190 (13%), Positives = 63/190 (33%), Gaps = 10/190 (5%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
            +      +            ++ + L    + + +L     +L   +  + W      V 
Sbjct  1    MQYQQINVTNLLDKTFATFRVIYLPLLIIGLPAVILFVLLYFLPESSTQYYWLYWFLVVL  60

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS-------  218
                  +     ++ Y+    + L ++ + G +   S  +L ++ +  +           
Sbjct  61   INPWLFAIRYLYIYKYLRGIRISLVQAFRRGFKKFISLAILSLITLDPLILIKDKLRSNI  120

Query  219  ---LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
               +L  I  + F     F + ++  +     +A+ +S  L  G++W I    ++  V+ 
Sbjct  121  GLSILFYISLIYFATRIGFYEQMVLIEGTSPFRAITRSWQLTKGYFWVIIRANLIFFVVF  180

Query  276  LTLSFLTARI  285
                FL   I
Sbjct  181  CFPMFLFDTI  190


>KPV49372.1 hypothetical protein SE17_33025 [Kouleothrix aurantiaca]
Length=283

 Score = 60.9 bits (144),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 26/148 (18%), Positives = 51/148 (34%), Gaps = 17/148 (11%)

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
            M   L  + +         ++      L+I+P       + F    +  +  G + AL++
Sbjct  118  MLPMLACIFALAFNTRNTGVLTVVAVALMIVPVAFLLTRYVFVTQAVVLEESGPVDALKR  177

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTL-----SFLTARIPYV------------GEAANLA  295
            S  L  G +W I    V + ++S  L       L+  + +             G   +  
Sbjct  178  SWQLTRGSFWRIAAVVVAVSLLSALLTRIPVMLLSWTVVFFGLSDLLVAAQLGGMVISQV  237

Query  296  FSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
              +L  P     Y L+Y D++  + G  
Sbjct  238  GLMLAMPIQLAIYTLLYYDVRVRHEGYD  265


>MBC8229532.1 hypothetical protein [bacterium]
Length=273

 Score = 60.9 bits (144),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 30/223 (13%), Positives = 68/223 (30%), Gaps = 24/223 (11%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYIL----LGLSWMTGSMFI  180
              + I         +    P     P    +   I++     +     +  +  T  +  
Sbjct  39   VPIVIFCILILSTISSAPMPKAGEIPTEAVYMVPIMMLIFFLMYSFSTVVAAAGTIVISE  98

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV--------GGGSLLLIIPGLLFCVWF  232
                 ++ +  + +     +      +IL  +++            +  II  +   VWF
Sbjct  99   SFLGREIKIVDAYRKVRNRIFPLLGAIILTSVIIGLATTLGMFLCVIPGIIGWVYLSVWF  158

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT----------  282
             F   ++  +  GG+ A++ SR L+             ++   L                
Sbjct  159  GFIAQIVMLEGEGGMGAMKCSRTLIKEDSRKSLIVIGSVMAAILIAWIFLVAGNVMATQF  218

Query  283  --ARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                +  VG   +++  +L+ P  F    L+Y DL+    G  
Sbjct  219  NIVILSPVGNLFSISALILIEPIRFTATTLLYYDLRIRREGFD  261


>KPQ20317.1 Protein of unknown function (DUF3426)/zinc-ribbon domain, partial 
[Porphyrobacter sp. HL-46]
Length=72

 Score = 55.9 bits (131),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 12/42 (29%), Positives = 20/42 (48%), Gaps = 1/42 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
           M  + CP CG     P S + ++  + RC +C  +   +P E
Sbjct  17  MI-IACPACGTRYAVPDSAIGSEGRTVRCAKCKHSWFQEPPE  57


>PQM43996.1 hypothetical protein C1Y40_05845 [Mycobacterium talmoniae]
Length=153

 Score = 58.6 bits (138),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 17/139 (12%), Positives = 38/139 (27%), Gaps = 21/139 (15%)

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
                   F   ++  + +    A  +S  LV   +W + G  +L  +++  ++ +     
Sbjct  10   YLFTMLSFTSPLIVLERLPIFAAARRSFALVRNSFWRVLGILLLAGLVAYLVAGVVGSPF  69

Query  287  ---------------------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
                                  +    +    ++  PFS     L+Y+D +         
Sbjct  70   SIAGALLTVSSGSTTPAPLAYVLSTVGSTIGQIITAPFSAGVVVLLYTDRRIRAEAFDLV  129

Query  326  PIKRQWLPLTAAIFGWMLI  344
                     TA      L 
Sbjct  130  LQTGAAGHPTATDSTDGLW  148


>CUP52935.1 Protein of uncharacterised function (DUF975) [Roseburia hominis]
Length=161

 Score = 58.6 bits (138),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 16/112 (14%), Positives = 39/112 (35%), Gaps = 6/112 (5%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
             L  L  +++           +   +  F   Y+  +  I G +AL++S  L+ GH   +
Sbjct  30   GLTALSGILLFVLLFAFAFVMIWINLRMFAMTYLHQEYGIKGFEALKESFTLMKGHCTDL  89

Query  265  FGRFVLLLVISLTLSFLTARI------PYVGEAANLAFSLLLTPFSFLYYYL  310
                +  +   L  + ++  I        +        S  +  +++   Y+
Sbjct  90   LLINLSFIGWILLCAIISIMIGGVFGDNGIAGLLGSILSAAIAAYTYQPAYI  141


>MSR46955.1 hypothetical protein [Planctomycetes bacterium]
Length=274

 Score = 60.9 bits (144),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 24/184 (13%), Positives = 64/184 (35%), Gaps = 28/184 (15%)

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG--  225
             +    +    F  + +    L  ++  G+R      +  ++   +V    + + +    
Sbjct  82   GVLSGMVIYVAFKRLMREPAALGTAISTGMRRFVPLLVTGLITAALVVLPLVAMFVLSQG  141

Query  226  ---------------------LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
                                 ++    F      +  +++G L AL++S  L   + + I
Sbjct  142  ANSDLKTSLLLMVGVVAGIASMIVTAMFAAASGAIIIESLGPLAALKRSLQLTKDYRFPI  201

Query  265  FGRFVLLLVISLTLSFLTARIPYVG-----EAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
            F    L+ +I+  ++ +   I  +      +   +  ++ +TP +     ++Y DL+A  
Sbjct  202  FWCSFLVGIIAGIVTGIFTLIGTLALLSLPQLMPILIAVAVTPLTSTLSAVVYHDLRATK  261

Query  320  RGPQ  323
             G  
Sbjct  262  EGVD  265


>MBC7328698.1 hypothetical protein [bacterium]
Length=549

 Score = 62.1 bits (147),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 42/156 (27%), Positives = 75/156 (48%), Gaps = 4/156 (3%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W+++ ++    LGI  + I      +F    L+ +++      +    +L A    + L 
Sbjct  34   WQVYRKKFRSSLGIMAVPIGFYIFYLFLFYFLQFSSFRYSFFYSLLLFLLGAGYLVLSLC  93

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
                    F    K D+G+  + + G + + S+  +  + + +V GG +LLI+PG LF +
Sbjct  94   SIPALLYSF----KEDIGVKNAYRRGWQILPSYIWVTFIYLAIVIGGLILLIVPGFLFSI  149

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG  266
            WF    +VL  +   G  ALE+SR LV GH+W+ F 
Sbjct  150  WFCLYIFVLIYEGRKGFSALERSRRLVKGHFWSTFW  185


>NBX42790.1 thioredoxin [Rhodobacteraceae bacterium]
Length=61

 Score = 55.5 bits (130),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 16/36 (44%), Gaps = 0/36 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
            + CP+CGA      + +PA+    +C  C  T   
Sbjct  2   RLTCPNCGARYEIDDALIPAEGRDVQCSNCSTTWFH  37


>HHK85603.1 hypothetical protein [Candidatus Buchananbacteria bacterium]
Length=288

 Score = 60.9 bits (144),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 35/207 (17%), Positives = 85/207 (41%), Gaps = 6/207 (3%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
                 +                +   +      +++       + ++   +  +   +S 
Sbjct  72   MKWIVYIPFDQLEFHKNDNIVSLLGFVFSISENFISNFPLMITFGLVFVILELVAYLISL  131

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +   + +      +     +     ++   ++ LI   L+   G +LLIIP ++F V+F 
Sbjct  132  IGQIVVLKNADKKIDFQLIVNQIPHYLSRVSIFLIFYYLIFLLGLILLIIPAIIFIVFFN  191

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG----  289
            F ++V+  ++    +A ++S+ LV  +WW +F R+V  ++ SL +      +  +     
Sbjct  192  FGKFVIIFEDTSVKEAFKRSKELVKSYWWPVFKRYVFWIIFSLLIGISMLFLRIISNSDY  251

Query  290  --EAANLAFSLLLTPFSFLYYYLIYSD  314
                 +   SL+LTP S +Y+++IY +
Sbjct  252  LTAVFSNLISLILTPLSIIYFFVIYEN  278


>KKR16226.1 hypothetical protein UT44_C0017G0018 [Candidatus Levybacteria 
bacterium GW2011_GWA1_39_32]KKR49741.1 hypothetical protein 
UT87_C0026G0003 [Candidatus Levybacteria bacterium GW2011_GWC1_40_19]KKR72849.1 
hypothetical protein UU15_C0025G0018 [Candidatus 
Levybacteria bacterium GW2011_GWC2_40_7]KKR95132.1 
hypothetical protein UU45_C0004G0035 [Candidatus Levybacteria 
bacterium GW2011_GWA2_41_15]OGH27720.1 hypothetical protein 
A3D82_04665 [Candidatus Levybacteria bacterium RIFCSPHIGHO2_02_FULL_40_29]OGH32247.1 
hypothetical protein A3E70_01985 
[Candidatus Levybacteria bacterium RIFCSPHIGHO2_12_FULL_40_44]OGH50919.1 
hypothetical protein A3J18_02045 [Candidatus Levybacteria 
bacterium RIFCSPLOWO2_02_FULL_40_18]OGH52293.1 hypothetical 
protein A3H20_00280 [Candidatus Levybacteria bacterium 
RIFCSPLOWO2_12_FULL_41_12]OGH54350.1 hypothetical protein 
A2596_01325 [Candidatus Levybacteria bacterium RIFOXYD1_FULL_40_21]OGH57482.1 
hypothetical protein A2186_03380 [Candidatus 
Levybacteria bacterium RIFOXYA1_FULL_41_10]OGH69884.1 
hypothetical protein A2396_05075 [Candidatus Levybacteria 
bacterium RIFOXYB1_FULL_40_17]HBB76946.1 hypothetical protein 
[Candidatus Levybacteria bacterium]
Length=199

 Score = 59.4 bits (140),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 25/176 (14%), Positives = 56/176 (32%), Gaps = 0/176 (0%)

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
               +           +L                 +    + + +        + + +   
Sbjct  19   FFFLFGLLVIFALLSVLNSTVQAALVGIGVLVFFVGLAFSLVQVLFELGFLKIMLKLIAG  78

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG  245
                +R + L       F L  +++ L++ GG +LLI+PG+   +   F    +AD  + 
Sbjct  79   TKPTYRDLYLHYPQFIDFLLAGLIMGLLIAGGFILLILPGIYLAIRLQFTPLFVADKGLK  138

Query  246  GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
             + A++ S  L  G W  +F   +L + I++           V         +   
Sbjct  139  PVDAVKGSWELTKGKWMKVFVFDLLTVGINILGLLALVVGLLVTIPMTSLAYVYFY  194


>CRH61016.1 Membrane domain of glycerophosphoryl diester phosphodiesterase 
[Chlamydia trachomatis]CRH91599.1 Membrane domain of glycerophosphoryl 
diester phosphodiesterase [Chlamydia trachomatis]
Length=383

 Score = 61.7 bits (146),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 32/204 (16%), Positives = 67/204 (33%), Gaps = 26/204 (13%)

Query  148  LNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLL  207
             N         I L+ V   L  L          +    V  + + +   + +  F L  
Sbjct  187  FNRFKSRLLAMIGLSIVWVFLFALIASVTFGIFLLVGLGVADYVTYRSQTQTMLVFLLFG  246

Query  208  ILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
             +  +      +          V F F Q V A +N+   +AL++S  L  G  +   GR
Sbjct  247  SVGAITTFLVCVFFQ-------VRFAFAQTVCAVENLSPRKALKRSWTLTQGVAFKTLGR  299

Query  268  FVLLLVISLTLSFLTARIPY-------------------VGEAANLAFSLLLTPFSFLYY  308
             +L+ ++   +  + + +                     +   A+    +L+ P S +Y 
Sbjct  300  SILIAMVMGAVGGVFSGVLSTFTAFTISTDPQMYLTAIPLAAVASTIVQMLVMPLSQVYI  359

Query  309  YLIYSDLKANYRGPQHPPIKRQWL  332
             L+Y D +       +  +++   
Sbjct  360  ALMYVDERIRKENYAYTLMEQIHH  383


>WP_193485678.1 hypothetical protein [Anaerotignum lactatifermentans]MBE5076579.1 
hypothetical protein [Anaerotignum lactatifermentans]
Length=277

 Score = 60.6 bits (143),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 29/158 (18%), Positives = 60/158 (38%), Gaps = 1/158 (1%)

Query  132  AFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFR  191
              +    AL  + +  L  Q       ++  TV    + +  +  +    +        +
Sbjct  64   QVSLSGMALSPQESINLIQQMMTNNVLLMAVTVFLEPIFIIGVAKAAKWRLEGRRFSASK  123

Query  192  SMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALE  251
            +    +        + I+ +++   G+LL+I P +   V +    Y +A      + AL 
Sbjct  124  AFVEAMSLEPVVVKVGIVYMILFLLGTLLVI-PAIYLGVVWCLYLYCIALGGRRSVDALG  182

Query  252  KSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
             SR+LV   WW  FG  ++L  IS   + + + I  + 
Sbjct  183  HSRVLVRSRWWRTFGFLIILAAISYCWNSVLSLIFMLF  220


>HDH90286.1 zinc ribbon domain-containing protein [Candidatus Bathyarchaeota 
archaeon]
Length=285

 Score = 60.6 bits (143),  Expect = 3e-07, Method: Composition-based stats.
 Identities = 48/230 (21%), Positives = 91/230 (40%), Gaps = 9/230 (4%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
              +       L I L+  +L+        +     +L        +  L  T+ +  + +
Sbjct  16   RRYFSFVLPFLVISLVEWLLSIFFRGFTPVCPVHPFLIGFVILGSFGSLFLTLFFDAISI  75

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
              +T          +    +S+   L  V S  + +I++ ++V  G ++LIIPG++F VW
Sbjct  76   GIVTQLAADEFLGKETSFKKSVNSALDVVVSLIVGIIIVSVIVIIGLIILIIPGIIFLVW  135

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE-  290
            F     V+  +  G  +A+ KS+ L  G+W+ +FG  +L+ +I L  S +   I  +   
Sbjct  136  FCLTPIVIVLEKRGATEAMSKSKELTKGNWFHVFGVLILIAIILLVASTIGNVISSLFAP  195

Query  291  --------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWL  332
                            L+ P S +   ++Y DL A  R     P+     
Sbjct  196  VLPKPLSLLIEKIILSLINPVSGVILTVLYFDLLARKRTTVVSPLPPAPY  245


>MYC93824.1 hypothetical protein [Caldilineaceae bacterium SB0661_bin_32]
Length=289

 Score = 60.6 bits (143),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 24/180 (13%), Positives = 53/180 (29%), Gaps = 17/180 (9%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +   +     +  +        D          +  +      ++L  + +    +L 
Sbjct  107  VVIVLAIPIWVGLVKTDIALGEILDAFSGPPGPADIEAIMKVLDDVLLGGIGLCLSGVLA  166

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            +I        +   +  L  +  G L +L +S  L  G+     G  +LL +    ++ L
Sbjct  167  VIVLSYLSARWMAAEVALMVEETGPLDSLARSWNLSRGYILRTVGYLLLLAIPLGIVAGL  226

Query  282  TARI-----------------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
               +                      AA++  +++ TPF      L Y DL+        
Sbjct  227  FGVLIDFVVFPMIPAIDESSRAGFSSAASMLLTIITTPFYVSAIVLYYFDLRVRKEKYSF  286


>MBI5794293.1 hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=249

 Score = 60.2 bits (142),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 38/181 (21%), Positives = 66/181 (36%), Gaps = 1/181 (1%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
               +      L+G          AL L      +           L+ +   L     +T
Sbjct  13   WSLYRKHLWSLVGYSAWMLLPVGALFLLTFAPDHWLVFAVAMLCTLSEIFLALWMTIAIT  72

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
             S+     K +V         L  + +     +L  LVV GG LL I+PG++F  W+ F 
Sbjct  73   HSVNRLSQKQEVNHIAISHDALIRLPTLLKTAVLQGLVVIGGLLLFIVPGVIFAFWYAFA  132

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL-LLVISLTLSFLTARIPYVGEAANL  294
            Q     D    ++AL  S+ LV G +W +  R +   + ++L  S +   +  +  +   
Sbjct  133  QLSTILDGKRPVEALTASKELVKGRFWTVAWRLIAGPIFLALVYSTVIGLVFMLVASLTG  192

Query  295  A  295
             
Sbjct  193  V  193


>MBE9605143.1 hypothetical protein [Acetobacteraceae bacterium H6797]
Length=249

 Score = 60.2 bits (142),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 28/159 (18%), Positives = 56/159 (35%), Gaps = 12/159 (8%)

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
            V         M       +    +GL+ +++  L       +L +++ +  G G  LL++
Sbjct  76   VLVPCFIGVVMLRLTIASLEGERMGLWLALRQSLMRALPLAVLAMIIAVFCGVGFALLLV  135

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            PG++    +      L  +    + AL +S  L  GH W +F  ++L  V       +T 
Sbjct  136  PGIMLWCRWAIAWPALTAEGGSPVTALRRSATLTKGHRWDVFFVYLLQAVFFFMAGLVTV  195

Query  284  R------------IPYVGEAANLAFSLLLTPFSFLYYYL  310
                           ++   A    SL+L     +   +
Sbjct  196  LGITVLLAETYDRWGFLVWMAAAVVSLVLGYVGVIMAIV  234


>MBM65180.1 hypothetical protein [Myxococcales bacterium]
Length=211

 Score = 59.4 bits (140),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 25/159 (16%), Positives = 54/159 (34%), Gaps = 11/159 (7%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
            A +            +M   +   D      +  G    G   ++ ++   VV  G +L 
Sbjct  36   AVMIIAGPVRGGYDLAMLRILRGDDSVDIGDVFAGFERFGKLLIVYLVYGFVVFFGIVLC  95

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I PG+   +  +    V  + +   +  ++++  L   ++W + G           LS L
Sbjct  96   IFPGIYLAIALYPSFLVAMESDDPPIDCMKRAYALTRPYFWQLLG-----------LSIL  144

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
            +  +  +G  A     L+  P   L +   Y ++     
Sbjct  145  SFGVTVLGLLACCVGLLVAGPVVQLAWMAAYDEMTMAQG  183


>PSP67089.1 hypothetical protein BRC85_07690, partial [Halobacteriales archaeon 
QS_1_69_70]
Length=135

 Score = 57.5 bits (135),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 25/132 (19%), Positives = 47/132 (36%), Gaps = 4/132 (3%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            +  GS+ L++PGL   V F+F +  +A ++   + A+  S  L  G    +F   +L++V
Sbjct  3    IIVGSVFLLVPGLFAAVAFYFFRQEVALEDRTLIDAMAASWRLTRGDRLNVFALGLLVVV  62

Query  274  IS---LTLSFLTARIPYV-GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
            +S     +S L   +  + G         +L  F        Y  L       +      
Sbjct  63   VSQLDAVVSLLVGEVSVLAGTIVAAVLGGVLAAFGAAVVTRAYLQLTGPEAADEPAEPAD  122

Query  330  QWLPLTAAIFGW  341
             +          
Sbjct  123  PYDAALGPEDIP  134


>KAA0205949.1 hypothetical protein EDM68_03710 [Candidatus Uhrbacteria bacterium]
Length=254

 Score = 60.2 bits (142),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 0/96 (0%)

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
            H   F  +L+L  + V GG ++  +PG+   V   +    L +D    L AL+ S  LV 
Sbjct  102  HFFPFLWVLVLQSVAVFGGLMIFFLPGIWLSVLLGYSLPSLVEDGTRSLDALKASADLVK  161

Query  259  GHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
            G WWA+FGR +++ ++   L+ LT  +  V     +
Sbjct  162  GRWWAVFGRNLVVGIVVGLLASLTTLLVLVIVGLFI  197


>MBE6287268.1 hypothetical protein [Mediterranea massiliensis]
Length=251

 Score = 60.2 bits (142),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 24/184 (13%), Positives = 61/184 (33%), Gaps = 1/184 (1%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            + +  I +L    +       L    A        +W    ++ T       +      +
Sbjct  48   YYVPYIEMLATYGSGITEEEWLATMFADDDVWSWFSWLIVAVIFTFFANNYLVVVGCRML  107

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
               +    V +   +K        +    I+  ++V  G++  +IPG+   V   F   +
Sbjct  108  NAAVRGNKVDMTLELKDARHTFWFYLGAYIVYSVIVMAGTIFCVIPGIFLGVRLMFVPMI  167

Query  239  LADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
             ++   I   +A  +S  +  GH+W +FG  ++++++++           +         
Sbjct  168  ASNHPEIPFSEAFSRSWKMTEGHFWRLFGLGIVMILLNIVGLIFCCIGYLLTIVITSLAY  227

Query  298  LLLT  301
                
Sbjct  228  ACAY  231


>MBE6816950.1 DUF975 family protein [Ruminococcaceae bacterium]
Length=395

 Score = 61.3 bits (145),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 37/288 (13%), Positives = 83/288 (29%), Gaps = 17/288 (6%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH-----VGSFTLLLILLILVVGG  216
                 +      + G     +   ++     +    +           LL ++  +++G 
Sbjct  119  IAALVLGPLKIMLAGLYSQLVRGNNMSFGDGLGFVFKKTFDKDYIQKFLLNLVQAILLGL  178

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
              +  IIPG++F   ++F  Y++A++  +   QA+  S+ + + H   +F   +  +   
Sbjct  179  LYICFIIPGIIFNYKWYFTAYIMAENPELTFEQAMNTSKKMTNNHKGELFVLDLSFIPWY  238

Query  276  LTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLT  335
            L        IP +G       S+ LTP+    + L Y + KA                  
Sbjct  239  LL------MIPTIG-----LVSIYLTPYVSTTHALYYENFKARALQEGAITQYDFMSSAQ  287

Query  336  AAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLS  395
                     P +   +  ++N        A +         PQ   +             
Sbjct  288  QIAAQAQTAPQMNTYAQPQENTYYTPAPQAPQQDTYYTPVAPQPAAEQYYQPAAPVAPQP  347

Query  396  SADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELS  443
            +A+       +                 A+    +   P    +    
Sbjct  348  AAEPVDEPIAEPTAEPVSEPVAETFESPAEAPTEEYYTPTEPPQSTEE  395


>MBE6292719.1 tetratricopeptide repeat protein [Bacteroidales bacterium]
Length=612

 Score = 62.1 bits (147),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 50/378 (13%), Positives = 106/378 (28%), Gaps = 16/378 (4%)

Query  134  APIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM  193
               F+          + +  +++  +     A+  + ++ +       +    +      
Sbjct  67   FVAFAYFAYYVTNQGDVRAVSFKDMLKSLGSAFGRVFVASLLPLGLAVLF--VLVFLIYF  124

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
             + +  VG     + +  +V+    LL+ +  +      ++  Y  +  + G   ALE S
Sbjct  125  FVMILIVGGGFDNVSIAFIVIVLIPLLVFVLYVTPIFSIYYIHYYFSCKSKGYWDALEDS  184

Query  254  RLLVSGHWWAIFGRFVLLLVISLTLSFLTARI---------PYVGEAANLAFSLLLTPFS  304
              +V GHWW+  G  VL  ++ L +                 ++G       S       
Sbjct  185  FRMVRGHWWSTMGFIVLFEILQLIVLLPVILFLQEKTTFISGWIGNLLIFIVSFFTIHVV  244

Query  305  FLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLS  364
             LY Y     LK      +   I    + +T  +   + I G          L     + 
Sbjct  245  ALYQYGHLKALKEENVEAEQKKIITVKMVITTVVILLVCIVGACNAEKINSILPNLASMF  304

Query  365  AGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFA  424
                 Q  +G +      + +   E  +    A  K   + Q      G      + +  
Sbjct  305  GNSSAQNEIGRKYLNGDGVEQDYEEAVKWFLKAANKGNAAAQYNL---GNCYREGLGVEQ  361

Query  425  DRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFH  484
            D   +         +       NL +       +E+D                 E  A H
Sbjct  362  DSLESFKWINKAAAQGLPIALNNLGVCYMSGYGVEVD--KGKARELYIQAAEKGEPMAMH  419

Query  485  WVGINQTDENDLFSGIRS  502
             +       N +    + 
Sbjct  420  NLAQIYMSGNGVEKDEKE  437


>TAK89232.1 hypothetical protein EPO04_04015 [Patescibacteria group bacterium]
Length=268

 Score = 60.2 bits (142),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 35/171 (20%), Positives = 69/171 (40%), Gaps = 0/171 (0%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            ++ ++++ +    A I +A ++                +LL    Y L   +++   M  
Sbjct  41   IVSLFVIALFFVIAVIVAAGMVYSGLSGGAALFGGLLLLLLLFSLYALYIQNFLLVIMLA  100

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                  + +    K  +R +    ++ +L  L V GG +L IIPG +F  WF        
Sbjct  101  AAQGQRLSIGEGSKRAIRLIPKLFVVGLLYALAVIGGFILFIIPGFIFMAWFALASLAAI  160

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
             +++G + AL++SR LV  H    +G   L   I   +   +  + Y+   
Sbjct  161  HEDLGPVAALKRSRQLVRDHLIETWGLLGLQSTILGIVMLASMPVRYLQLV  211


>MBJ7597377.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Candidatus Dormibacteraeota bacterium]MBJ7611687.1 
glycerophosphoryl diester phosphodiesterase membrane 
domain-containing protein [Candidatus Dormibacteraeota bacterium]
Length=378

 Score = 61.3 bits (145),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 33/180 (18%), Positives = 65/180 (36%), Gaps = 18/180 (10%)

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
             +  +  +      +T +         V L   ++  LR   +   L +L +L       
Sbjct  187  YIVAILLVPFVEGAITLAAMEVALGRPVTLASCLRGVLRRYWALLGLALLGLL--LFPLF  244

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL--------  271
            L     +   V +      L  + +G ++A+++S  L   HWW +FG  V++        
Sbjct  245  LCFPVAIWILVRWSVSVPALLAEGVGPVRAIQRSWELTRAHWWRLFGILVVVVLIQLVMN  304

Query  272  LVISLTLSFLTARIPYVGEA--------ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
             ++      L A IP+V            +   + ++TP  +L   L+Y DL+       
Sbjct  305  SIVGAAALPLAAVIPFVSPLVRGSISLTVSTIGTAVITPVLYLCLVLLYFDLRIRKESFD  364


>NUN52367.1 hypothetical protein [Planctomycetaceae bacterium]
Length=276

 Score = 60.2 bits (142),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 34/235 (14%), Positives = 67/235 (29%), Gaps = 21/235 (9%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
             A+S  L+ R     + + L             +L             +           
Sbjct  30   FAESLSLWGRNVVPFVLLTLAVYSPLVLYTLYVVLSGTENLTPKSADAYDKIHQWGGGLL  89

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS--------  218
              L  + +   +F  +      L      GL  V       IL+ +              
Sbjct  90   NSLVSAAIIFGVFERLRGKSPPLGEMFARGLGRVIPALWTGILVAVATMVPMAPGFLVAA  149

Query  219  -----------LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
                       L+  +  L+     +        +   G  AL +S  L  G    IF  
Sbjct  150  AGGFAFGAVLVLVGSVLSLVVSTALYVSVAAAVVEKCNGFPALRRSLDLTRGAKARIFCI  209

Query  268  FVLLLVISLTLSFLTARIPYVGEAANLA--FSLLLTPFSFLYYYLIYSDLKANYR  320
             +L+ +++         I   G    L   F++++  F+ ++  +IY  L+A+  
Sbjct  210  MLLVGIVAFLFGKGVEFIGSPGARLLLIQGFAVVMASFTAVFAAVIYYRLRADRE  264


>EFB62692.1 SPFH/Band 7/PHB domain protein [Lactobacillus gasseri 224-1]
Length=583

 Score = 61.7 bits (146),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 25/215 (12%), Positives = 65/215 (30%), Gaps = 8/215 (4%)

Query  81   CNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSAL  140
               ++  +           +          W +      G++ ++L  I           
Sbjct  306  MYLTWKGKQNENQPQRIKKIACKQLKGNWLWVIGLLIIPGIINLFLEYITNYVWTGSLNT  365

Query  141  LLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGL--R  198
                            + I+ A     ++       ++          +F++M       
Sbjct  366  NNLSLISWWQGLGWSIFTIITAL-IATMIAWGVQYATLAFRDTGKKPNVFKAMFSSFTNG  424

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD-----DNIGGLQALEKS  253
            +     L  +L  L      LLLI+PG++    +    Y++ D       +   +A+ +S
Sbjct  425  YFFKTFLTSLLTTLFTFLWGLLLIVPGIIKSFSYAMTPYIMKDMIDSKHEMTATEAISES  484

Query  254  RLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
            R ++ G+   +F  ++   +    +      I ++
Sbjct  485  RKIMKGNKTTLFIIWLTFNIWYFIIGLAGIAIAFL  519


>WP_163843582.1 hypothetical protein [Nocardia cyriacigeorgica]NEW32687.1 hypothetical 
protein [Nocardia cyriacigeorgica]
Length=861

 Score = 62.1 bits (147),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 43/413 (10%), Positives = 90/413 (22%), Gaps = 44/413 (11%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM----TG  176
            L  + ++ I  A     S      A  L           +   +   +   + +      
Sbjct  256  LFLLGMISIGAALISGISMEGENFAAGLFATLAALVLLFVSLLIVIGVPADAAINAVTIV  315

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL---------  227
            +  + I    V       L    + +   L+ +   V+    LL     L+         
Sbjct  316  TADMAIRGEPVRFAAVFALVRPRLFALCRLMAVFYAVIVLPGLLGPYIVLIAVGLPAAIP  375

Query  228  -----------FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI--  274
                             F   V+  +  G +++  +S  LV      + G  +L +    
Sbjct  376  VLVLIVAAGFVVGTLCAFAPIVMTLEGAGVVESFRRSVALVRPSLARVLGLELLWVAACV  435

Query  275  ---------SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
                     +L  +        +    +L               L+Y+DL+         
Sbjct  436  VAFAATAYPALAFASAVPGGEVIVYPVSLVIIAASAVLIRTLQALLYTDLRIREGSYDPT  495

Query  326  PIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNR  385
            P      P                +       +A   ++               +     
Sbjct  496  PQPPPGPPTPPEGTAGDDRAPGADIPGRLAGTAAPMRVAG---------ESAAVSAQDAA  546

Query  386  SLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDF  445
            S P  P          + +    T + G          A+       +P    +L     
Sbjct  547  SDPPVPMAAMEVPGSDVATPTAPTDAFGQPFTPATGEGAESAGNRPPSPPTRPQLTKPAT  606

Query  446  PNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFS  498
                +       +         A      Q + E  A     +          
Sbjct  607  GAAPVVPTTVPSMRDGSFAQPGAHQPPHGQTTAEPTAIAGRNVGDLHGGAEAP  659


>PYK40849.1 hypothetical protein DME60_06390 [Verrucomicrobia bacterium]
Length=139

 Score = 57.5 bits (135),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 24/131 (18%), Positives = 49/131 (37%), Gaps = 0/131 (0%)

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            +      M +     ++     +   L    ++ L L +  L V  G +LLI+PG+   +
Sbjct  1    MIVGLHLMALKSVDGEIPRMGDLFGSLERGPAYLLALGIYCLAVTVGLVLLIVPGIYLAI  60

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
             +     V+ D +   L AL K+ +L  G+W  +   F    ++++    L      +  
Sbjct  61   RYCLFAQVITDTSASALAALRKAAVLAHGNWAPLGALFFTAFLLNIAGMALLGMGLIISF  120

Query  291  AANLAFSLLLT  301
              +L       
Sbjct  121  PVSLLAIAGFY  131


>HGX27654.1 hypothetical protein [Candidatus Woesearchaeota archaeon]
Length=212

 Score = 59.4 bits (140),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 32/151 (21%), Positives = 58/151 (38%), Gaps = 8/151 (5%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
                      +S+    + + + K ++     +  GL       +  +   ++VG G LL
Sbjct  67   FFLAILAAFLMSFAYACVVLKVNKPNLKTSEMLVQGLFKSFKLFIATLAYSVLVGIGFLL  126

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            LI+PG+   V            N  G   + +S  L  G +W IF  ++L+ ++S     
Sbjct  127  LIVPGIYLLVRLGLFAPAAVLGNGFG---ISESWNLTKGKFWDIFVLYLLIFLMS-----  178

Query  281  LTARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
            +   IP VG+         +T  +    YLI
Sbjct  179  IIGAIPIVGQIIATLLISPVTVTALTKAYLI  209


>NER49523.1 hypothetical protein [Symploca sp. SIO1A3]
Length=176

 Score = 58.6 bits (138),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 35/164 (21%), Positives = 58/164 (35%), Gaps = 20/164 (12%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
               A+  L +  +  +      +       SM  G R         I+  L+V  G + L
Sbjct  7    IEFAFGPLYVGAILDAASRLKQELGTTYGESMAQGARRSFKLLGTRIVTGLIVLLGFIAL  66

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV---ISLTL  278
            +IPG++  + F     V+  + + G  A   S  L  G  W I G  +L  +   I++ L
Sbjct  67   VIPGIILSLRFALVDPVVVLEGVAGKDARNLSAELTQGKRWQILGTMILTFIGVAIAIIL  126

Query  279  SFLTARIPY-----------------VGEAANLAFSLLLTPFSF  305
            S L   +P                  +G    L   L+L  F +
Sbjct  127  SSLVLYVPLSLVGQDENFVIAVINECIGNIVVLVPILVLFLFYW  170


>MAZ38870.1 hypothetical protein [Legionellales bacterium]
Length=272

 Score = 60.2 bits (142),  Expect = 4e-07, Method: Composition-based stats.
 Identities = 27/213 (13%), Positives = 66/213 (31%), Gaps = 15/213 (7%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             + I ++      + I         +    Q               +      +      
Sbjct  32   FVLIAVIVSAGLSSKIIDDHNFFFNSLSLYQFHWMHIFRNFIIALVVCWCYVGIFVQYHS  91

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             + +   G+ ++    ++H     + + L   ++  G ++ I+PG+   V       ++A
Sbjct  92   VLQQQKTGVKQTAIHAIKHFFPLFITMFLYFSMLSFGLVVFILPGIFIGVACTLAIAIVA  151

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL---------------TARI  285
             +    ++AL++S  LV  +WW      V   ++ L + +L                  I
Sbjct  152  TETKNPIKALKRSYQLVVPNWWRALVLPVAPFILLLLIGYLSNTFAKFLFIHGMNNLTMI  211

Query  286  PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
              +    +       T + F    ++  D K  
Sbjct  212  LSIRMIFSAVLGFFFTTWFFSLKVIVLHDFKLR  244


>HEY94520.1 hypothetical protein [Dehalococcoidia bacterium]
Length=271

 Score = 60.2 bits (142),  Expect = 5e-07, Method: Composition-based stats.
 Identities = 33/228 (14%), Positives = 81/228 (36%), Gaps = 3/228 (1%)

Query  98   SGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQW  157
                 +   + ++        W       +          +A  +    +          
Sbjct  33   FWKNIVIVAITNAIIWGAYWLWAKSLFNSITDFSMTTVSLTATSIVIVIFSWILGVLMNG  92

Query  158  AILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLL--ILLILVVG  215
             ++  T    +   + +  ++   + +    +  S+ L L  +  + ++   +L+  +  
Sbjct  93   VLIRLTSTIYVDTENRIGFTIVASLKRLVPTIIASLILALIIIAVYFIVSLFMLVPFINI  152

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
               L+ II G +  V   F  + +  +  G ++A  +S  LV   W   FG  +++ +IS
Sbjct  153  VALLVFIIVGTIIGVRLAFIFHAIFIEGAGPVKAFHRSAELVKDEWRHTFGYLLVIGIIS  212

Query  276  LTLSFLTARIPYVGEAAN-LAFSLLLTPFSFLYYYLIYSDLKANYRGP  322
            + +S     I +    A  +  ++ +TP   +   L+Y DL+A     
Sbjct  213  IIISICIYFILFFNSYAGEIIATIFVTPIQIIGTTLLYWDLRARNERC  260


>OGL66433.1 hypothetical protein A2856_01940 [Candidatus Uhrbacteria bacterium 
RIFCSPHIGHO2_01_FULL_63_20]
Length=247

 Score = 59.8 bits (141),  Expect = 5e-07, Method: Composition-based stats.
 Identities = 31/153 (20%), Positives = 58/153 (38%), Gaps = 6/153 (4%)

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
                    +         + +  L+      ++L+L    +  G  +   PG+   +   
Sbjct  94   YAAVGRAALATHPASFMDTGRAALKDALPMFVVLLLTFFAIMLGFAVFFFPGVYVLIAVA  153

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG----  289
            F   V   +   GL AL +SR LV G+WWAIFGR++ ++ + +    + +          
Sbjct  154  FVVQVALAEGKRGLSALRRSRDLVRGYWWAIFGRYLAIVALIVIARSILSTAFGAAGQDA  213

Query  290  --EAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                 +L   L+  PF      ++Y +L A   
Sbjct  214  SDWFESLFDILVFFPFWICLDTVLYRELVAIKG  246


>NJK90035.1 hypothetical protein [Myxococcales bacterium]
Length=69

 Score = 55.2 bits (129),  Expect = 5e-07, Method: Composition-based stats.
 Identities = 11/54 (20%), Positives = 16/54 (30%), Gaps = 0/54 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP  56
            V CP C    +   +++P    + RCP C                  N    P
Sbjct  2   KVSCPSCAKRYSVDDARIPPTGVTIRCPNCQHQFEARHGFDDPAPACPNCGHMP  55


>PYQ05917.1 hypothetical protein DMF82_07395 [Acidobacteria bacterium]
Length=305

 Score = 60.6 bits (143),  Expect = 5e-07, Method: Composition-based stats.
 Identities = 33/203 (16%), Positives = 67/203 (33%), Gaps = 11/203 (5%)

Query  102  SISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILL  161
               +     +           G+  L   L F  I SA    P+     +     W +++
Sbjct  28   QWRRHFRRIYPAVAIPLALAAGLVPLSQWLFFRTITSAGRSGPSAGAMIRGMVGFWLVMM  87

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +A   LG   + GS    + +  + +    +  LR   +    L L  +V   G +  
Sbjct  88   LYLAVYWLGYLVLFGSAVDALAQRPLAMGHEWRRVLR--PAVFGTLALSTVVTVLGFMAC  145

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA---------IFGRFVLLL  272
            I+PG+   + F     V+ DD + G  A+ +S  L   +            +F    +  
Sbjct  146  ILPGIYLGLLFSLTLPVIIDDGLLGTAAMRRSAELTRYNPQRALDADPRFKVFVIVFVGT  205

Query  273  VISLTLSFLTARIPYVGEAANLA  295
            ++   +  +      V +   + 
Sbjct  206  LLGYVIGTVVQLPMVVVQQVMML  228


>WP_188939660.1 hypothetical protein [Nakamurella endophytica]GGL86042.1 hypothetical 
protein GCM10011594_02150 [Nakamurella endophytica]
Length=293

 Score = 60.2 bits (142),  Expect = 5e-07, Method: Composition-based stats.
 Identities = 24/170 (14%), Positives = 61/170 (36%), Gaps = 11/170 (6%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
               A +    + +   +   +     G    ++             +L  L+V  G + L
Sbjct  111  VVGAALNFFAAGVATPLAAEMALRPQGARVGLRRLRTRWPVLLGAGVLAALLVMVGMVFL  170

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            ++PGL+          V   + +   + L ++  L +G  W +FG  +L  ++   ++ +
Sbjct  171  LVPGLILLAALQPVGAVAVMEGLSVRETLIRAARLTAGRRWRVFGVSLLGSLVVAPVALV  230

Query  282  TARI-----------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
               +             + + A++  S +   ++ +   L+Y DL+    
Sbjct  231  GVALAQRLADHATTAFVLQQVASVLISTVTVSWTAMIGALLYVDLRMRTE  280


>RME80321.1 hypothetical protein D6785_10490 [Planctomycetes bacterium]
Length=198

 Score = 58.6 bits (138),  Expect = 5e-07, Method: Composition-based stats.
 Identities = 29/148 (20%), Positives = 55/148 (37%), Gaps = 0/148 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +    L  S +       I    V +  S+  G+R +    +   L ++    G + L
Sbjct  14   LYLLGWGLTWSSIVYFSLQQIHGKKVSIMESLNKGIRKLIFVGITFFLFLVFTFLGLMAL  73

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIPG++     +    V+  +++G   +  +S  L  G   +I    +LL + +     L
Sbjct  74   IIPGIVIWCGLYLSIPVVILEDLGPFASFSRSWKLTYGFKRSILNAVILLGLAAGAFFIL  133

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYY  309
               +  V  A      L +  F  LY +
Sbjct  134  LFLLGGVILALFHGGPLGIAIFVVLYLF  161


>MBI3305221.1 hypothetical protein [Candidatus Parcubacteria bacterium]
Length=338

 Score = 60.6 bits (143),  Expect = 6e-07, Method: Composition-based stats.
 Identities = 40/249 (16%), Positives = 83/249 (33%), Gaps = 2/249 (1%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F +       ++ +   +        ++              Q A+ LA +   +L    
Sbjct  59   FTQTRRSYGYLFRIFWPVFALLFVLQVVPSFLPGGASAAVVLQGAVWLAALVLTMLLPIA  118

Query  174  MTGSMFIYICKTD-VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
            M  ++           L    +        F  + ++  LVV GG + L++PGL+  +  
Sbjct  119  MILALDQRERANPTPNLVGLWREARSKFWGFVWVGMVSTLVVMGGFVALVVPGLVLSIQN  178

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
            FF  Y+   +   GL A+E SR +V G  W +FGR  L+ ++++ +  +   I  +    
Sbjct  179  FFASYIYVLEGRRGLAAVETSRQMVRGLGWPVFGRLFLMGLMAMAVFLVVVVITVLLAIP  238

Query  293  NLA-FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVS  351
                 +    P+      L      A         +    +      + +  +  L    
Sbjct  239  AFIHITPGSAPYPSPEPKLTPMQEAAVNGLQGLANLFLFPVFAAFPYWLYREVRRLRGAP  298

Query  352  LSRQNLSAE  360
              +Q   + 
Sbjct  299  EVQQPSQST  307


>PSP25025.1 hypothetical protein BRC55_05155 [Cyanobacteria bacterium SW_8_48_13]
Length=175

 Score = 58.2 bits (137),  Expect = 6e-07, Method: Composition-based stats.
 Identities = 39/174 (22%), Positives = 66/174 (38%), Gaps = 21/174 (12%)

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            M    + Y+    + +  + +          L   LL+L+      LLIIPG+   V   
Sbjct  1    MIYYTYQYLSGNRISIGSAFRRARSQTRRLILGYFLLLLISISAPYLLIIPGIYLSVRLG  60

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE---  290
            F  + +  DN+   + +  S  LV GHWW+ F   +++L++SL           +G    
Sbjct  61   FVLHGITVDNLSFTRGIRHSWNLVKGHWWSTFIAMIVVLIVSLIFLTPIYVGILIGATLI  120

Query  291  ------------------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPP  326
                                 +   L++ PF  +YY L+Y  L+AN       P
Sbjct  121  PDSANSQVISIFTFTGVSVVFVFIYLVIAPFMTVYYTLLYKFLRANKNKSNQLP  174


>WP_185343483.1 DUF975 family protein [Listeria rocourtiae]MBC1436061.1 DUF975 
family protein [Listeria rocourtiae]
Length=350

 Score = 60.6 bits (143),  Expect = 6e-07, Method: Composition-based stats.
 Identities = 39/325 (12%), Positives = 95/325 (29%), Gaps = 3/325 (1%)

Query  132  AFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFR  191
             +      ++L           N+   +                   F+ I + +     
Sbjct  18   NWGLAIGGMVLAYVILGAVSMINFIPFLGTIGYYIFAGPFMIGVTWFFLAITRQEKPDIG  77

Query  192  SMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQAL  250
             +  G +  G   L  +L+ +     SLL +IPG++    +     +L DD NI  L A+
Sbjct  78   YLFSGFKTFGRNLLAAVLVFIFTFLWSLLFLIPGIIKSFSYSQTFMILRDDPNISPLDAI  137

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
             +SR  ++GH   +FG  +  + +    + +   +  +  A    F           YY 
Sbjct  138  TESRHRMNGHKGRLFGLILSYIWVFFIPAIIMMVLAIIAAATTAIFEYGYDASYSYDYYA  197

Query  311  IYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQ  370
              +D+           +    +          + P +L+      +       +   D+ 
Sbjct  198  SSTDMSPV--FLLVFILGYFLMLAIMFGLAIWIYPQVLVSCAVFYDDLYASTGTFSDDMD  255

Query  371  QRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWAD  430
                T      D +             + +    + +   ++  ++        +     
Sbjct  256  TTDYTTTIPEDDTSFGAIPPVDSTPPEEPEAPHFENKTWGADSMVNPESPVTSPESEAEP  315

Query  431  DQNPHLWLKLELSDFPNLSLAQKGS  455
            +Q       +  +  P+     + +
Sbjct  316  EQPTQQNDTISEAVRPSEPADSEET  340


>WP_158539300.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Renibacterium salmoninarum]
Length=183

 Score = 58.2 bits (137),  Expect = 6e-07, Method: Composition-based stats.
 Identities = 21/179 (12%), Positives = 44/179 (25%), Gaps = 28/179 (16%)

Query  208  ILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
             + I ++    L +    +            +  +N+    A+ +S  L +G++W  FG 
Sbjct  5    WVTIPIMFLIMLGVAALAIWIGTKLILAPASIVVENLSVFAAIARSWKLTNGNFWRTFGT  64

Query  268  FVLLLVISLTLSFLTARIP----------------------------YVGEAANLAFSLL  299
            ++L   I   +S + +                                +    +     +
Sbjct  65   YLLSQFIVGAISGIVSVPVALVIGILSRLINPNPNPRDAIAILIVTQVISYLISALIGAV  124

Query  300  LTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLS  358
               F      LIY DL+    G     +         A                     
Sbjct  125  TLAFQTGVLALIYVDLRIRKEGFDLELLASLETTPDGAGLEIPGSRQYQPPPGRGWVSQ  183


>MBI5090966.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Candidatus Hydrogenedentes bacterium]
Length=273

 Score = 59.8 bits (141),  Expect = 6e-07, Method: Composition-based stats.
 Identities = 19/127 (15%), Positives = 45/127 (35%), Gaps = 3/127 (2%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
                  + +     ++        V +  + +  L+ +       +L    +  G+ L I
Sbjct  61   LSLIAAMSMGAFYFALSRLAQDEPVTVSSAFRAALQRLWPLFWTPLLSDAAIILGAFLCI  120

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL---VISLTLS  279
            +PG+   + F     V+  + + G  AL +S  L   + + I     L +   +  L L+
Sbjct  121  LPGIYLSLRFSLVLPVVMLEGLSGRSALSRSWRLTEEYLFPILILGALSIPCELARLALA  180

Query  280  FLTARIP  286
               + + 
Sbjct  181  GAFSGVN  187


>WP_014682444.1 hypothetical protein [Solitalea canadensis]AFD09222.1 hypothetical 
protein Solca_4232 [Solitalea canadensis DSM 3403]
Length=296

 Score = 60.2 bits (142),  Expect = 6e-07, Method: Composition-based stats.
 Identities = 21/153 (14%), Positives = 53/153 (35%), Gaps = 0/153 (0%)

Query  140  LLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH  199
            +              +   + L     ++    +     ++      +      +   + 
Sbjct  70   IGNYNIFRDLFNPNYFLAILFLIISFILVFQSVYGYLVAYLEDKNRPITTDMVWEHVKQK  129

Query  200  VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
                   +I+L + +   SL+ +IPG+   V F     ++A + I   + + +   L+ G
Sbjct  130  FFFSFFAIIVLGITMMLASLVFLIPGIYLAVAFQLALIIIAMEKIDVFELISRCIYLIRG  189

Query  260  HWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
             WW+ FG   ++ +I   +  +     Y+   A
Sbjct  190  KWWSTFGLIFIISMIQGMMGVVFNVPLYIITFA  222


>MAZ30238.1 hypothetical protein [bacterium]
Length=315

 Score = 60.2 bits (142),  Expect = 6e-07, Method: Composition-based stats.
 Identities = 46/277 (17%), Positives = 91/277 (33%), Gaps = 15/277 (5%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              + +  + L    +   L +  A            A L+A +  I + +    G  +  
Sbjct  38   WYVLISLVPLVLVALPVGLGVWAAASAPQLLWAAVVAGLVALLVMIWVMVVASAGLFYAV  97

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
              +  +      +           + +LL+LV+    + L +P  +  V+  F       
Sbjct  98   AQEERIRFIEGWRWARPRFWQIAWIGVLLMLVLTTAFMALFLPAFIIMVYTMFYFLSFLR  157

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS--------FLTARIPYV-----  288
             +  GL AL  S  +V G +W + GR   +++I + +S         L   IP       
Sbjct  158  HDHRGLNALAASTNVVYGRFWGVLGRLAFMMLIIIVVSLAITVIFEILAGIIPVAENALV  217

Query  289  --GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPG  346
              GE  N   S +LT     Y  L+Y+   A     Q  P+   +       +   ++  
Sbjct  218  ATGELINTVVSFVLTIVMMRYLVLLYTAADATTPAYQAEPLSTSYKLYRVLAWVGGVVFV  277

Query  347  LLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDL  383
             + +  +    +      A    +  L    ++  D 
Sbjct  278  AMSLGGAVLLGTWASEGFALPADEAELEQWFEELEDA  314


>WP_165202354.1 hypothetical protein [Roseimicrobium sp. ORNL1]QIF01386.1 hypothetical 
protein G5S37_07585 [Roseimicrobium sp. ORNL1]
Length=242

 Score = 59.4 bits (140),  Expect = 6e-07, Method: Composition-based stats.
 Identities = 21/182 (12%), Positives = 62/182 (34%), Gaps = 1/182 (1%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
              +       L  + + + ++F       ++            +          + ++  
Sbjct  14   RTWELFYRHWLLFFAIHLTVSFPVNALVGIISTRVNPAENLLQFFLFSGAIGGLFGMVAP  73

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
            + +  ++            ++  + +  +GS  L+   + +    G  + I PG L    
Sbjct  74   AAIFAALSRAWSGATPTYGQTWFVTMDRIGSVMLVNFCVFIFSSAGFTMCIFPGFLVLAA  133

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF-VLLLVISLTLSFLTARIPYVGE  290
              F    + D++     AL++S +LV   +W +FG   ++   +   ++F+   +  +  
Sbjct  134  LAFSLCAVMDEDHDAWVALDRSVILVRSRFWVVFGYLALVFAPLYFGIAFVLLILSAIIA  193

Query  291  AA  292
              
Sbjct  194  LV  195


>MBA3567853.1 hypothetical protein [Pyrinomonadaceae bacterium]
Length=320

 Score = 60.2 bits (142),  Expect = 7e-07, Method: Composition-based stats.
 Identities = 35/231 (15%), Positives = 72/231 (31%), Gaps = 27/231 (12%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            +       L + L+  +         ++L       P+  +     +   +A + L    
Sbjct  45   YSEYFPKFLKLSLIAHIPVIITTALMVVLMLIEKAQPRGLSANRITVFVALAIVGLLQIV  104

Query  174  MTGSMFIYICK--------------TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
                    I                  V L     L  +    F    I + L V  G +
Sbjct  105  AYFFAASAISGMTAVIVTQLSAAPLRPVELRTPFALLKKRWKPFLKTSIRVTLRVVIGMI  164

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            LL+IPG++  V +     V+  + +    A+ ++R L S  W  +    +L  +I   +S
Sbjct  165  LLVIPGIIMSVRYALYAPVVLLEGLEKKAAMRRARELASRSWRTVIIITILQFLIPTMVS  224

Query  280  FLTARIPY-------------VGEAANLAFSLLLTPFSFLYYYLIYSDLKA  317
                R                + E  +   ++ + P   +   L+Y  ++ 
Sbjct  225  AFFGRFTVGRKQPGAGVTMRTIYEQFSGLVNIFIMPLMSIVPALLYLKMRQ  275


>HDS28937.1 hypothetical protein [Candidatus Acetothermia bacterium]
Length=175

 Score = 57.9 bits (136),  Expect = 7e-07, Method: Composition-based stats.
 Identities = 34/165 (21%), Positives = 63/165 (38%), Gaps = 5/165 (3%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
            G       L        ++L     ++    +    +    +A +L  L      + + +
Sbjct  10   GFDAFARRLPLLLGVWTVVLIVQQTVSLLVPDQWLWLEALLLALLLPPLHAGQYRVALRV  69

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
             + +   F S   G+R         +L+ ++   G   LI+PG+L  + F F    L D+
Sbjct  70   VRGERCTFSSFVEGIRRWKDALPAYLLIGVLTALGLFALIVPGILVALAFSFTLLCLLDE  129

Query  243  -----NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
                  +  L+A+ +S  L  G+   +FG  +LL V    LS L 
Sbjct  130  EARGRRLSALEAMRESLQLTRGYRGVVFGMGLLLAVPYFLLSLLI  174


>OGG13937.1 hypothetical protein A3D77_03470 [Candidatus Gottesmanbacteria 
bacterium RIFCSPHIGHO2_02_FULL_39_11]
Length=216

 Score = 58.6 bits (138),  Expect = 7e-07, Method: Composition-based stats.
 Identities = 34/188 (18%), Positives = 73/188 (39%), Gaps = 3/188 (2%)

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
             +   L    I + +    ++WL          + +   A+ +   + +   + +     
Sbjct  26   NIVTALFLGLISAGVSFIFSSWLKDARAGGLSLLYIFRSAFTVSVSAILILYLHLKEQNK  85

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLIL-VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
               L  ++          T++  L ++ ++ GG +  IIPG+LF +WF    Y +  +N 
Sbjct  86   KADLRDAVYRFFSKYFLVTMVTALALVAIIIGGFIFFIIPGILFAIWFSQTYYFILLENK  145

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
                AL  S+ L  G   AI   F++  +IS  +S +  ++           ++    F 
Sbjct  146  SPADALRASKRLTEGRRIAISVIFLIQGIISAVISAVVGKLFPDAAV--GITAIFPGTFF  203

Query  305  FLYYYLIY  312
            +   Y++Y
Sbjct  204  WFVDYVLY  211


>WP_150120661.1 zinc-ribbon domain-containing protein, partial [Sulfitobacter 
sp. HI0040]
Length=184

 Score = 58.2 bits (137),  Expect = 7e-07, Method: Composition-based stats.
 Identities = 15/77 (19%), Positives = 25/77 (32%), Gaps = 1/77 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP+C A+   P   +PA     +C  C QT      +    +        P    
Sbjct  1   MRLI-CPNCDAQYEVPDEVIPASGRDVQCSNCGQTWFQHHPDHAPDEAETEARQEPDPDE  59

Query  61  QRRIPSDRLEIQSKTVN  77
           +   PS+   +      
Sbjct  60  EVVQPSEAAPMPQPQPQ  76


>HBS52247.1 hypothetical protein [Coxiellaceae bacterium]
Length=197

 Score = 58.2 bits (137),  Expect = 7e-07, Method: Composition-based stats.
 Identities = 29/179 (16%), Positives = 64/179 (36%), Gaps = 1/179 (1%)

Query  134  APIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM  193
                S    +              +  + T+  ++   + +     +     +V L  S+
Sbjct  1    MAFLSDKQCQFKANEITTANVICVSAHVTTLLLMIYLGALIFYKTHVISEGQNVTLRGSL  60

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
               L+     T+ L++++     G +  ++PG+   V F   Q ++  DN G   AL+ S
Sbjct  61   LFVLKKYFKITVALMIVLFAGAIGVIAFVLPGVFLSVLFIMVQPLILFDNHGVFSALKGS  120

Query  254  RLLVSGHWWAIFGRFVLLLVISLTLSFLTAR-IPYVGEAANLAFSLLLTPFSFLYYYLI  311
              LV G+WW  F     ++ ++          +        +   +L+  F +   Y+ 
Sbjct  121  CKLVWGNWWRTFVVVYPVIFLNFLTGLGIQFSLTRGYWYVIMGGMMLVLTFFYPLLYVF  179


>WP_144301311.1 hypothetical protein [Desulfovibrio indonesiensis]TVM19850.1 
hypothetical protein DPQ33_01050 [Desulfovibrio indonesiensis]
Length=308

 Score = 59.8 bits (141),  Expect = 7e-07, Method: Composition-based stats.
 Identities = 35/252 (14%), Positives = 81/252 (32%), Gaps = 2/252 (1%)

Query  81   CNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSAL  140
             +   CL            L  +    +  WE            Y+  +++ +     A 
Sbjct  47   MSPYICLYFAAILSGVWPVLELMVMPTSVDWEQVHPYIVRYYYGYVCIVIMWYIIGTFAG  106

Query  141  LLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHV  200
                    N   +          VA         T   +  +      + + +  G+R +
Sbjct  107  GALTFLIANNYLERPVSEKRALRVAVSKWWPLTKTMVCYGPLFLVLAYIQKVVDFGIRSL  166

Query  201  GSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGH  260
                    +   VV G  +LL++    F V +F    ++  +++ G +A ++S +L+ G 
Sbjct  167  IFSFGNNTISYAVVSGVGILLMVLFGFFAVRYFLIFELVILEDLSGRKAFKRSAMLMKGA  226

Query  261  WWAIFGRFVLLLVISLTLSFL--TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
            +   F        +S           IP +    N A S+L+  +  +   ++Y   ++ 
Sbjct  227  YGKGFFLLFFHTALSFVAESACKFVSIPLLQLVINFAASILMIIYFVVAGNILYFANRSR  286

Query  319  YRGPQHPPIKRQ  330
            +       + ++
Sbjct  287  HENFDIEILTQR  298


>HAN31510.1 hypothetical protein [Myxococcales bacterium]
Length=170

 Score = 57.9 bits (136),  Expect = 7e-07, Method: Composition-based stats.
 Identities = 17/143 (12%), Positives = 45/143 (31%), Gaps = 0/143 (0%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
              +  ++      +    ++                 G   V    +L +L   ++  G 
Sbjct  11   FFVIKLSIAGPLRAGYDMALLRITKGDQSVEVGDFFAGFHKVVPLAILGLLHGSIITAGM  70

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            L L++PG++  +  + C  +  +  +  + AL+ +  L  GH  A+F   +      +  
Sbjct  71   LCLVVPGVVLALGLWPCYLLAMEQELSPIDALKAAWRLTQGHKLALFWLVIASFGAIVVG  130

Query  279  SFLTARIPYVGEAANLAFSLLLT  301
                    ++         +   
Sbjct  131  LLALVVGVFIAAPVVQLAWINAY  153


>OQB68089.1 hypothetical protein BWX91_01341 [Spirochaetes bacterium ADurb.Bin133]
Length=291

 Score = 59.8 bits (141),  Expect = 8e-07, Method: Composition-based stats.
 Identities = 35/209 (17%), Positives = 66/209 (32%), Gaps = 1/209 (0%)

Query  103  ISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLA  162
              +     + +F    + ++G+  L     +    +                       +
Sbjct  23   NFKNYLKIFAVFYVTLFLIIGLIALKAYFIYNNAKNLEYSLFFNENLYILIILGVIGTFS  82

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             +   + G    T                S         S      L  LVV  G+LL I
Sbjct  83   AIILSIFGAGVTTDLFSKSFLGEYWDFRSSFLYVKNKFWSIFASSFLSALVVLCGALLFI  142

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            I      V        L ++++ G  ++++S  L    +W++FG F+L  + + TL F+ 
Sbjct  143  IGMYPAAVLVSCVIPALINEDLTGSGSIKRSFELTKNKFWSLFGSFILFSLFTSTLGFVY  202

Query  283  ARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
                 +   A  A  L    FS  +  +I
Sbjct  203  QFFVSIFTFAAAAP-LKFIAFSGTHISII  230


>OGO58413.1 hypothetical protein A2V85_06000, partial [Chloroflexi bacterium 
RBG_16_72_14]
Length=290

 Score = 59.8 bits (141),  Expect = 8e-07, Method: Composition-based stats.
 Identities = 32/228 (14%), Positives = 70/228 (31%), Gaps = 32/228 (14%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG------  215
                + LL +  +T ++         G+  +   G+R +  +  + IL  LV+       
Sbjct  63   VGTLFSLLSVLAVTAAVDELWQGRAAGVGDAFARGIRALPRYLGVAILFGLVIFVLVAIP  122

Query  216  --------------------GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
                                 G+L+L+   L           V+  D  G   ++ ++  
Sbjct  123  VALILAAPATASGGLVVLFGLGALVLLPVALWVGARLTLLVPVIVLDPAGVTGSIRRAWE  182

Query  256  LVSGHWWAIFGRFVLLLVI----SLTLSFLTARIP--YVGEAANLAFSLLLTPFSFLYYY  309
            L  GH   +F   + + +I        S   A +P   +G  A    +L+  P S +   
Sbjct  183  LSRGHALMLFALSIAIGLIGALPLWGASLFAAFVPNAIIGGIALAISTLVYQPLSLIAIT  242

Query  310  LIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNL  357
            + + D             + +     A +   +     ++ +      
Sbjct  243  IAWGDRIGGRHADSEAMARGRGRGTGALLVFGLSAILFVIGAGVAAQY  290


>WP_081629009.1 PQQ-binding-like beta-propeller repeat protein [Smaragdicoccus 
niigatensis]
Length=1254

 Score = 61.3 bits (145),  Expect = 8e-07, Method: Composition-based stats.
 Identities = 34/379 (9%), Positives = 92/379 (24%), Gaps = 38/379 (10%)

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS---------  218
            +L        +   +      L    +           + ++  +V              
Sbjct  285  VLINGATIAGVEDALHNRRARLGEVFRTARSRFFPLLRMHLVFYVVAFVPDAVIAWVLLR  344

Query  219  -----------LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
                       +++        V        L  +  G + +L++S  L    +W +   
Sbjct  345  GFGPDGMYVSAIIMTPLMFALGVLLGLSPLALVIEGRGVVDSLKRSVELTRRAFWRVVAL  404

Query  268  FVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY--------  319
             +L + +     +LT+    +  A+  +FSLLL P    Y+ +++   +A          
Sbjct  405  HLLWVALVFGGLWLTSVPIVIVAASLPSFSLLLFPVVMAYFAVVFPLFRAMQAALFSDLS  464

Query  320  ------RGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRL  373
                        P + +           + +  +               +     +    
Sbjct  465  VREGGAPPTASIPAEARSARFDLRPHHVVRVLAVCTAIPVIFGPDRLPWIVLLGVLVGAD  524

Query  374  GTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQN  433
                      + +       L  A +    +   +  ++      P T           +
Sbjct  525  FLLRATDRQWSGNTALLLSDLGFARFDTSAATAPEEVAQPVSPWEPATASERELEPSVLD  584

Query  434  PHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDE  493
                  +     P+     +   R+         A      +   +   +  +    +  
Sbjct  585  SAPEAAVVDHTLPDEDTKPRNKPRVFAATENVPPA----PARDPVQTDPWGSMPPITSPP  640

Query  494  NDLFSGIRSIYLRQGTQAE  512
              + S  RS      T+ +
Sbjct  641  GAVGSDQRSWTAPAATREQ  659


>WP_193642973.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Dermacoccus barathri]MBE7370871.1 glycerophosphoryl 
diester phosphodiesterase membrane domain-containing 
protein [Dermacoccus barathri]
Length=448

 Score = 60.6 bits (143),  Expect = 8e-07, Method: Composition-based stats.
 Identities = 23/164 (14%), Positives = 54/164 (33%), Gaps = 26/164 (16%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS--  218
                   +   + +       +      +  S+++G+R + S   + +L++L++      
Sbjct  176  PIAGLAAMASAAGLVHVFAQAVMGRRATVSASLRVGVRRMWSMLGVGLLMVLIMVAVMVP  235

Query  219  ------------------------LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR  254
                                    +  I   +   +   F  +++  +  G   AL +S 
Sbjct  236  LILAIVLAASQDNGGPIFLTVLVGIATICAVVWVGIRLCFAGHIVVMEKAGPTTALRRSW  295

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
             L  G +W   G  +L  +I   +S++   I  +     L  SL
Sbjct  296  QLTKGRFWRTLGITILAQLIVSAISWVVQFIVSLIMLIFLGVSL  339


>HBF66094.1 hypothetical protein [Clostridium sp.]
Length=224

 Score = 58.6 bits (138),  Expect = 8e-07, Method: Composition-based stats.
 Identities = 21/131 (16%), Positives = 50/131 (38%), Gaps = 0/131 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +    +G+  +       IC   + +   +   +  + S  +  +    ++    +L 
Sbjct  92   MYLFLEPIGVISIAKIAKKSICGDPIHMQEIIGESMNCLWSVIITAVPYFFLIFIAGMLF  151

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIPG+   + + F  Y +      G +ALE S  L  G +W      + + +I+L   ++
Sbjct  152  IIPGVYLGIIWTFYVYAIGLRQKRGWKALEYSMKLTKGRFWQTVLVILTIFLIALGWDWI  211

Query  282  TARIPYVGEAA  292
               +  + +  
Sbjct  212  FGTVXXLLQVL  222


>MBA2760881.1 hypothetical protein [Segetibacter sp.]
Length=250

 Score = 59.0 bits (139),  Expect = 8e-07, Method: Composition-based stats.
 Identities = 23/157 (15%), Positives = 50/157 (32%), Gaps = 3/157 (2%)

Query  145  ATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT  204
            + +       +    L       +  +      ++  + K    +        ++     
Sbjct  31   SPFWMFNGTYFVVVALAWINFIAMSAVISCYMKLYDRLQKQAPTIQEVWDEFKKYFLKVL  90

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
            L  + + L+   G    +IPG+   V F      +  ++     A  +   L+  ++W  
Sbjct  91   LYSVPIALLTAAGFAFCLIPGIYLAVVFVPFSIAVIVEDETFGGAWNRCFALIKNNFWNS  150

Query  265  FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            FG ++L+ +I    SF    I  V        S   T
Sbjct  151  FGTYLLVYIIY---SFSAGIISGVFALLTGLASYFTT  184


>WP_013314060.1 hypothetical protein [Spirochaeta thermophila]ADN02219.1 hypothetical 
protein STHERM_c12780 [Spirochaeta thermophila DSM 
6192]
Length=341

 Score = 60.2 bits (142),  Expect = 8e-07, Method: Composition-based stats.
 Identities = 29/179 (16%), Positives = 68/179 (38%), Gaps = 3/179 (2%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            L ++ L ++L        +       ++    N   A+     A   L L W+  +  + 
Sbjct  85   LWVWELVLMLVQWVGTLVVGFLAWDHVHRGEGNLPGALEGVVRALGRLLLQWLVVAGLVL  144

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +    +G+  ++   L  +    +  +L+ +++    +   +  + F V   F  Y +  
Sbjct  145  LSLIGLGVVGAVGGFLASLLDGRVGSVLITVLIVLLVVAWYVFLVWFLVAVSFAAYAVIF  204

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI---PYVGEAANLAFS  297
            ++ G    + +S  LVSG WW +FG  +L  ++   +  L + +   P      +    
Sbjct  205  EDRGAWAGIRRSLELVSGSWWRVFGYLLLFWLLVGFMGLLLSFLVQSPLSLSLFSRILL  263


>WP_115373442.1 stage II sporulation protein M [Adhaeribacter pallidiroseus]RDC64264.1 
hypothetical protein AHMF7616_02877 [Adhaeribacter 
pallidiroseus]
Length=610

 Score = 60.9 bits (144),  Expect = 8e-07, Method: Composition-based stats.
 Identities = 27/145 (19%), Positives = 64/145 (44%), Gaps = 0/145 (0%)

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
             + N+   +  A V+++LL L   +  +     +  V          ++          +
Sbjct  397  SSFNYLIGVFCAYVSFLLLSLVLYSYILEYMDNQGQVLPGTVWYRVKQNFMRVFFSSFGI  456

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
             L+   GS+LLI+PG+   V   F   V+  +++G ++ +E+   L+ G+WW+  G  ++
Sbjct  457  FLLWALGSVLLIVPGIYLAVALSFYLMVMLREDLGFVETVERCLYLIKGNWWSTLGMLLI  516

Query  271  LLVISLTLSFLTARIPYVGEAANLA  295
            + +I   ++ +     +V +   + 
Sbjct  517  ISLIQALMALVLGFPVWVLQIMQVL  541


>WP_110520752.1 hypothetical protein [Bacillus lacisalsi]PYZ96813.1 hypothetical 
protein CR205_14125 [Bacillus lacisalsi]
Length=231

 Score = 58.6 bits (138),  Expect = 8e-07, Method: Composition-based stats.
 Identities = 37/211 (18%), Positives = 74/211 (35%), Gaps = 2/211 (1%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
              +      +L   L  ++L +A             +    +     I       +   L
Sbjct  19   RSYWGSVLKILPAVLPPVLLIYALFVVGYNSFLTPGMGETAEATVLTISSLLFLSLNGFL  78

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
                             L    +    H G      IL  +++  G+++ I+P L+  + 
Sbjct  79   FAFIYGADKSNSGFAGKLKEGARTVGAHFGPLLSTSILFFVLIVAGTIMFIVPALIMIIL  138

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE-  290
            F+   +V+ ++N    +AL+ S  LV   W  + G  +L  V  L  + + + +      
Sbjct  139  FYLYPFVVIEENKKNFEALKGSAALVKKGWLRVMGWIILFYVSFLFAATVVSELAPAYGE  198

Query  291  -AANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
             A  +    ++ PF  L +Y +Y  LKA  +
Sbjct  199  QAGEIIGGAVVLPFEALLFYFVYKKLKAKQQ  229


>MBD3366060.1 hypothetical protein [candidate division WWE3 bacterium]
Length=353

 Score = 60.2 bits (142),  Expect = 8e-07, Method: Composition-based stats.
 Identities = 33/143 (23%), Positives = 67/143 (47%), Gaps = 0/143 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              + +I    ++MT        +  V +   +    + +  F LL I+   ++G G +LL
Sbjct  158  LGLGWIFFMATFMTLLPIKIHSQNLVPIKELIGEAFKKLFKFALLTIISAFIIGIGYILL  217

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I+PG++F +WF F  Y+L  +++G  +AL+KS+ +  G    +FG+ +   V+ +     
Sbjct  218  IVPGVIFTLWFLFAPYILLTEDVGVFEALKKSKQMTKGFRGNLFGKGLGYTVLMMLAMIP  277

Query  282  TARIPYVGEAANLAFSLLLTPFS  304
               + Y  +        + TP +
Sbjct  278  LLFVMYATQGLLTPIISVFTPLA  300


>WP_152891139.1 hypothetical protein [Clostridium tarantellae]MPQ44560.1 hypothetical 
protein [Clostridium tarantellae]
Length=276

 Score = 59.4 bits (140),  Expect = 9e-07, Method: Composition-based stats.
 Identities = 32/208 (15%), Positives = 75/208 (36%), Gaps = 26/208 (13%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
            + S          +         I +      +LG   +   +   +       +  +K 
Sbjct  60   VLSNFSNNIKYLFSNSLIFLWIPISILLCLIPILGNFAIVKLINSKLKDEKTIWYECIKY  119

Query  196  GLRH----------------VGSFTLLLILLILVVG-------GGSLLLIIPGLLFCVWF  232
                                + +FT++L + I+ +           + ++    +  ++ 
Sbjct  120  AFSKTGSALLLILITLVISTIWTFTIVLFITIVCIFTFFRGLPIAIITILALTAILFIFL  179

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF---LTARIPYVG  289
             F    + ++++G L A++ S  LV   +W  FG+  LL ++    +    + A IP+VG
Sbjct  180  QFIIQSIVNEDLGVLAAIKHSVNLVKIRFWNAFGKIALLKLVCFLFNAGLTMFAIIPFVG  239

Query  290  EAANLAFSLLLTPFSFLYYYLIYSDLKA  317
               +   +L LT F  +   +I++D + 
Sbjct  240  IILSTLITLCLTIFQHIAVTIIFNDYEQ  267


>HDH01909.1 hypothetical protein [Nitrospirae bacterium]
Length=180

 Score = 57.9 bits (136),  Expect = 9e-07, Method: Composition-based stats.
 Identities = 21/110 (19%), Positives = 40/110 (36%), Gaps = 0/110 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                +       +   M     K    L          + S      +L +++  G +L 
Sbjct  71   VGWIFYSFAQGMVASMMVEVEDKGQTSLGSGFGRANEMIVSLMAAGFILGVLLLLGFMLF  130

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            +IPGL+    F F   V+  +  G + A+++S  +V  +    F   +LL
Sbjct  131  VIPGLIVAFIFSFTFIVIMLEKRGPVDAMKRSVQIVRANASYTFKLLLLL  180


>NLE89106.1 FHA domain-containing protein [Myxococcales bacterium]
Length=459

 Score = 60.6 bits (143),  Expect = 9e-07, Method: Composition-based stats.
 Identities = 35/190 (18%), Positives = 65/190 (34%), Gaps = 14/190 (7%)

Query  145  ATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT  204
              W+          + L  +A   +    M            +   ++    L+   S  
Sbjct  258  TGWIPIVGWILGVLVALVELAMAPIAAGAMWRWSLAAASGERLTWKQAWGAALKSPVSEW  317

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
            L + ++  V+  GS+ L+IPGLL      F       +      A  +S  LV       
Sbjct  318  LNIAVMSFVIAVGSMFLLIPGLLVG---MFAGPAYLLEKKTFFGANMRSLELVLKDPGRH  374

Query  265  FGRFVL-------LLVISLTLSFLTARIPYVGEAANLAFSLL----LTPFSFLYYYLIYS  313
             G  +L       +++++   + +   +P++G AA    S+     L PF  L +  IY 
Sbjct  375  LGLALLVLVTVIPVMIVTTIATVILGFVPFIGAAAAQLISVTVLTLLVPFVSLLWASIYF  434

Query  314  DLKANYRGPQ  323
            D +       
Sbjct  435  DARQRAERED  444


>HIP40010.1 hypothetical protein [Desulfocapsa sulfexigens]
Length=541

 Score = 60.6 bits (143),  Expect = 9e-07, Method: Composition-based stats.
 Identities = 46/302 (15%), Positives = 86/302 (28%), Gaps = 13/302 (4%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
             C  CG + +     L    S   C  C   ++    + +        A           
Sbjct  206  VCSVCGDKFHPD--FLQEIDSKLYCGICQPEVVETVLDGEVIAAGTGTAVAATAAAVATT  263

Query  65   PSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI  124
             S+                                 +      A  W         + GI
Sbjct  264  DSEDDSEDVAVEQVDEEEVPETFTDFTVGELIKEAWQKTKGTKASIWGGVLFMYLVIFGI  323

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
                          AL              +   + + T     + ++ +      +  +
Sbjct  324  --------SFAGVFALQGMVGQMAPNSAIGFNVGLQVVTSWLSSMFMAGLMLIGVRHAKE  375

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
              +     +  G     S T+ L L +++V  G +LL++PG+   V +     ++ +  +
Sbjct  376  QRISWKM-VFAGFSRALSITIALTLQLILVVIGFVLLVLPGIYLSVGYALALPLILEKGL  434

Query  245  GGLQALEKSRLLVSGHWWAIFG--RFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            G  +ALE SR  +   WW +FG    +LLL +   +      I  V     L   L +  
Sbjct  435  GPWEALETSRKAIHKKWWTVFGLYLVMLLLYMVSIVPLGLGLIWTVPMLFVLVGVLYVRF  494

Query  303  FS  304
            F 
Sbjct  495  FG  496


>PYK18012.1 hypothetical protein DME55_08090 [Verrucomicrobia bacterium]
Length=197

 Score = 57.9 bits (136),  Expect = 9e-07, Method: Composition-based stats.
 Identities = 28/189 (15%), Positives = 62/189 (33%), Gaps = 0/189 (0%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
            +   +    L     G+ +    +     +  A  +  ++      I+L     +  G+ 
Sbjct  1    MSHFQIGWRLFKARAGVFVVSMILLFLSGIVVALLVLYRSGFAIGLIILLAWLLLCSGMI  60

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
                 M +      V         L    ++ L L      V  G  LLI+PG+   + +
Sbjct  61   VGLRVMALKSVDDRVPRVDDSFGSLALGPAYLLALTFYCAAVSLGFALLIVPGIYLAIRY  120

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
                 ++ D + G L AL  +  L  GH+  +   F++  +++   + +      +    
Sbjct  121  CLFAQIITDKSAGPLAALRDAAGLARGHYAQLGALFLIAFLLNAAGAAILGLGLVISFPV  180

Query  293  NLAFSLLLT  301
            +L       
Sbjct  181  SLLAIAGFY  189


>WP_133753375.1 hypothetical protein [Naumannella halotolerans]TDT32778.1 hypothetical 
protein CLV29_0366 [Naumannella halotolerans]
Length=417

 Score = 60.2 bits (142),  Expect = 9e-07, Method: Composition-based stats.
 Identities = 33/278 (12%), Positives = 64/278 (23%), Gaps = 23/278 (8%)

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
              +  S+ + +    V +                   L +L +    + +++   L    
Sbjct  138  GVIGRSLGVILLFGGVVMLLLALTVWLFGPRLFGAEQLPVLPLVAAGIAVVLSWTLISAR  197

Query  232  FFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT---LSFLTARIP-  286
             +  + V A +    GL A+ ++  L +G +        L   +      ++ L   +P 
Sbjct  198  LYPLRPVFALEPQHRGLAAVGRAWRLTAGRFLRTLSSVALFGSMVWLARTIALLPLALPS  257

Query  287  ------------------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
                               V  A +   ++  T F      L + DL A           
Sbjct  258  ATSGDLSTRLVGQLWPWTLVTVALSGIVAVCSTAFLITAQTLYHRDLVARQPVAPTLNWA  317

Query  329  RQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLP  388
                     + G    PG L  S   Q  +     +        +         L     
Sbjct  318  PPTTAPAGGLGGGGWQPGALSGSAPAQWPNPSGWSAGQAAPPDSVVAPQHWPAPLAPQQW  377

Query  389  EEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADR  426
              P   S                 G             
Sbjct  378  PAPGVPSEGATGQQQHPAGSQDWPGPQGQQQWPAPDQP  415


>OGY28135.1 hypothetical protein A2802_01820 [Candidatus Woykebacteria bacterium 
RIFCSPHIGHO2_01_FULL_43_29]OGY29514.1 hypothetical protein 
A3F33_01620 [Candidatus Woykebacteria bacterium RIFCSPHIGHO2_12_FULL_43_10]OGY29613.1 
hypothetical protein A3J50_00160 
[Candidatus Woykebacteria bacterium RIFCSPHIGHO2_02_FULL_43_16b]OGY31628.1 
hypothetical protein A3A61_00350 [Candidatus 
Woykebacteria bacterium RIFCSPLOWO2_01_FULL_43_14]
Length=339

 Score = 59.8 bits (141),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 56/278 (20%), Positives = 99/278 (36%), Gaps = 7/278 (3%)

Query  135  PIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMK  194
             + + ++                 ++L  +  I          + I   K    L   +K
Sbjct  63   LMSAGVIQDSTALPIMVTVTCITFLVLLALLAIFNIAIGNAQILVIDAHKDPSSLTTLIK  122

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR  254
             G R      L  +L  L++ GG  + I+P  LF   F F  Y +  D  G + AL +S 
Sbjct  123  GGYRVAVPMLLTGLLGGLLIIGGFFVFIVPAWLFMFLFSFSAYAVVLDGYGPIPALRRSI  182

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
             LVS H+W++F R VLL VI   +S+L    P V    +  F+LL+   SF+   ++   
Sbjct  183  ALVSSHFWSVFTRIVLLYVIYFFVSYLF---PKVLGLVSGDFNLLVQGLSFVADVVLGWY  239

Query  315  LKAN----YRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQ  370
            + A     Y+  +      Q       +   +    ++L+S+         +     +  
Sbjct  240  MLAYTLTLYKQIRATSGNEQGSLKWLVLTALIGWGLIVLLSIGMFRFVQSDVFKKYSEDF  299

Query  371  QRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRK  408
                 +  Q  D N   PE     +    +       +
Sbjct  300  TESIRKNIQQEDKNFQAPELDFYNTEIPSETPSFPTSE  337


>WP_196492246.1 zinc-ribbon domain-containing protein, partial [Erythrobacter 
donghaensis]
Length=40

 Score = 53.6 bits (125),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 9/38 (24%), Positives = 17/38 (45%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  + CP CG     P + + ++  + RC +C  +   
Sbjct  1   MI-IACPACGTRYAVPDAAIGSEGRTVRCAKCKHSWHQ  37


>EGT3615446.1 hypothetical protein [Clostridium perfringens]
Length=210

 Score = 58.2 bits (137),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 20/139 (14%), Positives = 58/139 (42%), Gaps = 0/139 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             ++  +      +   +  +I    +    ++K   ++      + ++    +  GS + 
Sbjct  52   ISIFSLGFSSIAIIKLVNNFINGKKISWIETIKSSFKNSIFPLGVFMIQNFAISLGSSIF  111

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
               G++  ++F         +NI  +++++KS  LV  ++  I  +   L+++    + +
Sbjct  112  APAGIILSIFFIIAMQCSIFENISVIESIKKSFSLVKNNFLDILLKQFSLVILINLATTM  171

Query  282  TARIPYVGEAANLAFSLLL  300
             A      +A+ + FSL+L
Sbjct  172  FAMFLNQSQASIIIFSLVL  190


>KPK43062.1 hypothetical protein AMK72_13880 [Planctomycetes bacterium SM23_25]
Length=408

 Score = 60.2 bits (142),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 34/326 (10%), Positives = 77/326 (24%), Gaps = 22/326 (7%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  V C  C     T  +         +C    +      A +                 
Sbjct  1    MIHVICLKCQTRFTTSDTNAGKTGRCPKCNSPLRVPPLSRAAAAAPPQPVQAGPPAAAAP  60

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
               +   ++                 L            L     ++   +         
Sbjct  61   AGGMGMGQVFRWIGEGIACGLRHVVPLTLGLLIMLILVSLAWTLCVVPGVFLE------P  114

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             L    + I+L  A   S LL +     +         I+             ++  + +
Sbjct  115  PLVAGFVLIMLGAARGESGLLGRLFAGFSEGRFWPSMGIVWLLGLIFAGVSIPISLLVLV  174

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                                       ++ +LV+     ++  P +       +   ++ 
Sbjct  175  VAA--------------SSAMLLVSEGVVGLLVLAVAIPVIYSPMVYLASRLGWAMPLVV  220

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
            D     +++L  S  L  G     FG   + +++   +S +   +   G  A L    L 
Sbjct  221  DGRARVVESLTVSWRLT-GKVARGFGL-FVQMLVLTVISLIVQALMVGGVLAILGLGGLA  278

Query  301  TPFSFLYYYLIYSDLKANYRGPQHPP  326
               +    ++ Y +           P
Sbjct  279  AVSAASPKFVAYQEALTAVENMAAMP  304


>HCO84956.1 hypothetical protein [Arenibacter sp.]
Length=113

 Score = 55.9 bits (131),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 29/107 (27%), Positives = 53/107 (50%), Gaps = 2/107 (2%)

Query  198  RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLV  257
            ++  +  L  +L   ++G G +LLI+PG++      F  Y++ D N+  + A+EKS  + 
Sbjct  1    KNYINIILANLLTFAIIGLGFVLLIVPGIILACRLAFVSYLVMDKNMEPVTAIEKSWEMT  60

Query  258  SGHWWAIFGRFVLL--LVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             GH W IFG  +L+  +VI   L F+   +  +   +    S+    
Sbjct  61   KGHGWQIFGMGLLIIPIVIGGLLCFIVGIVFSIIWISAAFASMYHAI  107


>OFT38584.1 hypothetical protein HMPREF3163_05675 [Actinomyces sp. HMSC08A01]PLB80975.1 
hypothetical protein CYJ21_01970 [Actinomyces 
sp. UMB0138]PMC93619.1 hypothetical protein CJ188_07635 [Actinomyces 
sp. UMB0918]
Length=296

 Score = 59.4 bits (140),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 26/225 (12%), Positives = 65/225 (29%), Gaps = 24/225 (11%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             I  L I +    +   ++      +  Q  +                 + +   + + +
Sbjct  60   LISYLPIFIGQILLIPLMVFVVMKAVEGQKVSIGQTWKAVGRKIWRYLGASIILWLVMML  119

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                +                     L IL +    + +++   +  V F F    +  +
Sbjct  120  ILVVITATIGGIFYASGAFETIGSGGLAILFLLILGIGILLAISIVAVRFSFYGQAIVIE  179

Query  243  NIGGLQALEKSRLLVSGHWWAIFGR-FVLLLVISLTLSFL--------------------  281
              G   +L++S  L+SG++    G  ++   +I +  + +                    
Sbjct  180  GAGAFASLKRSWQLMSGYFLRCLGITWLGQFIIGIIAASITTPISAVSALIAVAIGSGNA  239

Query  282  ---TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                  I  +    +L  S++  P       L+Y D++    G  
Sbjct  240  ASQMTIILVLSVITSLLASIITLPLQTSLMSLLYVDVRFRKEGLD  284


>WP_144831788.1 MULTISPECIES: glycerophosphoryl diester phosphodiesterase membrane 
domain-containing protein [Microbacterium]
Length=291

 Score = 59.4 bits (140),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 36/262 (14%), Positives = 67/262 (26%), Gaps = 33/262 (13%)

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL------  272
            ++L+   L      F    V+  +  G  +A+ +S  L  G +W+  G  +++       
Sbjct  1    MVLVAGYLWLAPKLFLVPSVIILERAGVFRAVARSWRLTRGRFWSTLGVLLIISIAFGIV  60

Query  273  --VISLTLSFLTARIP---------------------YVGEAANLAFSLLLTPFSFLYYY  309
              +IS+ L  ++  IP                      +G+   L    +          
Sbjct  61   AQIISIPLGLVSGLIPVLVDPTGSSGVEAFVGYLVVQLIGQLGILLVQCIALVVQSTSAA  120

Query  310  LIYSDLKANYRGPQH----PPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSA  365
            L+Y D +       H       +R     +      + +  ++         SA      
Sbjct  121  LVYVDARMRVEALDHDLQTYVEQRDAGATSLPDPYLVGVGRVVERPAPAPVGSAPAYAGW  180

Query  366  GKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFAD  425
            G          PQQ           P    +  Y           ++G            
Sbjct  181  GAPAYGPGAGAPQQGYGAPAQGYGAPYGAPAQGYGAPAQPGYGAPAQGPSGPPVPPAPGT  240

Query  426  RFWADDQNPHLWLKLELSDFPN  447
               A       W      D P 
Sbjct  241  PAAAPAPRDPSWAPPSTGDSPR  262


>HCA46319.1 hypothetical protein [Armatimonadetes bacterium]
Length=311

 Score = 59.4 bits (140),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 17/142 (12%), Positives = 40/142 (28%), Gaps = 9/142 (6%)

Query  154  NWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV  213
             +   +       I L +  +     + +    V L  S                     
Sbjct  125  WFPLMLTTILYGLISLIVVAVPLIPVVVLAGGAVFLGDSSAPEFGIAAIIAG--------  176

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
               G L+     +       F    +  ++     AL +S  L  GH+  +     ++ +
Sbjct  177  -LLGMLVAAGLSIWMTFKLIFAPLAVVLEDQSPADALMRSWRLTDGHFIRVAATIFVISL  235

Query  274  ISLTLSFLTARIPYVGEAANLA  295
            ++  L+++ A    +  A    
Sbjct  236  LAGVLTYIVAIPVQLAGALLQL  257


>NMB70371.1 hypothetical protein [candidate division WWE3 bacterium]
Length=157

 Score = 57.1 bits (134),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 26/139 (19%), Positives = 53/139 (38%), Gaps = 1/139 (1%)

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                T   L   + L  + V    + +++   ++  G LL IIPG+L  + + F +  L 
Sbjct  2    VANGTTEPLKPVLILSFKRVWPMFVHMVVKGFILLIGFLLFIIPGVLLSIRYMFSELSLL  61

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF-LTARIPYVGEAANLAFSLL  299
            ++N+G + AL +S  L  G+   ++ + +    I   L   +       G        L 
Sbjct  62   NENLGPIAALGRSSKLTKGYRLNLYLKALGFTGIQFLLLVPILVFATIFGNNPMGPIFLE  121

Query  300  LTPFSFLYYYLIYSDLKAN  318
            +  F     ++ +      
Sbjct  122  IYAFVSQLVFVYFYLDLLR  140


>MAK60876.1 hypothetical protein [Ponticaulis sp.]
Length=267

 Score = 59.0 bits (139),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 28/165 (17%), Positives = 59/165 (36%), Gaps = 0/165 (0%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +L ++ L   +    +   +  +    L            L     I    + +T  +++
Sbjct  39   VLLVFGLLPSIFMDFVVLRIEHELIYALTDYGALSYTLYSLVYNTLIAPAFAAVTMIIYM  98

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +          +   L  +    ++ ++   +   GS  L+ PGL+F     F   + A
Sbjct  99   IVRDKTAHTPSILIQTLLILPGLLIVSLIQQALTSIGSFFLVFPGLIFMTILAFAVPIFA  158

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
                G + AL +S  +  G+ W IFG  VL ++I    +     +
Sbjct  159  TRRSGIIDALSRSTAMTKGNRWKIFGLIVLTILIGALAAAGYWLV  203


>RME15377.1 hypothetical protein D6801_07530, partial [Alphaproteobacteria 
bacterium]
Length=79

 Score = 54.4 bits (127),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 10/38 (26%), Positives = 14/38 (37%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  V CP+CGA+       +P      +C  C      
Sbjct  1   MRLV-CPNCGAQYEVDDRVIPESGRDVQCSACGHAWFQ  37


>NLT64447.1 zinc-ribbon domain-containing protein [Clostridiales bacterium]
Length=326

 Score = 59.4 bits (140),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 38/297 (13%), Positives = 87/297 (29%), Gaps = 20/297 (7%)

Query  6    CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
            C  CGA             ++  C  C   +  D  +      +D  +  P        P
Sbjct  4    CQKCGAALE---------NTAKFCHICGAPVSSDEQQVSEKHLSDYNSEQPRQPSCSMEP  54

Query  66   SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIY  125
                    +  N                + +       S    ++   F      +    
Sbjct  55   DVPPPFPIEPQNQHCHQNHQYYAYANLAKRNMGYPLPFSSGAENAKITFYPTIKDIFSKS  114

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
               I+     ++   LL     L          I +  +  + +G++ +    +    + 
Sbjct  115  FAFIIKKPILLWGLSLLYILLSLLAIVLAVLPIISIPIILVLSVGMTSVFLESY----RG  170

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL---LIIPGLLFCVWFFFCQYVLADD  242
            +      +  G ++   F   +  ++L +    L+    I+  ++    + F  Y+L   
Sbjct  171  NEISSNQLFAGFKNFFHFCGGMAWMLLWIFIWGLIPAAGIVFAVIKAYSYRFVPYILLAQ  230

Query  243  -NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS---LTLSFLTARIPYVGEAANLA  295
             +I    AL +S     G+   +F   ++++      L   +L + IP +G      
Sbjct  231  PDISPFDALRESMKQTKGYCGRMFCADIIIVACIAGALLFFYLFSYIPLMGIIFKTI  287


>WP_196103494.1 hypothetical protein [Pontivivens sp. MT2928]QPH54285.1 hypothetical 
protein I0K15_00455 [Pontivivens sp. MT2928]
Length=220

 Score = 58.2 bits (137),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 24/123 (20%), Positives = 45/123 (37%), Gaps = 1/123 (1%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +    +    +  S +             +    R       + + ++L++  G +LL
Sbjct  54   IGLFLSAVINGLLCLSAWDAREGAAPDPMEDLNTAFRMAFPLLGVTLAVVLMIAFGFVLL  113

Query  222  IIPGLLFCVWFFFCQYVLADDN-IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            IIPGL     F      L  +  I  + A+ +S  L  G+ W  FG  +++  + L L  
Sbjct  114  IIPGLFAMTVFSVSVPALLLERPISMMDAMIRSTELTRGYRWQAFGGLIVIAALGLFLMI  173

Query  281  LTA  283
              A
Sbjct  174  ALA  176


>WP_185800092.1 hypothetical protein [Parasphingopyxis sp. GrpM-11]MBC2776842.1 
hypothetical protein [Parasphingopyxis sp. GrpM-11]
Length=297

 Score = 59.0 bits (139),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 19/153 (12%), Positives = 51/153 (33%), Gaps = 2/153 (1%)

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
            +     ++    + +         F      +    +       + ++V    L ++   
Sbjct  114  FTHFFPAFGIQLLVVLALGAIYVGFLFFVGVISVAAASGGSAASMTVIVLLSFLAVVAVL  173

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL--LLVISLTLSFLTA  283
            + F   +         +  G  ++L +S  L  G+ W IF  +++  +++++L ++    
Sbjct  174  MFFATGWAVPLVARLVEGTGVFRSLGRSWFLAKGNRWRIFVIYIVEFVILLALIIAMAAV  233

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
             IP  G  A+  F+       F         + 
Sbjct  234  MIPLFGSTASGGFTATWLFVLFPLQLAFSCLIW  266


>MXW30800.1 hypothetical protein [Chloroflexi bacterium]
Length=217

 Score = 57.9 bits (136),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 28/192 (15%), Positives = 58/192 (30%), Gaps = 6/192 (3%)

Query  132  AFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFR  191
              A +  +     A  +  Q          A V  + +                 V L  
Sbjct  14   NLAGLLLSPSGFVAALVWSQIIAALITGFQAPVWLLAVLARRGDPLTSAAALYGVVTLLP  73

Query  192  SMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALE  251
               +    +G    +L+L+   +   +L  +   + F V F      +  +    LQAL 
Sbjct  74   RFFIVGLVLGVGGGMLLLISYYLPALALPALPVLIYFAVRFSLSGPAIVLERRTPLQALV  133

Query  252  KSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI------PYVGEAANLAFSLLLTPFSF  305
            +S  +V G+WW  F   + +L+ ++ L   +           +          +  P   
Sbjct  134  RSWKVVEGNWWRTFLVQLPVLLFAIILVAASGAASSAVENALLSAVVGAVALGVSAPLVA  193

Query  306  LYYYLIYSDLKA  317
            L    ++ +   
Sbjct  194  LVETALFEEYSG  205


>HIE43473.1 hypothetical protein [Candidatus Omnitrophica bacterium]
Length=245

 Score = 58.6 bits (138),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 28/201 (14%), Positives = 67/201 (33%), Gaps = 0/201 (0%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
                F      +   Y  G+++          L   T  +        A+ + +    +L
Sbjct  15   YRTNFKFLISIVFLSYFWGLIIRDLQGELKPYLLSITHFSSAGMALYGALHVISPFLTIL  74

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
            G   +  ++       +  +  +    +R      ++ I+   ++ GG +  I PG++F 
Sbjct  75   GYIAIVVALKGLREGGETTINSTYSEAIRLFLPVLVVFIIEAFIILGGFIFFIAPGIIFS  134

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
            +W+ F  Y          +AL  S+ +V      + G  +++ +     S L   +   G
Sbjct  135  IWYLFSLYAAVVHRKRKAEALALSKAVVRTSIGRVIGYVLVIEIFGSLPSILFYFLVQKG  194

Query  290  EAANLAFSLLLTPFSFLYYYL  310
              ++  +         +   L
Sbjct  195  VFSSSTYLFWTIIHGIINSIL  215


>PYU97057.1 hypothetical protein DMG25_00030 [Acidobacteria bacterium]
Length=495

 Score = 60.2 bits (142),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 44/352 (13%), Positives = 87/352 (25%), Gaps = 63/352 (18%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            +  +    W  +GI  +  +L  A       L          +    A  L      LL 
Sbjct  18   FSYYRSHFWVFVGIMAIPQILLVAVALLTDALAQPVGPVDVERLSAQAQWLVLGGAGLLI  77

Query  171  LSWMTGSMFIYICK------------TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            ++      +                    G+  + +     +G    ++  + + V GG 
Sbjct  78   VAVAYAFAYTAALGATTYAISEIHLGRPAGIRSAYRAMRGRIGRLFNVMGSVGIRVLGGF  137

Query  219  LLLI--------------------------------IPGLLFCVWFFFCQYVLADDNIGG  246
            +LL                                    +L  + +      L  +N+G 
Sbjct  138  ILLSFAAGLLGAALRVATGSRTIVGLLVLSGLLVAACLAVLLMLRYAVAVPALLLENLGA  197

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE----------------  290
             QA+ +S  L  G+   IF   VL++++S   +F+      +                  
Sbjct  198  RQAMSRSVQLTKGYRSRIFMIGVLMVLVSAAAAFILRAPFSIAVVVEAARNHRATPWLAY  257

Query  291  ---AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGL  347
                A      L  P   +   L Y D++    G     +        A       +   
Sbjct  258  TADLAAGVSGALSGPLLMIGLALAYYDVRVRKEGLDLQLMMASLGERDAVAAAPAGVEPE  317

Query  348  LLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADY  399
                 +            G         + ++  +  RS          A  
Sbjct  318  AKFKKTNVVGVILLSFITGGIYFPIWFLRRREAINRLRSPEGIGWAAPFAAT  369


>PIT95734.1 hypothetical protein COT94_04060 [Candidatus Falkowbacteria bacterium 
CG10_big_fil_rev_8_21_14_0_10_37_14]
Length=251

 Score = 58.6 bits (138),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 38/226 (17%), Positives = 75/226 (33%), Gaps = 9/226 (4%)

Query  96   SGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNW  155
                 ++ ++L+            G++ +           +     +         +   
Sbjct  19   WKYYWQNFNRLMQLMLIAIPVGILGMIAVVGFISWDRVISLVLTNTVSIPMIGIAGSLIM  78

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG  215
              +++  + A IL  +        +    +     +  KLG     ++   + +      
Sbjct  79   IVSLMFLSSAIILGLVMKTAIFSMVLKKDSKSSFKQLWKLGREKYEAYFKAVFIADFFTL  138

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
              SLLLIIPG++  + + F   V  D       A ++SR L++G WW +F R VL  ++ 
Sbjct  139  LWSLLLIIPGIIKGLQYSFAGLVALDKKTQSRDAFKESRRLITGRWWGVFWRLVLPTILF  198

Query  276  LTLSFLTARIPYV---------GEAANLAFSLLLTPFSFLYYYLIY  312
               + + A I                N      LTP    Y   +Y
Sbjct  199  SVGTSVLAVIMGAQLGNKMDPTYGIVNSTAIFFLTPLFMAYIVELY  244


>MBI4101263.1 hypothetical protein [Candidatus Microgenomates bacterium]
Length=293

 Score = 59.0 bits (139),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 43/193 (22%), Positives = 79/193 (41%), Gaps = 6/193 (3%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              + + L+ L+  T          ++     +K+G      +  L++L+ L +  G +LL
Sbjct  107  IGIVWWLIILTGFTKYGIATARGQEIDFKSLLKVGYNKAAGYLGLMLLITLAILAGLILL  166

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIPGL+F  WFFF  Y+  D N+G ++A+ +SR LV G    I G      + +L     
Sbjct  167  IIPGLIFMYWFFFAPYIYIDQNVGVVEAMRQSRRLVKGKLVEILGLIGATYLFALPAY--  224

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
               +P++G           +  ++ Y Y+    L    +          W  +   I   
Sbjct  225  ---LPFLG-IFYQLIYSPASVVAWSYRYVSAKKLADANKAKPPTDNANWWGIILILIVPV  280

Query  342  MLIPGLLLVSLSR  354
            ++    +  + SR
Sbjct  281  LIALVFIASNSSR  293


>NLW13745.1 hypothetical protein [Trueperella sp.]
Length=336

 Score = 59.4 bits (140),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 32/279 (11%), Positives = 70/279 (25%), Gaps = 22/279 (8%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
                 + M G + I I    V       + L       +    +         + +    
Sbjct  57   FGRLAARMFGLLLIMIAAIIVFTIIYASVILALFDVVDVRNTTVFYGAFLLPFVTLAIAF  116

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            L    F      +  ++IG ++ + +S  L  G      G  + ++V+++ LS + + + 
Sbjct  117  LAFYRFTVTAPAMVAEDIGPVRGISRSWHLTKGSLGYFIGLLLTVIVLAIALSIVVSLLL  176

Query  287  YVGE----------------------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
                                         L  ++++ PF+     L+Y +++        
Sbjct  177  AFVAAFGVTSSSNAEFFLATSAIGTVLVTLFLTIIIAPFATAVTNLVYVNMRMKRESFHQ  236

Query  325  PPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLN  384
              +           +G               N    +  + G        T         
Sbjct  237  DALFHAGRDQLPGSYGPSNESPYGQQFDPSGNQYGSESNAGGYGQWSSQPTPHNGWQSDI  296

Query  385  RSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLF  423
             S      R S A  +        + SE           
Sbjct  297  PSDSSYGTRDSQAQQRPQWYGDAPSESETDDDSPNPFSP  335


>HGT40621.1 hypothetical protein [Schlesneria paludicola]
Length=212

 Score = 57.9 bits (136),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 21/133 (16%), Positives = 56/133 (42%), Gaps = 0/133 (0%)

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
             A +   L      +++ + + D     ++  G R+     L  ++ +L VG G L+ +I
Sbjct  66   FAAVQCFLMPGLTRLYLGVSRGDKVSLGTVFAGGRYFFRTVLSTVVFVLAVGLGLLMCLI  125

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            PG+   +  +   Y+L D+++  L++L+++  +   +        ++   I++    +  
Sbjct  126  PGIFVALRLWPYLYLLVDEDLPALESLKRAADITRDNMGTSLILGLVSFGINVAAQLVCG  185

Query  284  RIPYVGEAANLAF  296
             +       +  F
Sbjct  186  ILQLFTLPLSGLF  198


>OGL26321.1 hypothetical protein A2708_00885 [Candidatus Saccharibacteria 
bacterium RIFCSPHIGHO2_01_FULL_49_21]OGL37813.1 hypothetical 
protein A3E49_00505 [Candidatus Saccharibacteria bacterium 
RIFCSPHIGHO2_12_FULL_49_19]OGL38603.1 hypothetical protein 
A3B63_02770 [Candidatus Saccharibacteria bacterium RIFCSPLOWO2_01_FULL_49_22]
Length=236

 Score = 58.2 bits (137),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 30/189 (16%), Positives = 55/189 (29%), Gaps = 5/189 (3%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F                       S      +             +        ++    
Sbjct  40   FAVWSSLSWFTDDKTTWTDERLSLSNWFGSFSGLNVETPAGLGAGLAFLLGITAVIFGLL  99

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
                         V L    +   +      LL I L +++  G +LLIIPG++     F
Sbjct  100  QVILALRVSSGKKVELGDIWEEFKQKGFRLFLLEIALGILILVGLVLLIIPGIILFWRLF  159

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
               ++L D N G  +AL +S     G     F   +  +++   L  L+  IP++G   +
Sbjct  160  LAPFILLDKNTGVEEALRQSWRSTKG-----FAWPIYAVLLFGILLSLSGIIPFIGAIVS  214

Query  294  LAFSLLLTP  302
                +    
Sbjct  215  FVLGVAYAC  223


>NOY65511.1 hypothetical protein [Nitrospirae bacterium]
Length=222

 Score = 57.9 bits (136),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 30/203 (15%), Positives = 65/203 (32%), Gaps = 3/203 (1%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
             + L          I    +      +    ++         + +    + L       L
Sbjct  11   FFILKDNTMIMFPPILAFFLTNLLLILVLGNMVSEGVEQRKLSVSMILLVTLIGFGIQSL  70

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
              +         + +    L       ++ +G      IL+ ++   G +L +IPGL+  
Sbjct  71   SHAITVVMAQQALSEKRCSLKYGFNESMKKIGPVFSAGILIGILFTIGMMLFVIPGLVVL  130

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI---FGRFVLLLVISLTLSFLTARIP  286
              F F   V+   N+  L+A++KS  ++  +       F   +   +I   +  +  R+P
Sbjct  131  YVFMFTVVVIILRNLHTLEAMKKSYQIIRANITDTLFLFLSLLGSFLIIWIVGRIFLRVP  190

Query  287  YVGEAANLAFSLLLTPFSFLYYY  309
              G+  N      L  F  +   
Sbjct  191  LFGQIINALLMGGLFAFLAVVLV  213


>WP_075603862.1 hypothetical protein [Saccharicrinis aurantiacus]
Length=309

 Score = 59.0 bits (139),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 28/213 (13%), Positives = 65/213 (31%), Gaps = 19/213 (9%)

Query  109  DSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYIL  168
              W L       +   ++LG +L      +   +   T          +      +    
Sbjct  27   HIWNLLKLMMVLVGPFFILGGILIGRVYGTMFSMFETTLEPNPFDFALFIPAYILLIIGG  86

Query  169  LGLSWMTGSMFIYICKT---DVGLFRSMKLGLRHVGSFTLLLILLILVV-----------  214
            +       S      K    ++ L        ++   +    ILL + +           
Sbjct  87   IFYQASIVSYMKMSLKHAKDEITLQLIFTDIKKYFWKYLGAGILLGIGISFVGFIVTLVL  146

Query  215  -----GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV  269
                   G  +++   +   + F    + +  ++     +  +S  L+ G+WW+ FG ++
Sbjct  147  SIISPVLGVFVMMFGLIYLMIAFSIYPFSIGIEDASITDSFGRSFELIKGNWWSTFGYYI  206

Query  270  LLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            LL  I   +S +     Y+     +   L+  P
Sbjct  207  LLYFIQGFISAIVIVPFYILAFYKMFSQLITNP  239


>OGL95714.1 hypothetical protein A2348_03745 [Candidatus Uhrbacteria bacterium 
RIFOXYB12_FULL_58_10]OGL99857.1 hypothetical protein A2501_05525 
[Candidatus Uhrbacteria bacterium RIFOXYC12_FULL_57_11]
Length=304

 Score = 59.0 bits (139),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 43/274 (16%), Positives = 82/274 (30%), Gaps = 2/274 (1%)

Query  79   RRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFS  138
                  F +   R    S       +     +  L       L  + +L      A   +
Sbjct  1    MTHMHEFRIPGWRTIIQSIVPFYQKNVWYLVAIPLLMGAIAILPLVAVLSFYAMTALAIA  60

Query  139  ALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLR  198
               L+    L            L  + ++   L   T           +G          
Sbjct  61   GAGLQAPGTLLNVFLALLGVASLVWMLFVSGALQIATVRASYDGNAVPLGTLIRGSFDAH  120

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
             +      L L  ++V  G +L I+PG+   V   +  YVL  D +G + ++++S  L  
Sbjct  121  LIARVLGGLFLYAVMVMVGLVLFIVPGIYLAVRGSYLPYVLVRDKLGVVDSIKQSWALTR  180

Query  259  GHWWAIFGRFVLLLVISLTLSFLTARIPY--VGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
            G+WW    R+  L + S+  S   A +                   F    + ++++   
Sbjct  181  GYWWVTALRYGWLALASVAGSAAFAIVFSSPAARGVGSLLDFAWQMFVVGPFAMVFARHM  240

Query  317  ANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLV  350
             +            +  LT+A  G      +L +
Sbjct  241  YDSVRAGKGQDPEAFHALTSAEKGQFAAIIVLFL  274


>MBI1728251.1 hypothetical protein [Candidatus Rokubacteria bacterium]
Length=229

 Score = 58.2 bits (137),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 31/217 (14%), Positives = 67/217 (31%), Gaps = 6/217 (3%)

Query  102  SISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILL  161
                ++A+++ L  R       I L   +     I        +              L+
Sbjct  1    MRRDIIAETFRLLARHFHLFTLIALTVWLPGHLIINYIDFFATSKGAADAGARGFRVGLM  60

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                +  L ++    ++            ++M  G+       +  ++   +V  G + L
Sbjct  61   VEGFFGPLVVAATLNALARITRGEPASYVQAMWHGVAMWPRLFIARLVASALVLAGLIAL  120

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            ++PG+L  V   F   ++  D      AL  S  L  G   AIF     L V   +++  
Sbjct  121  VVPGVLILVRLCFVDALVVLDGAPLGAALRASNALTVGRRPAIFWTGGFLFVAVFSVAMT  180

Query  282  TARI------PYVGEAANLAFSLLLTPFSFLYYYLIY  312
             + +        V    +   ++    F+   +    
Sbjct  181  LSALAGAADHFVVQVLVDCVIAVTQVVFTIALFLFYR  217


>HEB57611.1 hypothetical protein [Gammaproteobacteria bacterium]
Length=285

 Score = 58.6 bits (138),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 39/254 (15%), Positives = 81/254 (32%), Gaps = 43/254 (17%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
                    L  I +LG++          LL   +            I    +    + ++
Sbjct  26   WKRGMLIILYMIVVLGVLAGILTATFVPLLDDPSGPPGFLIGLALVIAPLFLFAWTIFMN  85

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG-----------------  215
             +T          D+ +  ++++GL  +       +L +L+V                  
Sbjct  86   ALTAFFGALYRGVDLHIGDALRVGLNKMFPVIGWYLLFLLMVFGAMLPGGIIIGIGIASQ  145

Query  216  ----------GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
                       G  +  +P     V F F  Y++  +N+G ++A  +++ LV G+WW  F
Sbjct  146  EPAIIAMTMFFGFFVFAVPYFFVIVLFVFGPYLIVIENMGVMEAFAEAKRLVWGNWWRTF  205

Query  266  GRFVLLLVISLTLSFLTARI----------------PYVGEAANLAFSLLLTPFSFLYYY  309
               +++ ++   +S + +                    V +  N   SLL+ P S     
Sbjct  206  AYGIVVTLLVWIISMVISLPLGVAGFVMDLNGEVGGSLVLQVINNLLSLLVYPLSVAMTL  265

Query  310  LIYSDLKANYRGPQ  323
                D+     G  
Sbjct  266  SYMFDMMLRRSGSD  279


>WP_011745950.1 hypothetical protein [Chlorobium phaeobacteroides]ABL66151.1 
conserved hypothetical protein [Chlorobium phaeobacteroides 
DSM 266]
Length=218

 Score = 57.9 bits (136),  Expect = 1e-06, Method: Composition-based stats.
 Identities = 32/196 (16%), Positives = 66/196 (34%), Gaps = 12/196 (6%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
              +                 ++  +        +   A      +  +   F  +   ++
Sbjct  35   WEMFQKNIGEFIGFTLVVFVISALSARMNAFGSIIFSALAASLYAGYSIVAFKRLIGQEI  94

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
                    G  +     L  +   L+V  G +LL+IPG+   V + F  +++ D  +   
Sbjct  95   QFSD-FFKGFNYFLPLFLAGLASGLLVSLGIVLLVIPGIYLAVSYVFVTFLIIDHRMDFW  153

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLY  307
            QA+E SR +++  W+A+FG  + L             I  +G  A     L+  P +   
Sbjct  154  QAMEISRKIITKEWFAVFGLALALF-----------AINLLGVLALGVGLLVSAPVTACA  202

Query  308  YYLIYSDLKANYRGPQ  323
              + Y D+   +    
Sbjct  203  AAIAYKDIVGLHASEW  218


>NDO19887.1 zinc ribbon domain-containing protein [Lachnospiraceae bacterium 
MD329]
Length=252

 Score = 58.2 bits (137),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 28/199 (14%), Positives = 63/199 (32%), Gaps = 8/199 (4%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             +  IY     +          L   +            + + ++  I+   + M     
Sbjct  1    MIKQIYTRAFNILMKMPLKLWGLSLLSGFLTILVLIFGFLPIISIPVIVTLNAGMAAVYL  60

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL---LIIPGLLFCVWFFFCQ  236
                  +V     +  G ++ G     +    L V    L+     +  ++  + + F  
Sbjct  61   DGYNGKEV-YSDQLFSGFKNFGHVAGGMCWKSLWVLLWFLIPIAGPVIAIIKSLSYAFTP  119

Query  237  YVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT---ARIPYVGEAA  292
            Y+L  +  I   +AL KS    +G+   +F   ++  +    LS +    A IP++G   
Sbjct  120  YILTQEPEINATEALRKSMEKTNGYKANMFLAIIVPSIAFAMLSIILNTLAMIPFIGILF  179

Query  293  NLAFSLLLTPFSFLYYYLI  311
                 ++   F+       
Sbjct  180  AAVSFIVSLIFALFAPMFF  198


>WP_082152352.1 zinc-ribbon domain-containing protein [Candidatus Rhodobacter 
lobularis]KMW60005.1 MJ0042 family finger-like domain protein 
[Candidatus Rhodobacter lobularis]
Length=265

 Score = 58.6 bits (138),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 14/65 (22%), Positives = 21/65 (32%), Gaps = 1/65 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP+CGA+     + +P +    +C  C  T    PA        D     P    
Sbjct  1   MRLI-CPNCGAQYEVDDAVIPEQGRDVQCSNCGHTWYQQPAHIDAETAEDLQDHAPVEEE  59

Query  61  QRRIP  65
                
Sbjct  60  VAPED  64


>HDH91445.1 hypothetical protein [Candidatus Aenigmarchaeota archaeon]
Length=212

 Score = 57.5 bits (135),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 32/211 (15%), Positives = 75/211 (36%), Gaps = 18/211 (9%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             + L G     + I   +             +    +LLA++             +    
Sbjct  1    MLILNGFNSITSLIILKISPVEIFKNLLFYASLAIPLLLASLLISFFLSCAYCEIVRQAY  60

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILV-----------VGGGSLLLIIPGL-----  226
             K  + L +S K+G +         +L  ++           +  G   +++  +     
Sbjct  61   SKRKISLVKSFKVGKKRFIPLFGTYVLAFVILLFFNLLLIPLIFLGIWGILLFLILIVLL  120

Query  227  --LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
              +  + +F    V+  +N  G+ A++KS  +   ++W++     L   I   ++   ++
Sbjct  121  NSIAFILYFEIPAVVVLENSSGITAIKKSVEIGRKNFWSLVFVIFLTFFIVGIVNSSLSQ  180

Query  285  IPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
            IP+VG   +L  SL L  + ++     Y + 
Sbjct  181  IPFVGLIVSLIGSLFLNAWIYMLPATFYLEF  211


>WP_163952864.1 hypothetical protein [Paenibacillus sp. SYP-B3998]
Length=417

 Score = 59.4 bits (140),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 29/220 (13%), Positives = 71/220 (32%), Gaps = 9/220 (4%)

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
            + I+     ++  ++ G  F + + +   ++A         L +S  L  G++W +F  F
Sbjct  193  IGIIFFILLTIAGLLTGAYFFLRWGYYLPIVALGEDSIG--LSRSWRLTRGNFWRLFLMF  250

Query  269  VLLLVISLTLSFLTARIPYVGE-------AANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
             +L++I      +   I                  S+L+ P   L Y + + DLK    G
Sbjct  251  FVLIIILYLFQAVIQLIVTAAFGLGLGAQLLLSLVSILIAPLGILSYAISFFDLKVRNDG  310

Query  322  PQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTP  381
                 +    +  + +   +        +    ++    QL       ++  G + Q+  
Sbjct  311  LGLETMIHNTINPSGSEQLYQPERIEPALPRVVESELPNQLEEQSLPREESQGREQQEAF  370

Query  382  DLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVT  421
                      ++ +         ++ +  +E     G   
Sbjct  371  LEPDPFDPFDEQENQQALNKAEDREPQNQNEASQKDGKPE  410


>HEA68366.1 hypothetical protein [Desulfobacterales bacterium]
Length=256

 Score = 58.2 bits (137),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 26/201 (13%), Positives = 53/201 (26%), Gaps = 3/201 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  V CP+C A      SKLP   +  RC EC         E    ++     +      
Sbjct  1    MKKVECPNCKAVHRIDESKLPENGAYGRCRECKSRFFIGKNEPHPKESQKEKYSRQETEK  60

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                P    E      +C +C   +    +++           ++   +           
Sbjct  61   TETCPKCDYERTQGDESCPKCGIIYEKYSDKDRIDLKKDNEKHTESETEKSNSRGLELKQ  120

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             +GI      +         ++      N           +  +   +L   ++    F 
Sbjct  121  AVGII---GSIILFIGVFMPVVSVPVIGNFNYFQNGKGDGIIILFLSVLSFIFILLKKFK  177

Query  181  YICKTDVGLFRSMKLGLRHVG  201
             +  T +G    +     +  
Sbjct  178  KLWITGIGSLAVLAFTFIYFQ  198


>ERH25412.1 hypothetical protein HMPREF1979_00521 [Actinomyces johnsonii 
F0542]
Length=297

 Score = 58.6 bits (138),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 28/184 (15%), Positives = 58/184 (32%), Gaps = 27/184 (15%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            ++   V+ + L    +   +   I    +            + +  L+L+L +++     
Sbjct  113  MIQIIVSVVTLAFLAIAAVIVWGILGGFIANGVDEDSVGTVLLTILLILVLALVLGLAVF  172

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
             L                  L  +NIG  + + +S  L  G++W + G  +L ++I    
Sbjct  173  ALTCK--------LSLAPAALILENIGVFEGISRSWALTRGYFWRVVGIRLLSIIIIGVA  224

Query  279  SFLTAR-------------------IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
            S   +                    I  +    N      + PF      L+Y+DL+   
Sbjct  225  SQAASFALSSITNGIMLAAPNTMIAIMTISILVNSLIQAAIMPFDASVVALMYTDLRMRS  284

Query  320  RGPQ  323
             G  
Sbjct  285  EGLD  288


>NBV61863.1 thioredoxin [Rhodobacteraceae bacterium]
Length=209

 Score = 57.5 bits (135),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 14/80 (18%), Positives = 21/80 (26%), Gaps = 1/80 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V CP+CGA+   P   +PA     +C  C  T      +S                 
Sbjct  1   MRLV-CPNCGAQYEVPEDVIPAAGRDVQCSNCGHTWFVAHPDSADATAMATEVAPEWETP  59

Query  61  QRRIPSDRLEIQSKTVNCRR  80
             +  +              
Sbjct  60  ASQPAAADDSAAYDGDLGEW  79


>OGG50252.1 hypothetical protein A2763_04725 [Candidatus Kaiserbacteria bacterium 
RIFCSPHIGHO2_01_FULL_54_36]OGG75071.1 hypothetical 
protein A3A41_02155 [Candidatus Kaiserbacteria bacterium RIFCSPLOWO2_01_FULL_54_22]
Length=236

 Score = 57.9 bits (136),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 31/146 (21%), Positives = 57/146 (39%), Gaps = 1/146 (1%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
            + +  S+      +        L  S  L    +  +TL  IL I  +  G++ LI+PG+
Sbjct  68   VSIVASYWMALFTLQAHDEPSALSISAILRWDSLVPYTLGGILYIGGILIGAVFLIVPGI  127

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            +F   + F    + D  +G + + ++S  +  G  W IFG       I L  S L+  + 
Sbjct  128  IFATVYIFAYLFIIDKRLGVIASFKESARITKGARWRIFGLLAATAGIILGGSALSGLVA  187

Query  287  -YVGEAANLAFSLLLTPFSFLYYYLI  311
              VG+  +L               ++
Sbjct  188  GIVGQLLSLEPIAYYVALVIPTLAML  213


>RXZ76672.1 hypothetical protein EBB07_34185 [Paenibacillaceae bacterium]
Length=351

 Score = 59.0 bits (139),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 28/197 (14%), Positives = 68/197 (35%), Gaps = 9/197 (5%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             +   +     +   + + +  T   L     + +      + + I++++ +   + +L 
Sbjct  142  LMMIGIYFALIIAVVVVMLLFGTITSLIGVGLMEISSDPFSSGVTIVVLIALLYVAAVLA  201

Query  223  IPGLLFC--VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            +  L     + F F   ++A D+      + +S  +  G +W +F    +LL I   L  
Sbjct  202  VVSLYSFFIIRFGFFLPIVALDSGAQDGTISRSWRMTKGSFWRLFVVLAVLLAIVTVLML  261

Query  281  LTARI-------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
                +         +G+   +   L ++P   + + LIY DL+    G     + +  L 
Sbjct  262  SIQALVITVFKMSLLGQIIQILIGLAISPILTITFALIYLDLRVRSEGSDLEQVLQARLA  321

Query  334  LTAAIFGWMLIPGLLLV  350
                        G +  
Sbjct  322  NITGSNHASSAEGNVPG  338


>OGF51174.1 hypothetical protein A2231_01940 [Candidatus Firestonebacteria 
bacterium RIFOXYA2_FULL_40_8]
Length=271

 Score = 58.2 bits (137),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 35/237 (15%), Positives = 73/237 (31%), Gaps = 23/237 (10%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
             F       L    +  +     I     +                I+L+ +   ++   
Sbjct  20   TFKIYRKHFLIFVQIFAMFNLPIIIIKTAMGFLEGPYGTVTAGFINIILSLLVLPVMNFL  79

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
                    Y+ K    L            S    L+L  L++  G L LIIPG +F   +
Sbjct  80   LARTISDGYLGKKVTILSSFKHFDHLKFWSMMKTLLLSGLMIILGFLCLIIPGFIFTFRY  139

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA-  291
                 ++A + I    ALE+S+ L+  ++  +     ++ +I+  +  +      +  A 
Sbjct  140  ALVPQIIAIEGIYNKPALERSQFLMKKNFGNLMTTGFVVGIIAYAIYGVILLPMIIYIAL  199

Query  292  ----------------------ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPP  326
                                    +  ++++TP     Y L Y +++    G     
Sbjct  200  HHGAGAAALAPTGLIMVSFSFSIEVLAAVVVTPLMLTAYTLFYYNMRIKKEGFDIQM  256


>WP_005045810.1 hypothetical protein [Halococcus salifodinae]EMA49231.1 hypothetical 
protein C450_18248 [Halococcus salifodinae DSM 8989]
Length=273

 Score = 58.2 bits (137),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 29/202 (14%), Positives = 71/202 (35%), Gaps = 5/202 (2%)

Query  135  PIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMK  194
                    +  T            +L+A  A +    + +    F       +    + +
Sbjct  70   IRMVETSTEGLTPFALPVPPIVAWLLIALTAVLGEAANIIAARAFFAESSRALSGGLAGR  129

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR  254
              +    +  +  I++ ++V  G + LI+PG+ F + F F +  +A ++   + A+  S 
Sbjct  130  NIVLATLNGIVGGIVVGIIVAIGLIFLIVPGIFFAIAFLFLRQEIAIEDSNFVDAMADSW  189

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLTARI-----PYVGEAANLAFSLLLTPFSFLYYY  309
             L  G+   +F   + + ++    S +   +     P +    ++    +   F      
Sbjct  190  QLTKGNRLELFALVIGVAILVTLASTVVPLLVGAVSPLLNVVVSILLGGVTAVFGTAVIT  249

Query  310  LIYSDLKANYRGPQHPPIKRQW  331
              Y+ L A+    Q+     +W
Sbjct  250  RAYAQLHADRAAVQNGEDIDEW  271


>MBP48220.1 hypothetical protein [Myxococcales bacterium]
Length=226

 Score = 57.9 bits (136),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 17/143 (12%), Positives = 45/143 (31%), Gaps = 0/143 (0%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
              +  ++      +    ++                 G   V    +L +L   ++  G 
Sbjct  67   FFVIKLSIAGPLRAGYDMALLRITKGDQSVEVGDFFAGFHKVVPLAILGLLHGSIITAGM  126

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            L L++PG++  +  + C  +  +  +  + AL+ +  L  GH  A+F   +      +  
Sbjct  127  LCLVVPGVVLALGLWPCYLLAMEQELSPIDALKAAWRLTQGHKLALFWLVIASFGAIVVG  186

Query  279  SFLTARIPYVGEAANLAFSLLLT  301
                    ++         +   
Sbjct  187  LLALVVGVFIAAPVVQLAWINAY  209


>HCC71773.1 hypothetical protein [Bacteroidales bacterium]
Length=134

 Score = 55.5 bits (130),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 23/116 (20%), Positives = 49/116 (42%), Gaps = 2/116 (2%)

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG  245
             V     +     +     L  +++  ++  G ++LIIPG++      F  Y++ D  + 
Sbjct  2    KVDFRYLVSGFSENYVHIILANLIVFAIIMIGFIMLIIPGIILSCRLAFVSYLVMDKKME  61

Query  246  GLQALEKSRLLVSGHWWAIFGRFVL--LLVISLTLSFLTARIPYVGEAANLAFSLL  299
             + A+E+S  L  G+ W IF   ++   ++I   L  +    P +   +    +L 
Sbjct  62   PITAVEESWRLTKGYGWTIFAMGLVSFFIIIFGLLMLIVGISPAIMWVSGAFATLY  117


>WP_180275835.1 zinc-ribbon domain-containing protein, partial [Sphingobium sp. 
IP1]
Length=58

 Score = 53.2 bits (124),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 9/39 (23%), Positives = 14/39 (36%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  V CP+C      P + +       RC  C  +   +
Sbjct  1   MILV-CPNCATRYIVPDTAVGPDGRQVRCAACKHSWSQE  38


>KKS76411.1 hypothetical protein UV50_C0022G0002 [Parcubacteria group bacterium 
GW2011_GWB1_42_9]
Length=317

 Score = 58.6 bits (138),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 40/176 (23%), Positives = 68/176 (39%), Gaps = 5/176 (3%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
               Y     L    +   L       +    +   +      +   +  +  +       
Sbjct  22   FTYYQRRFKLIAGIVALPLSFYFVATVLSSFEPTLFWGAPFNLLGFVSAILSVLAIYQAM  81

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            I  ++  +    +   + +GSF  L ++  L+V GG LL IIPG++  +WF F   +L  
Sbjct  82   IDGSEHEIMSCYRHSWQKIGSFVWLALISSLIVFGGLLLGIIPGVILSIWFIFSAIILFA  141

Query  242  D---NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
            +      GL AL +SR  V G+WW I GR +   +    LSF+   I  +G     
Sbjct  142  EEGRGSRGLNALVRSRNYVRGYWWPILGRIMFFSIPMAFLSFIVMGI--LGWLLGA  195


>WP_034901039.1 hypothetical protein [Erythrobacter litoralis]AOL23783.1 hypothetical 
protein Ga0102493_112778 [Erythrobacter litoralis]KEO98732.1 
hypothetical protein EH32_06400 [Erythrobacter litoralis]
Length=236

 Score = 57.9 bits (136),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 31/149 (21%), Positives = 59/149 (40%), Gaps = 0/149 (0%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
                +L G      ++ +        L R      R   ++ L+ +L I+V+   SL L+
Sbjct  64   LGVGLLTGFGAFVIAVVLNYWLYAALLARQPAPAFRRFWAYVLVSVLSIIVIVLASLALV  123

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            IPG++  V +      L   +   + +  +S  + SG  WAIFG  V+++ + L LS L 
Sbjct  124  IPGIIVAVRWAPLIPALLARDEPAMDSFGESWQMTSGSSWAIFGVAVVVIAVGLFLSLLF  183

Query  283  ARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
                 V    +   ++           ++
Sbjct  184  DLATAVAGGPDSLGAVFFDAAGGNVMAVL  212


>TAL09222.1 hypothetical protein EPO00_06175 [Chloroflexi bacterium]
Length=354

 Score = 59.0 bits (139),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 21/185 (11%), Positives = 48/185 (26%), Gaps = 11/185 (6%)

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
              +            F ++    +  F     G    G F +    + +     +L++  
Sbjct  167  SIFPASIALIGVLGAFYWLQGRWLSGFALSGFGYPGNGRFDVSAAGVAVAFAIVTLIVEF  226

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
                  V +      LA + +    AL +S  L  G   +I   F++       ++ +  
Sbjct  227  GLFYAVVRWSVAVPALALERLSLRNALRRSSELTKGRRLSILWLFIVAGFALTLVASVLV  286

Query  284  RIPYVGEAANL-----------AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWL  332
              P +                   ++        Y  ++ + L  + R     P      
Sbjct  287  YGPAIIVVIAGWGFDAMVASIMIGAIATVALGAPYMAILVAILYRDLRDAGPRPRNDVPP  346

Query  333  PLTAA  337
                 
Sbjct  347  VPPGW  351


>MAG90267.1 hypothetical protein [Rhodobacteraceae bacterium]
Length=74

 Score = 53.6 bits (125),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 14/36 (39%), Gaps = 0/36 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
            + CP+C A  + P   +P      RC +C      
Sbjct  2   RIACPNCSAHFDVPDDAIPKDGRKLRCSQCQHKWHQ  37


>PSP47766.1 hypothetical protein BRC75_08735 [Halobacteriales archaeon QH_7_69_31]
Length=221

 Score = 57.5 bits (135),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 25/128 (20%), Positives = 50/128 (39%), Gaps = 3/128 (2%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                  ++G       +   +   D     S+ + L  +    L   +  + V  G  LL
Sbjct  51   VGSVIAIVGQGVGIALVANRLDGLDPTPENSLGVRLIVLFVSVL---IGGIGVIVGLFLL  107

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            +IPG+   V  +     +  D+ G ++AL +S    +G+   +FG  + +++I +  S L
Sbjct  108  VIPGVYLWVRLYLAPPAVIVDDCGPVEALGESWSRTAGNTVTVFGVALAVVIIGVGASAL  167

Query  282  TARIPYVG  289
                   G
Sbjct  168  VLLALTGG  175


>WP_184732161.1 hypothetical protein [Streptomyces netropsis]MBB4885559.1 hypothetical 
protein [Streptomyces netropsis]
Length=263

 Score = 57.9 bits (136),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 25/166 (15%), Positives = 46/166 (28%), Gaps = 0/166 (0%)

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
             +       A              +    +LL+      +  S            T V +
Sbjct  32   CIYALVGGFASEEGETLEAWDIFLSAGIPVLLSLYLLQGVVGSLAATLAEQDERGTRVTV  91

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQA  249
             R ++  L  +       +L   +       L        V +     V A + +G  +A
Sbjct  92   RRLLRASLPRIPGTVGAYLLTAFLFPLLVSPLAPLVAWPWVLYSLAPSVTAHEEVGMFRA  151

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
            L ++  LV G WW +F       +I      L   +       + A
Sbjct  152  LRRASKLVKGVWWQVFVAMAFAALIGFGADMLMGFLVPADWVVSAA  197


>MBI4600142.1 hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=313

 Score = 58.6 bits (138),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 36/198 (18%), Positives = 67/198 (34%), Gaps = 11/198 (6%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             ++L    L F        + P+   + Q   W     +     +    +  T S+ +  
Sbjct  126  LLFLACSNLLFILTRVGASINPSFVRDIQVSRWYVLASIIIGFLLTSYAAAGTISIALRY  185

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
             +     FR+  +       F +  IL  L +  GS++ +IP   F   FFF  Y++   
Sbjct  186  SENGEKAFRAFFVAPMQYIHFLIANILYTLGLLIGSVVFVIPAAWFGSRFFFWPYLILSK  245

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             I  + A  +S+ +  G    +F              FL+  I  +G        L + P
Sbjct  246  KISFIAAFRESKRMSKGATVEVF-----------LFWFLSVMITTLGIVVVGIGYLFILP  294

Query  303  FSFLYYYLIYSDLKANYR  320
               L    +Y  +    +
Sbjct  295  MVALATVYVYKQMDKRCQ  312


>HBZ40362.1 hypothetical protein [Erysipelotrichaceae bacterium]
Length=202

 Score = 57.1 bits (134),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 33/185 (18%), Positives = 72/185 (39%), Gaps = 4/185 (2%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
            +    +  A     A  +              T A     + ++       +    VGL 
Sbjct  1    MFATILNLAAFPMTAGLIKLGLTGENVTFNDFTTALSENIVKYLKLIFGGILLGLAVGLV  60

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
              + + +  + +F+    L +  + G  LLLI+   +  V+F F    +  D++  + A+
Sbjct  61   SVIYVIVTFMVTFSGDT-LNLFALIGLLLLLILVLAVGAVFFTFWFAAMVLDDLTVMNAI  119

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFL---TARIPYVGEAANLAFSLLLTPFSFLY  307
            +KS   V   +W + G  +L+ +++  LS +      IP VG   +   +   T  +  +
Sbjct  120  KKSFDSVKRCFWTVVGVTLLIQILTNILSGIAGGFGGIPLVGALLSSVVTTGSTVLTMAF  179

Query  308  YYLIY  312
             +++Y
Sbjct  180  TFILY  184


>CCH73280.1 membrane hypothetical protein [Tetrasphaera australiensis Ben110]
Length=355

 Score = 59.0 bits (139),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 28/231 (12%), Positives = 60/231 (26%), Gaps = 8/231 (3%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            +     L     +   V   F    +A +  G ++A+ ++ +L  G WW   GR ++L +
Sbjct  76   MLLLLFLAYPVMIWLSVKCAFIFQTVAIEKAGPIEAIRRTFVLTKGMWWVTLGRQLMLSL  135

Query  274  ISLTLSFLTARIPYVGEAA--------NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
            ++L L F+ +       A         ++A  ++      +    +Y     +       
Sbjct  136  LALPLMFVFSLASAPFSALTRDSGDASSVAGGIVAAIVMGIGAIALYIFQLYSVVYSTLM  195

Query  326  PIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNR  385
             I  +              P    V  + Q        +A       +      +P    
Sbjct  196  YIDARRRESLGGQAHAYGSPQQDYVQPAYQPGGYPYGPAASGSAAGAVAGSQWNSPAYPG  255

Query  386  SLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHL  436
              P          Y           +         +            P +
Sbjct  256  PAPVPSSPYPQQPYGQYPQSTTDAPAYPESGGSLWSGSPSSSDPTAPMPQI  306


>WP_149346280.1 hypothetical protein [Pedobacter sp. BS3]TZF85665.1 hypothetical 
protein FW774_00885 [Pedobacter sp. BS3]
Length=296

 Score = 58.2 bits (137),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 20/150 (13%), Positives = 52/150 (35%), Gaps = 1/150 (1%)

Query  147  WLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL  206
              +     +   I  A + Y  +  + ++                 +    R+     L 
Sbjct  77   ISSYMGWEYAMVIFFALLNYTAITGTVLSYISLYLEKGKQAPSVEEVWSYFRYFFMRILG  136

Query  207  L-ILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
               LL + +    +L ++PG+           V+  +N     + ++S  L+  +WW  F
Sbjct  137  SVTLLGIALVIAIMLCLVPGIYLFPIISLVYPVMVLENGTLRYSFDRSYSLIKNNWWVTF  196

Query  266  GRFVLLLVISLTLSFLTARIPYVGEAANLA  295
            G  +++ +I+     L +    + +  ++ 
Sbjct  197  GTLLIIWLIAYACMSLVSLPAILMQTVSMF  226


>RKX99572.1 hypothetical protein DRP77_13020 [Candidatus Poribacteria bacterium]
Length=247

 Score = 57.9 bits (136),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 31/225 (14%), Positives = 64/225 (28%), Gaps = 12/225 (5%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +  L     F+      +L  A              +L+ +    +       S+   
Sbjct  21   YILLCLPGRAVFSAALWVAILPSAQRGGAPFGVALTVSVLSALIAYPITSPPPIASVAYE  80

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +     G+ R  +   +         I+ ++ +G G  L   P LL  V      + +  
Sbjct  81   LIGERGGIGRPFRRTFKVFFPLLGTFIMWLIFMGIGLALFAFPALLLYVRHCLAPHAVII  140

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG------------  289
            +  GG  A+ + + ++ G +       +  L   +    L      +             
Sbjct  141  EGEGGYGAMRRGKAILEGEFGRGMAVMLFALFAEVLAGQLIGTALLLLKGAEGFGAGWEY  200

Query  290  EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
                    LL  PF  L    +Y  L+    G      + ++L  
Sbjct  201  LILGFILPLLFDPFRALLSTELYLSLRREKEGLTDESFREEFLEW  245


>STO01854.1 Protein of uncharacterised function (DUF975) [[Eubacterium] infirmum]
Length=341

 Score = 58.6 bits (138),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 35/308 (11%), Positives = 87/308 (28%), Gaps = 4/308 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L+  ++   +            +                      +I    S       +
Sbjct  34   LIYTFVCSNLPYMLGDMIPAFKRTIYLDFIDRYYEYSTFPGLYSIFIEAAFSLGLAMFIL  93

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
               +        +  G        L+ +++ ++   G +L IIPG++F + +    +++A
Sbjct  94   NFIRAGKISIELIFGGFEKFIKAFLMNLVMSIITVIGLILFIIPGIVFALMYSQAYFIMA  153

Query  241  DD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV---GEAANLAF  296
            D+  +G ++ L++SR+++ G+   +FG     +  +L  S       Y+      AN   
Sbjct  154  DNPELGAIECLKRSRIMMIGNKGYLFGLIFSFIGWALLASISIVFGKYIIEDMMIANPVV  213

Query  297  SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQN  356
             +L+T    +  Y + + L+                 +T             +   ++  
Sbjct  214  QMLITIVLEIPMYFVLAYLQITNGIFYELASGHIRPVVTPTYANMYGNLNQNMPRNNQGF  273

Query  357  LSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLS  416
                          Q             ++     + L +              SE   +
Sbjct  274  NQNVSDNYNQGYGVQNPSVSKDVGVGEPQADFGGAETLVNESPANNTIANDTAASEVPEN  333

Query  417  LGPVTLFA  424
                    
Sbjct  334  NESAEADE  341


>RLG27558.1 hypothetical protein DRO03_11900 [Methanosarcinales archaeon]
Length=143

 Score = 55.5 bits (130),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 31/125 (25%), Positives = 55/125 (44%), Gaps = 11/125 (9%)

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQA  249
            F++    +     + +  IL  L+V  G +LLIIPG+++ + F F  Y++ D  +G + A
Sbjct  7    FQTFSPVILSFFDYLIGSILYGLIVVVGLILLIIPGIIWAIKFQFFDYLIVDKGLGPVDA  66

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
            LEKS  +  G  W +F               L A I  +G    +    +  P + +   
Sbjct  67   LEKSSDITRGVKWDLF-----------AFGILLAIINILGFLCLVVGLFVTIPVTLVAMA  115

Query  310  LIYSD  314
             +Y +
Sbjct  116  FVYRE  120


>GGA60510.1 hypothetical protein GCM10008025_00670 [Ornithinibacillus halotolerans]
Length=636

 Score = 59.4 bits (140),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 35/340 (10%), Positives = 92/340 (27%), Gaps = 15/340 (4%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              I     +           L         +      + L   + +   +    GS+   
Sbjct  58   FNIQFTLPLFIPLLSDLQQSLTFLPEQPGDSIVLTLVVALVYFSLVSYTMGMFLGSIKQV  117

Query  182  IC------KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
            +        + + L       L     FT ++ ++   +   +++  I G +  + +   
Sbjct  118  LSPSSLQQDSFLQLGYRYYWRLFTYQLFTSVIGVVSFYLLITTIIGGIIGFIVLLLYVLV  177

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP------YVG  289
             Y++  ++    +AL  S   V  ++   F   +  ++    LS     +P      Y+G
Sbjct  178  PYIIVLEDKSFSEALGDSPKYVKRYFTKYFRLAIGAILSIAILSIGIQLLPNESLKYYIG  237

Query  290  EAA-NLAFSLLLTPFSFLYYYLIYSDLKANYRGP--QHPPIKRQWLPLTAAIFGWMLIPG  346
                    S+ +  F  L +  I  +          +    K +   +   +F +  +  
Sbjct  238  LVTYTFIGSVFIAAFMHLLHNCIREEDLQTEEDQLVKRIVPKWKKWTIIMIVFLFPWLGV  297

Query  347  LLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQ  406
                      +  +  ++  + +  +    P      +       +     +  + L   
Sbjct  298  QFAKGEHVTAIQFQPKITYSEGVYYKANWSPANNGSNHTYTTYGFEDGEEFELTMSLPDS  357

Query  407  RKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFP  446
              +T       G +T   D+              E     
Sbjct  358  ITSTDGPFFGEGEITWKVDKERITKNGNSTVYWGEEVAET  397


>QDT06426.1 hypothetical protein K227x_48360 [Planctomycetes bacterium K22_7]
Length=264

 Score = 57.9 bits (136),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 34/189 (18%), Positives = 67/189 (35%), Gaps = 0/189 (0%)

Query  103  ISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLA  162
                         +       ++ L  +L + PI   ++L     L     +     ++ 
Sbjct  25   QDWGTGTVIGNAWKIYAEHFRLFGLVALLVWLPIEVVMVLWDDLLLESSGVSGWLLNIIF  84

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
              A   L  +         I K       S + G+          ++  +VV  G +LLI
Sbjct  85   AFATQWLPEAIALCIARSVIAKEKPTFRNSFRQGVGVWLWLAWTNLIATIVVSLGFILLI  144

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            +PG+   + F  C  V+ D+ + G  A+ +S  +   H+W + G   L++ +   LS + 
Sbjct  145  VPGIFLSIRFSLCNAVVVDERLKGTVAMRRSFDMTGSHFWPLLGLTALMIAVVAGLSSID  204

Query  283  ARIPYVGEA  291
              I    + 
Sbjct  205  LLISGPIQV  213


>NMC56649.1 hypothetical protein [Eubacteriaceae bacterium]
Length=335

 Score = 58.6 bits (138),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 31/307 (10%), Positives = 85/307 (28%), Gaps = 13/307 (4%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            + L            +  +  +          K   +   +        +L ++ Y+ + 
Sbjct  27   FMLLAAFASSFEMAMIFVLTDSSVSGEKRSFGKMLAFSFKRAFPVMGTTVLLSLIYLAVF  86

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
              ++  +  I +              +           + ++++    L+++   +L  +
Sbjct  87   AVFLIAAAAIAVSMGIDFSSIMYYEDISSQMQL----GIYVIILFVLYLIMLAFVILISI  142

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR------  284
             +    +        G  +L+ +RLL  G    IFG  +L+ +I + ++ L +       
Sbjct  143  IYSMSIFARVKYKFCGAASLKYARLLTKGKRGKIFGNMLLIALIDIVIAGLLSYAANTAS  202

Query  285  ---IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
               IP++     +  + +    S  +  +  +               +      A     
Sbjct  203  DFDIPFIPFLIQIVITFIGMLVSVFWAVMFINFDNVKGPDIISSRFAKAINIKDAYFQSE  262

Query  342  MLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKL  401
             +         + Q  +A    S   +  +    Q    P   ++       L  AD   
Sbjct  263  TVSQDYTYNQNNSQQNNAPVQNSPTDEKAESAPAQSDVQPVDAQTDNSLNGELKDADQPS  322

Query  402  LLSKQRK  408
                   
Sbjct  323  AEPSDTD  329


>WP_036850679.1 hypothetical protein [Porphyromonas macacae]KGO00677.1 hypothetical 
protein HR11_02210 [Porphyromonas macacae]
Length=234

 Score = 57.5 bits (135),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 36/179 (20%), Positives = 63/179 (35%), Gaps = 1/179 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +  +  L + L +      L L   +          +  +      ++            
Sbjct  27   VFVLSALVLSLMYVFYLLTLSLFIDSLSINVLMAIVFLAVYLFACLVMQTGFCRIALHLA  86

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                      +      R    + +L I++ L+V  G  LLIIPG+ F + F+   YV  
Sbjct  87   AGGSFSFSESKKFFWAPRLYIPYFVLSIIVALLVSVGLALLIIPGIYFAIKFYLMPYVYF  146

Query  241  DD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
            D+   G  + LE+   +  GH+  I G F+L +V SL    L     +V     +  S 
Sbjct  147  DESERGIFEVLERLWKISKGHFPGILGFFLLAIVASLLGFLLLGVGIFVTMPLYMILSA  205


>KKT31394.1 hypothetical protein UW18_C0004G0051 [Microgenomates group bacterium 
GW2011_GWF1_44_10]KKU01707.1 hypothetical protein UX04_C0004G0051 
[Microgenomates group bacterium GW2011_GWF2_45_18]OGJ41369.1 
hypothetical protein A2378_03290 [Candidatus 
Pacebacteria bacterium RIFOXYB1_FULL_44_10]HAU99562.1 hypothetical 
protein [Candidatus Pacebacteria bacterium]HAX01486.1 
hypothetical protein [Candidatus Pacebacteria bacterium]
Length=231

 Score = 57.5 bits (135),  Expect = 2e-06, Method: Composition-based stats.
 Identities = 32/199 (16%), Positives = 72/199 (36%), Gaps = 11/199 (6%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               +++  I++ F   F A+ +  A      +      + +            +      
Sbjct  27   WALLWVGIIIIFFFVSFLAIGIGLAQNPETPSMAGITLVQILAYVVEAFMGIGLIRLSLQ  86

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +   +V +  ++    + +  F L  ++   ++ GG L+ I+P     + F+   +V+A
Sbjct  87   IVDGKNVSISEALPNSWKTILWFFLAGLMYGCMLLGGFLVFILPAFYIALRFWMFPFVIA  146

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
            D N G ++ALE       G            L+  L LS +T+    V         ++ 
Sbjct  147  DRNEGPIRALEIIAETTKG-----------NLINLLLLSLVTSFFNVVAILTLGIGWIIT  195

Query  301  TPFSFLYYYLIYSDLKANY  319
             P + L + + Y  + +  
Sbjct  196  IPATHLSFAIAYRAMSSEN  214


>PIN91066.1 hypothetical protein COU57_02180 [Candidatus Pacearchaeota archaeon 
CG10_big_fil_rev_8_21_14_0_10_32_14]
Length=335

 Score = 58.6 bits (138),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 36/217 (17%), Positives = 73/217 (34%), Gaps = 48/217 (22%)

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
                    + +   + I + L      +        +    +   G ++         ++
Sbjct  119  FTFMLAILLFVLIFSIIQILLYGAIIYVVNNNKNGKMTFGEAFNGGKKYFWKLAGGFCVI  178

Query  211  -----------ILVVGGG--------------SLLLIIPGLLFCVWFFFCQYVLADDNIG  245
                       I+++  G               L +I   + F + F F  Y +A DN  
Sbjct  179  LAILVGVLIAGIIMIVIGALLGSIGIILIIFSVLAMIGFFIYFTILFMFFPYSIAIDNAK  238

Query  246  GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-------------------  286
             + + ++SR +V G WW +FG F LL++I + +  + A I                    
Sbjct  239  VIDSFKRSRGVVKGRWWKVFGYFCLLMLIMIGVYLVYAIISNIVGVFMGIFILINLILGL  298

Query  287  ----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
                ++    N  ++L++ PF  L+    Y +LK   
Sbjct  299  IVAGFLIAVLNSLWNLVIMPFMNLFLNNFYIELKKRK  335


>HEG99323.1 hypothetical protein [Thermoleophilum album]
Length=220

 Score = 57.1 bits (134),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 25/124 (20%), Positives = 44/124 (35%), Gaps = 0/124 (0%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
              L  +     +     +   GL  ++           L+L L  + +  G LL ++PGL
Sbjct  77   GPLVTAACIRVVVREPARERPGLAAALAFAFELFPRAVLVLTLATVGIFCGLLLFVVPGL  136

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
               V +     VLA +++  L AL +S  LV G +W          + +   +F      
Sbjct  137  YLAVRWALVLPVLALEDVNPLAALRRSGQLVRGAFWRCAFVVAAGSLFASVAAFAIGSPF  196

Query  287  YVGE  290
                
Sbjct  197  AAVA  200


>WP_174134421.1 hypothetical protein [Sulfitobacter sp. 1151]NSX53297.1 hypothetical 
protein [Sulfitobacter sp. 1151]
Length=257

 Score = 57.9 bits (136),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 42/240 (18%), Positives = 77/240 (32%), Gaps = 7/240 (3%)

Query  97   GSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQ  156
                     L +    +       L       I+     + ++    P+T          
Sbjct  19   FFSNMHWLFLFSFVPAIVIILAPILTRFITFHILTLDYFLSTSGTANPSTISWITGVFTF  78

Query  157  WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG  216
              + +  V    +    +   +   +    V          R +     +  +  LV   
Sbjct  79   IRLFVINVLASSIAAGCIACLVRGILKSEPVRPLCYFAKTFRLIVPLFFVCFITNLVTII  138

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
            G LL+IIPGL+F VWFF     L  ++     +  +S+ L +G+ W I    VL  V ++
Sbjct  139  G-LLVIIPGLIFAVWFFVLIPTLMIEDNRYF-SPSRSKALAAGYGWPILCLVVLKGVFAV  196

Query  277  TLSFLTAR-----IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
              +FL        I ++G   +   S+    F  +   + Y  L     G     I   +
Sbjct  197  ICTFLPTFFFPLDITWIGWIMSAVISMTSMVFFSIASAITYFRLIEIKEGYSTQNIAEIF  256


>MBA2655141.1 hypothetical protein [Gammaproteobacteria bacterium]
Length=247

 Score = 57.5 bits (135),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 39/229 (17%), Positives = 79/229 (34%), Gaps = 5/229 (2%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYIL--  168
            W L+      +L    +  ++   P     +       +   +   WA+LL  V   +  
Sbjct  19   WSLYTTSLKYILVWSFIVSIVHIIPTLFGFVGFFYQDFSGHLEFSWWALLLFIVLLTVEA  78

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
              ++ +  +++    +  V    S+   L  +    + ++L  + V  G  L I+P +  
Sbjct  79   FFVAILFYNIYTIATEQKVNYKLSISTALTCLIPLYVAMVLYFVFVNVGMFLFILPAVFI  138

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY-  287
             +        +  D +   +A E S  LV G+WW  F   V+   IS  L  L    P+ 
Sbjct  139  SISLVMFLPFIVIDRLNVYKAFEASARLVWGNWWQTFIVLVIPYAISYFLRSLFKITPWG  198

Query  288  --VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
                         +  P+ +    + Y++LK     P+      +    
Sbjct  199  GEWLLFIEAIILTISMPYFYSALLIQYNNLKIIKSLPEPMAQPPRTQGP  247


>MSY37893.1 hypothetical protein [Actinobacteria bacterium]
Length=223

 Score = 57.1 bits (134),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 24/137 (18%), Positives = 45/137 (33%), Gaps = 2/137 (1%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
            A     LL  + +  +               +     H+  F L   L  ++   G LL 
Sbjct  77   AFSLIALLASAGVMRAALGSTRGQAPSFADML--NGTHLWKFLLFSFLYTVLQNVGLLLC  134

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
              P ++  ++F    + + D  IG L A + S  ++ GH+        L + + +  SF 
Sbjct  135  FFPIVIVTLFFQLGPFYILDRGIGPLAAFKASARMMRGHFQIGLWLAALTVAVLILGSFT  194

Query  282  TARIPYVGEAANLAFSL  298
                  V        + 
Sbjct  195  LGLSTLVTLPMLSLVTA  211


>MAS95237.1 hypothetical protein [Verrucomicrobiales bacterium]
Length=373

 Score = 58.6 bits (138),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 40/321 (12%), Positives = 83/321 (26%), Gaps = 27/321 (8%)

Query  17   SSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTV  76
             + + A+      P      + DP++                 L         E + + V
Sbjct  39   ETLVWAEGMPDWKPFSEAQEMGDPSDPMVKCAYSGEVRPESQMLPFGDGWVAPEFKDQFV  98

Query  77   NCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPI  136
                                           +            L+ + + G +      
Sbjct  99   QQLAEGGVSVEDGVVGEAYVADFTLGTMFSQSWKIWTDQLFQILLITLIIWGPISIAMEF  158

Query  137  FSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW-MTGSMFIYICKTDVGLFRSMKL  195
                +           Q        A     ++     M  ++  +     V L  +   
Sbjct  159  LVYEVFTEDPESLQAVQRSFQLERAAEFWIGVIATGGTMMIAILRWNGGGKVDLPGAFSE  218

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC------VWFFFCQYVLA---------  240
            G R+ G       LL L+V GG+++L +P ++        +W FF    +          
Sbjct  219  GFRNYGRLLGTRFLLNLLVMGGAIVLCVPIIVMADVVSEGLWLFFIPLGILAIWLIVRLG  278

Query  241  -------DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA--  291
                       GG+ A++ S  +  G  W I    +    + L + F+T  +  + +   
Sbjct  279  CADGAALIREEGGMPAIKYSWKMTKGRVWKICLFRIAAYSLPLVVVFITGMVLAIPQLDN  338

Query  292  --ANLAFSLLLTPFSFLYYYL  310
               +   S + T         
Sbjct  339  FWMSGIVSAIGTAVLSFTIVF  359


>CAD7185640.1 unnamed protein product [Sepia pharaonis]
Length=976

 Score = 59.4 bits (140),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 17/327 (5%), Positives = 55/327 (17%), Gaps = 11/327 (3%)

Query  27   ARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFC  86
              C                                   P     +             + 
Sbjct  144  IYCAIWPFIYWAMCPFIYWAIWPLIHWAMWPFIYWAMCPFIYWAMFPFIYWAMCPFIYWA  203

Query  87   LQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPAT  146
            + P   +        ++   +  +   F            +   + +A            
Sbjct  204  IWPFIYWTMCPFIHSAMCHFIYWAMCPFIHWAMCPFIYCAIWPFIYWAMCPFIYWAMCPF  263

Query  147  WLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL  206
                      WA+       +   + W        +    +       +           
Sbjct  264  IHWAMCTFIYWAMCHFIYWAMCPFIYWTMRLFIYCVLWPFIYWAMCSFIHRAIWPFIYWA  323

Query  207  -----LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW  261
                    +   +       I   +   +++  C ++          A+          +
Sbjct  324  IWPFIYWAIWPFIHWAMWPSIYCAIRPFIYWTMCHFIHCAMCPFIYMAMCPFIHWAMCTF  383

Query  262  WAIFGRFVLLLVISLTLSFLT------ARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
                    +   +   + +        A  P++          ++ PF +          
Sbjct  384  IHWAMCPFIYWAMCPFIHWAMCTFIYWAMYPFIYWTMWPFIHCVIWPFIYWAMCSFIHWA  443

Query  316  KANYRGPQHPPIKRQWLPLTAAIFGWM  342
               +      P     +        W 
Sbjct  444  IWPFNYCAIWPFIYWAMCPFIYWAIWP  470


>PKQ21473.1 hypothetical protein CVT65_18350, partial [Actinobacteria bacterium 
HGW-Actinobacteria-5]
Length=147

 Score = 55.5 bits (130),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 16/91 (18%), Positives = 33/91 (36%), Gaps = 0/91 (0%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              G+L+ +   +           V++ + +G L+A+ +S  L  G +W + G + L  VI
Sbjct  13   VLGTLVAVPLAIWLGTRLILAPAVISVERLGPLRAIRRSWFLTHGQFWRMLGIYGLSSVI  72

Query  275  SLTLSFLTARIPYVGEAANLAFSLLLTPFSF  305
                +   + +     A        L     
Sbjct  73   ISLAAGTVSSVFSFAGALLGIADANLALIGM  103


>NNJ70809.1 DUF975 family protein [Kiritimatiellales bacterium]
Length=240

 Score = 57.1 bits (134),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 31/206 (15%), Positives = 68/206 (33%), Gaps = 11/206 (5%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
                        + +    + +  +  F  ++   A        N   ++       +  
Sbjct  43   WGMSILGYVLYTVLVMSFSLFVFSSVFFVGVVSGVAGGDMTAATNAMQSVSQIVELLVSG  102

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
              +    + F+ I +        + +G R       +     L V   +LLLIIPG++  
Sbjct  103  AFTVGFMAFFLGIAQEGEARLELLFVGFRRFWKSFCVYFFYSLFVLLWTLLLIIPGIIAT  162

Query  230  VWFFFCQYVL-ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
              +    +++  D++ G L+A+ +S+ +++G+ W  F         +L   F        
Sbjct  163  FRYAMAFFIIADDEDCGALEAIRRSKEMMAGNKWKFFCLHWRFFGWALLAVF--------  214

Query  289  GEAANLAFSLLLTPFSFLYYYLIYSD  314
                     L L P+    +   Y D
Sbjct  215  --FTFGIGFLWLVPYMQTAFAKFYED  238


>KTR86842.1 hypothetical protein NS354_02490 [Leucobacter chromiiresistens]
Length=399

 Score = 58.6 bits (138),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 25/154 (16%), Positives = 48/154 (31%), Gaps = 2/154 (1%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                 + + +    G+    +    V LF    +      S +     +   +     L+
Sbjct  161  WGGLILWVLILAGAGAAVAIVAVVLVFLFLLATVVFSGDASSSSAPSAVWPGLVILLALI  220

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
             IP +   +       V+  D +    A+ +S  L  G  W +FG F  L    + +S L
Sbjct  221  CIPVVWLSIRVALTPAVIVLDRMRLGAAIRESWRLTRGRGWRVFGVFFALGSAHMIVSLL  280

Query  282  TAR--IPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
                 I          +  +  PF      L++ 
Sbjct  281  ALIVCIFAAAAVMGAVYFEMFGPFEHWTLVLMFP  314


>NWG92281.1 zinc-ribbon domain-containing protein [Parvularculaceae bacterium]
Length=61

 Score = 52.9 bits (123),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 9/49 (18%), Positives = 15/49 (31%), Gaps = 1/49 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTT  49
           M  + CP C    +    +      S RC EC ++      +       
Sbjct  1   MI-ITCPSCATRYDVDDDRFSPDGRSVRCAECNESWFVPAPQPIENLMP  48


>NNF63771.1 hypothetical protein [Acidimicrobiia bacterium]
Length=277

 Score = 57.5 bits (135),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 31/139 (22%), Positives = 65/139 (47%), Gaps = 0/139 (0%)

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
            + +   +          L    ++ +  +GS  LL+++L ++VG G +L I+PG+   V 
Sbjct  75   AVLFNMIVGGERTGTADLGEGFRVTMGRIGSLVLLVLMLGVLVGVGLVLFILPGVFAIVV  134

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
             F    VL  +  G + ++++S  LV   +  +FG  ++L+ + +T  F    + +VG  
Sbjct  135  LFPAVAVLYLEGKGAVASMKRSYQLVIKRFLEVFGLLLILVALGITAGFAMGALGFVGSV  194

Query  292  ANLAFSLLLTPFSFLYYYL  310
               A   L+   +  + Y+
Sbjct  195  VASALGSLVGQAATYHAYM  213


>NIP30995.1 hypothetical protein [Candidatus Dadabacteria bacterium]
Length=179

 Score = 55.9 bits (131),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 31/148 (21%), Positives = 60/148 (41%), Gaps = 0/148 (0%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
                    +++             +  +      + +        + +  C  + G    
Sbjct  1    MLIYIPTNIIELIKEDTAVALVLFFYGIYFIFLILNVINEIGLIKISLKFCDNEKGKLSD  60

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
            + L       F L +IL  L++ GG++LLIIPG+++ + F F  Y + D  +G ++AL+K
Sbjct  61   LFLHYPLFFKFILGVILYGLIILGGAILLIIPGIIWGIKFQFFSYFIIDKGLGPIEALKK  120

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            S L+  G    +F   +LL       +F
Sbjct  121  SALITKGVKLDLFIFAILLXKYYWNFTF  148


>WP_004099200.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Acetonema longum]EGO62212.1 hypothetical 
protein ALO_19662 [Acetonema longum DSM 6540]
Length=266

 Score = 57.5 bits (135),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 44/250 (18%), Positives = 90/250 (36%), Gaps = 13/250 (5%)

Query  91   REFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNP  150
                         +++   SW +F  +   +L I +L  +     +     L+       
Sbjct  1    MNQYQETIREYDRNEIWQISWSVFTSQFKSILIIMMLFYIPMNGLLMI---LESFLHCFT  57

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
              + +  A ++    +  L    +   +   +   DV    +M     H        +L 
Sbjct  58   DLEIYYRASVIVEFLFWALASMGIAVVVEDTLLGNDVSWLNAMGRAFSHWKICLQTNVLQ  117

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI----FG  266
             L++ G  LL + PG+++ V + F   V+    + G  ALE S  LV G W  I      
Sbjct  118  GLIILGFLLLFVFPGVIWAVNYAFAIQVVVLREMSGKSALEYSASLVRGRWEKIFGFLLL  177

Query  267  RFVLLLVISLTLSFLTARIPYV------GEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                +++I+  +S++ +RIP V          +   + + T F  +   +++  L+  Y 
Sbjct  178  FGGGIVIITQGISYVLSRIPMVFARVPDLAMFSGIVTDVATTFFSVVCTVLFLHLEYLYD  237

Query  321  GPQHPPIKRQ  330
              Q    +  
Sbjct  238  KRQEEKHQEN  247


>WP_138191041.1 glycerophosphodiester phosphodiesterase [Culicoidibacter larvae]TLG73902.1 
hypothetical protein FEZ08_07155 [Culicoidibacter 
larvae]
Length=584

 Score = 59.0 bits (139),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 26/268 (10%), Positives = 68/268 (25%), Gaps = 31/268 (12%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L+    L        +                      I        +  L ++   +  
Sbjct  81   LVSFVTLAEFGGLLVLSRNSYWYLPLMGTGAFLESLQHIKKIFRPSGIKALLYILVILPF  140

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                    L  ++     ++  + L  IL    +    +++    +     +FF  + + 
Sbjct  141  LQIGFSSALIPAI-----NIPEYILSFILASPWLTVLLVIVFALCVYLTTRWFFAIHFVV  195

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS---------------------  279
             +N      L+ S  L  GH   +     ++ +++                         
Sbjct  196  LENEDLATGLKHSNELTKGHKIRLLFAVFVIQIVAAIAGWLLLLIPDLLYGVIVAIENSS  255

Query  280  -----FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
                 F    +  +G    +   +L  P +  Y  +++   +  Y        K      
Sbjct  256  MLWDIFFMLFLTILGIVVTIVPLILGLPLAISYSTVLFYRWRNRYERFVPIEEKVLPEVR  315

Query  335  TAAIFGWMLIPGLLLVSLSRQNLSAEQL  362
               I    +   +L+V+      +  + 
Sbjct  316  VPLIQRKFIRIIILIVAAGAILWTGFRN  343


>MBI1946177.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Deltaproteobacteria bacterium]
Length=313

 Score = 57.9 bits (136),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 36/312 (12%), Positives = 80/312 (26%), Gaps = 13/312 (4%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
            +C +CG+               A C  C   +    +     +        P  G     
Sbjct  2    QCSNCGSAL---------APGQATCSACGLVVGGGASVGAFGERPVEARPMPPPGPYNIR  52

Query  65   PSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI  124
                  +               L     +              A S           L I
Sbjct  53   FGVSEPLTETFRLWGNDLGRLVLMTLIPYGFLLPFGIGFGVWAAISMSSGGEPSTTQLVI  112

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
                           +L   A  +   +   +       V    LG    +G + +    
Sbjct  113  LGSVGFALAVVAGVLMLASSAGAMLLADDKERTGGTGLGVWQAFLGGLARSGWLILANLL  172

Query  185  TDVGLFRSMKLGLRHVGSFTL-LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
                +     +      +  +     L  +    +L   + G+           ++  + 
Sbjct  173  YVAIIVVLWGVPFAVPVALLVETEQPLWALGFVPALGTFVAGVWLLTRLMPMLPLIVVEE  232

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF---LTARIPYVGEAANLAFSLLL  300
            +    AL ++  L +G    +F   ++  V  + +     + + IP +G    LA +++L
Sbjct  233  LSVGVALSRAWQLTAGRALDVFLANLVFGVALMGIYMAVGIISIIPLLGLLIQLAANIIL  292

Query  301  TPFSFLYYYLIY  312
                 +Y + +Y
Sbjct  293  GSLQSVYAFTLY  304


>RHA44277.1 hypothetical protein D1825_02125 [Cellulomonas rhizosphaerae]
Length=170

 Score = 55.9 bits (131),  Expect = 3e-06, Method: Composition-based stats.
 Identities = 23/171 (13%), Positives = 50/171 (29%), Gaps = 15/171 (9%)

Query  148  LNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL----GLRHVGSF  203
            ++         I+        L  +    +         +      K      +      
Sbjct  1    MSFGLSFTFILIIGIFSLVFYLVEAAFVRAALKVTYGQRLEFADFFKFENATNVLITALL  60

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
               + L++ +V    L+  +  L   +   F  + + D N+  + A++ S  LV  +   
Sbjct  61   IAGINLVLGLVTWIPLIGQVVSLAANLALLFTLWFVVDKNLSPIDAIKASFELVRANLST  120

Query  264  IFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
                        +    L A I +VG       SL+  P   +    +Y +
Sbjct  121  T-----------ILFYLLAAVILFVGFLLCGLGSLVAIPVVLVATSYLYRN  160


>MBI5634317.1 hypothetical protein [Nitrospirae bacterium]
Length=760

 Score = 59.0 bits (139),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 51/515 (10%), Positives = 140/515 (27%), Gaps = 40/515 (8%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W L       + G  +L  ++      +  ++        +N    + +L    A     
Sbjct  128  WGLIAGVKAEICGAAILTGLIYGIFFMTGYVIPLFMGEAGRNLYVAYGLLYGFGAISYPL  187

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG-------SLLLII  223
             + +         K +V +    +   + +   +L L ++   +            L   
Sbjct  188  TAGLIMMGVKRAAKDNVSVSMIFRSINKGILLLSLFLTVVGYALFALFYSLGADHFLAGS  247

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
              +LF   F  C  ++ +  +G   AL     ++  ++  I   ++ L+ I++  SFL  
Sbjct  248  GEILFFPVFALCVPLIEEKGMGPFTALRTFFRMLFRYFPTIAALYLSLICINIFGSFLL-  306

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWML  343
                          +   P SF+   +++ ++ A     +                    
Sbjct  307  -----------IGFIWTIPLSFVASGVLFRNIMAAPALVEV----EGKSMTGDRQRPVPT  351

Query  344  IPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLL  403
             P       +      +  ++    +    G   +            P  +++    +L+
Sbjct  352  APARQTAGPTLTGNEWQNAVAVVLLVMIIAGVSTRLWALKESRNIHPPVNVAANSRNVLV  411

Query  404  SKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKV  463
               +              +    F    +   L L  + +             ++ +   
Sbjct  412  HSDKALFMLSPDGRTERRVELSTFGLRREPADLELLEDGTIL-----IGDMDKKVILRCS  466

Query  464  LDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLEL  523
             ++ +       +++          ++       +   +  L      + +     +L L
Sbjct  467  PENGSCQTIGPPNNYRIEENFKFVADEKRNLLFVADTNNHRLI----VQDLAGTAYRL-L  521

Query  524  TLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDR-TDLLNVHASNSHAE  582
              P  I        +       GG  +    L    ++ +  GD   D   + + +    
Sbjct  522  ESPSNI-----AYPNDMGLDDSGGLWIS-NTLHERIMSFQVEGDAVKDSSRIISLDPFGA  575

Query  583  PLREIGFTWQKSGDAFSLRQMFDGNIESITVLVAG  617
             +  +G   Q S D   +        + +  +   
Sbjct  576  AVTALGQALQGSSDRQKVMADLQAAKKDMEAIKKD  610


>MBI4705907.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=242

 Score = 57.1 bits (134),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 29/199 (15%), Positives = 62/199 (31%), Gaps = 11/199 (6%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                 ++  +        + + +        +  G  +     +  IL  L V  G LLL
Sbjct  55   LVNFLVMTFMEGGMTLFALKVARGQPYELGDIFKGGPYFAPLLVANILTGLAVAFGLLLL  114

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I+PG++  +       ++ D N+  + A+++S  L +GH   IF               L
Sbjct  115  IVPGIILALGLSLTVPIVVDRNLHAVDAMKESWRLTTGHKVGIF-----------VFGLL  163

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
             A +  +G  A     L++ P   + +  + +  +               +P + +    
Sbjct  164  MAALMLLGLLACCVGVLVVAPLGQIAWVFLPATERPADGARGIGSGPGPQIPQSTSSSNS  223

Query  342  MLIPGLLLVSLSRQNLSAE  360
               P               
Sbjct  224  SPRPYCSSGGAPATPSRMS  242


>SES89824.1 hypothetical protein SAMN05443572_101512 [Myxococcus fulvus]
Length=289

 Score = 57.5 bits (135),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 42/212 (20%), Positives = 71/212 (33%), Gaps = 0/212 (0%)

Query  75   TVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFA  134
               C     + C       R +    R      A +     R    LL  + + +   F 
Sbjct  24   CDRCGSFGCTGCRGHPEATRCAPCVRRIREDPTAMAPTRLLRDACSLLLQHRVLLAGLFI  83

Query  135  PIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMK  194
              +  L+   A     Q+       L+A    + + L                    S  
Sbjct  84   LQWLCLMGSGALLHLTQSTLPGITSLVAAELGLSILLMRAVARRLREQAGLPALDVSSPW  143

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR  254
            + L  +    L+ +L  LV+ GGS LL++PG+   +        +A D  G L AL  S 
Sbjct  144  VVLARIPLLFLMYVLTFLVILGGSFLLVLPGVYMALCTSLAPAAVAVDGKGPLAALRLSY  203

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
             LV GH   + G  + +LV+      + + + 
Sbjct  204  QLVRGHRRRLLGALLPVLVLMPVTDTVGSLLT  235


>WP_162601381.1 hypothetical protein [Occallatibacter savannae]
Length=327

 Score = 57.9 bits (136),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 22/244 (9%), Positives = 58/244 (24%), Gaps = 7/244 (3%)

Query  69   LEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLG  128
                                  R  +   +        +                   L 
Sbjct  39   PPATLFATLAAAMGSFMIPFIARMPKNMSAEQTPRMGFVFFFCMSVVMTLCSAAFAPSLA  98

Query  129  IVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG  188
                 +              +      ++   LA     +L  +     +  +   + + 
Sbjct  99   ACCYASVHADLGSSHVTFRESYAFACKRYWSYLALFFLTILIATSPVLVLEAFTAVSALS  158

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
            L    +     + +   + +++ + V           ++  +           +++G   
Sbjct  159  LKYHPREIGPGLVALIPVGMIVFIGVYV-------LSIVLTLRISLAFPACVTESLGASA  211

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYY  308
            AL +S LL  G    IF   +++         +   +  +  A   A SL +     +  
Sbjct  212  ALRRSDLLTRGAKGRIFLVLLVVYAACYAAYLVGFFVVAIWFAFLSAASLTMGGHVPIIL  271

Query  309  YLIY  312
             LI 
Sbjct  272  TLIM  275


>TXH43255.1 hypothetical protein E6Q90_07385 [Actinobacteria bacterium]
Length=382

 Score = 58.2 bits (137),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 27/125 (22%), Positives = 47/125 (38%), Gaps = 15/125 (12%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            L +  G L+ ++  +   +        LA ++ G L AL++S  LV G +W   G  +L 
Sbjct  241  LALFAGGLVALVLSVWASIGLVLTTPALALEDAGALHALKRSWHLVKGAFWRTLGIVLLG  300

Query  272  LVISLTLSFL----TARIPYVG-----------EAANLAFSLLLTPFSFLYYYLIYSDLK  316
             ++   +  +     + I  VG             A L   L+  PF      L+Y D +
Sbjct  301  TIVGQAIGSVAAAPFSLIGGVGAELTTVSVFALAMAGLVSVLVALPFVAGVITLVYIDRR  360

Query  317  ANYRG  321
                 
Sbjct  361  IRTEN  365


>MAG18385.1 hypothetical protein [Candidatus Diapherotrites archaeon]
Length=389

 Score = 58.2 bits (137),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 31/195 (16%), Positives = 60/195 (31%), Gaps = 8/195 (4%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            + +    ++ AF  ++  LL +         Q   + I        L   S +      Y
Sbjct  66   VILIPGVLIYAFLYVYLTLLAQVRALQIVGFQTAPFTIGKLIKLIFLEIFSAIIALTSWY  125

Query  182  ICKTDVGLFRSMKLGLRHVGSFTL--LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
              K          L L  V    +  + + L L+     L  II      +       + 
Sbjct  126  HKKWKFYFLGLFFLMLIGVVGAFISPVFLTLNLIALLFGLPYIILIYYNALRLSLSASIF  185

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL-----  294
               + G    L ++  L  G+  +IF   +L+ ++ + +S     I  V +         
Sbjct  186  LHKDQGIFDTLHEAWDLGKGNVPSIFVAALLVGILLIVVSIPIMIIALVLQFGAGGLLSP  245

Query  295  -AFSLLLTPFSFLYY  308
                ++ T FS    
Sbjct  246  FIGGIISTLFSNAIM  260


>WP_167149738.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Lysinibacter cavernae]NIH53814.1 hypothetical 
protein [Lysinibacter cavernae]
Length=457

 Score = 58.6 bits (138),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 33/188 (18%), Positives = 56/188 (30%), Gaps = 28/188 (15%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            L      L   I  +   V        +  +++G   A+ +S  L SG++W  FG  +LL
Sbjct  270  LGGIALVLAFAIVAIWIGVKLSLVSVCIVFEDLGVRAAIARSWRLTSGYFWRTFGTVLLL  329

Query  272  LVISLT--------LSFLTARIP--------------------YVGEAANLAFSLLLTPF  303
             VI           +SF+                          +  A +L  S +    
Sbjct  330  NVIISVASNIILTPISFIFGIFMGMLSPNGDSGQFIGAMIAFYVITIALSLVISAITLVA  389

Query  304  SFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLL  363
                Y LIY DL+    G     ++        A    +  P L   + +        + 
Sbjct  390  QTACYALIYIDLRMRKEGLDVELMRFTETYPNPAPTDDLPNPYLPRFAPAPSAGGGYVMP  449

Query  364  SAGKDIQQ  371
             +     Q
Sbjct  450  PSPSPWAQ  457


>WP_155913061.1 hypothetical protein [Mycolicibacterium sp. CBMA 361]MUM33813.1 
hypothetical protein [Mycolicibacterium sp. CBMA 361]
Length=170

 Score = 55.5 bits (130),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 22/147 (15%), Positives = 52/147 (35%), Gaps = 0/147 (0%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             V + +  +                 +        R +G+F  + +L+ L+   G+L  I
Sbjct  6    VVIFAIAFVMSNCLMATQLDVGDAKPVTLGTFFKPRRLGAFLGVSLLIFLMTAAGTLACI  65

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            +PG++      +  Y + D  +  + A++ S  LV  +       +++ +       F T
Sbjct  66   VPGIILGFLAQYAPYFVVDRQMDPVAAIKASFTLVRDNVGTTILVYLIGMAAVFVAEFGT  125

Query  283  ARIPYVGEAANLAFSLLLTPFSFLYYY  309
                 +G  A +   + +     +  Y
Sbjct  126  VLTCGLGGLALIPAMVSIMGLIHVVTY  152


>MBE6875750.1 DUF975 family protein [Ruminococcus sp.]
Length=374

 Score = 58.2 bits (137),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 35/297 (12%), Positives = 72/297 (24%), Gaps = 20/297 (7%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             T+    +         +                          + +   L +   SLLL
Sbjct  97   ITIFIASILEIGKCRYFYHARQGDSDFGNLFWAFQGGRYMPCVKVNLQRYLEIFLYSLLL  156

Query  222  IIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            IIPG +  + +    Y+LA++ N+   +AL  S+  + G     F      +        
Sbjct  157  IIPGYIKLLEYALVPYLLAENPNLSRERALSISKQTMYGEKMKYFLLIFSFIGWW-----  211

Query  281  LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFG  340
                  ++G        L + P+        Y+ ++A          +            
Sbjct  212  ------FLGVITCYLGFLYVAPYMQATDTEFYACMRAKMLATGITTEEELTGYNN-----  260

Query  341  WMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYK  400
                 G           +A+         Q +   Q     +              + Y+
Sbjct  261  ---FGGFNGGYPETNPYNAQNPYQNPNPYQSQNPYQNPNPYESQSPYQNPNPYEPQSPYQ  317

Query  401  LLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSAR  457
                 Q  +     L   P       F   D +     +  ++   N       +  
Sbjct  318  NPNPYQPDSMPGINLDKTPYDTQGSDFNQPDNSSAPDSRPRVNLDKNPYDDNNPNHF  374


>MBD3176542.1 hypothetical protein [Armatimonadia bacterium]
Length=233

 Score = 56.7 bits (133),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 29/172 (17%), Positives = 59/172 (34%), Gaps = 0/172 (0%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
                A+  I   ++      L           +     ++   ++               
Sbjct  7    WFEQAWKIISEDVVQWFLIGLVATLIMGAAYCIPIAGLWLAGPINVGVILAIRARWNGQR  66

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
                 +  G +  G   L L+L  ++V  G LL IIPG +   W+     V+ ++ +G  
Sbjct  67   PDINHLWEGFQFTGVAGLYLLLYWVLVSVGLLLCIIPGFILAAWWCLALIVIHEEGLGAW  126

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
            +A+++S+ LVS   W      V++ +    +S +      VG         +
Sbjct  127  EAMQRSKELVSKDLWNWLLLIVVMGLGYSVVSMVPVIGGAVGFVFIQIVLFI  178


>PIT97047.1 hypothetical protein COT77_03640 [Candidatus Berkelbacteria bacterium 
CG10_big_fil_rev_8_21_14_0_10_41_12]
Length=220

 Score = 56.7 bits (133),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 40/197 (20%), Positives = 76/197 (39%), Gaps = 8/197 (4%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
             W  +   +L + LA   I   LLL    +       +    LL  V  ++  +  +   
Sbjct  26   YWIFVSTLVLTLTLALYFIRYFLLLALNLFHGEWLIFFSIVFLLIEVVLVIFAIECIILC  85

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
              +   K  + +   ++        F   ++L I+V     +L +IP       +     
Sbjct  86   SILLHEKKRISILDVIQDSYEKYYRFLPTILLFIVVSAVSLILFVIPAFFVVPKYSLSLV  145

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            +   ++   ++AL++S  L  G++ A F       VI+  + FL  RIP VG    + F+
Sbjct  146  ICLRESNDPVRALKRSAKLTRGNFPASFWV-----VITSAILFLVIRIPIVGWLFGVFFA  200

Query  298  LLLTPFSFLYYYLIYSD  314
            +   PF      L+Y +
Sbjct  201  V---PFIANTLVLLYYE  214


>WP_066055597.1 hypothetical protein [Bacillus korlensis]
Length=247

 Score = 57.1 bits (134),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 39/245 (16%), Positives = 82/245 (33%), Gaps = 8/245 (3%)

Query  93   FRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQN  152
             +                   F R     L +    ++      F   ++  +       
Sbjct  1    MQHITFSDIEHKSTFQIFKLSFQRGKSAYLSLVFYLVLFLPLLFFLENIIVSSLNTFRGE  60

Query  153  QNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL  212
            Q + +      ++        +         K  +    S  + L+ + +  L  I+  L
Sbjct  61   QAFLFFFDNLFLSLYYAFTVLIFSKNSDSFFKVQIS---SFSVLLKTLPAILLASIVYYL  117

Query  213  VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
             +  GSLLL+IPG+LF +WF+    V+  +    L +  +SR L+ GH++ +    ++  
Sbjct  118  SILFGSLLLLIPGVLFLIWFYLYPIVITSEKQKSLASFNRSRYLLKGHFFQLLILLLVYT  177

Query  273  VISLTLSFLTARIPYVGEAANLAF-----SLLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
                 L  +  ++  +   AN         +L  PF     +L Y  ++A          
Sbjct  178  ATRYVLEMIFQKLQLLNGMANDLVAYTLSGVLTLPFEATALFLFYLSVRAEKEAFNFSEF  237

Query  328  KRQWL  332
             + + 
Sbjct  238  NKTFY  242


>WP_039391209.1 hypothetical protein [Novosphingobium sp. MBES04]GAM05590.1 lipoyltransferase 
[Novosphingobium sp. MBES04]
Length=205

 Score = 56.3 bits (132),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 22/140 (16%), Positives = 46/140 (33%), Gaps = 2/140 (1%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
               A  +   +  T      +    + L   +   L  +GS         L      + L
Sbjct  23   VGEAISVALRAVPTLIGATLLYIVGITLVSFVLGLLVALGSLVAGTAGASLGGVVMLIGL  82

Query  222  IIPGLLFCVWFFFCQYVLADDNIG-GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
                +   V       V+  + +   + A+ +S  L  GH   + G + LL++  L +SF
Sbjct  83   FGVSIWSAVRLSLLVPVIVREGLSNPVAAMRRSWELTGGHTPRLLGFYALLMLGYLAISF  142

Query  281  LTARIPY-VGEAANLAFSLL  299
            + + +          A ++ 
Sbjct  143  MVSLLLITPLSLVFGAGAVT  162


>THB73204.1 hypothetical protein D3926_24290 [Desulfobacteraceae bacterium]
Length=237

 Score = 56.7 bits (133),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 30/208 (14%), Positives = 73/208 (35%), Gaps = 4/208 (2%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                    I +L +++         L + A   +  N    +  +  +       ++   
Sbjct  19   FFRDYFRPIMMLTLMINVPFSLVIGLERLAKLPDILNSFLLYLSIGYSFLVFPYAMAVHV  78

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
                    + +  +   +     H G F    +  I ++  G +LLI PG++  V     
Sbjct  79   KLYSQIALEGEYDIRECLIYARDHYGPFLFASLCSIAIIFIGFILLIFPGIIVSVLLSLF  138

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP----YVGEA  291
             + +  + +  ++AL+KS  +   H+W IFG    + ++ + ++                
Sbjct  139  PFFMIFEQMEPIEALKKSVSVAKYHFWKIFGPTFAVQLMIMIVTLSGPFASKSRGIHIYL  198

Query  292  ANLAFSLLLTPFSFLYYYLIYSDLKANY  319
            A++   L  T  S+L   +++     + 
Sbjct  199  ASVFVDLATTLLSWLCVIILFRVYYIHR  226


>PZS01445.1 hypothetical protein DLM69_04945 [Chloroflexi bacterium]
Length=354

 Score = 57.9 bits (136),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 27/227 (12%), Positives = 73/227 (32%), Gaps = 22/227 (10%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
               ++    +       +      +      +L      + +  + +  +         +
Sbjct  126  FQSISPFSSYDYNDTISSLGHFYASLGVYIILLAGINVLMQVATAALIYAANERYHGRTI  185

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV-----WFFFCQYVLADD  242
                + +     + +    + L+ L+     +  +    +  V     +FF     L  +
Sbjct  186  TTGAAYRYVGGRISALLGWVALVYLLFIIAGIGFVFLIGIIAVPIIAPFFFISLPALIVE  245

Query  243  NIGGLQALEKSRLLV-SGHWWAIFGRFVLLLVISL--------TLSFLTARIPYVG----  289
              G  QAL++SR LV  G+W    G ++  +++           L+ L +++P +     
Sbjct  246  RAGPGQALQRSRELVGKGYWGMSMGLWLATVLLIWLLTQGVGTVLTLLLSQLPGLNPTTV  305

Query  290  ----EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWL  332
                 A +     +L P S + + + Y +L+          +    +
Sbjct  306  LALDLAFSGLLDFVLAPISIIAFTIFYLNLRVRIEAYDIEALAAATM  352


>HIH89202.1 hypothetical protein [Candidatus Bathyarchaeota archaeon]
Length=294

 Score = 57.5 bits (135),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 26/158 (16%), Positives = 54/158 (34%), Gaps = 8/158 (5%)

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             +    G  + Y+  + +       L L       LL    IL+        ++  +   
Sbjct  117  LIRGAYGRYWGYLKTSILIYLGFFALVLIPALIIMLLGTGGILMGTILLAAAVVLIIYLD  176

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP---  286
            +           +N+   ++L +S  L+ G  +      V+L V+SL    +  +I    
Sbjct  177  IRLSLYPQAFYLENVYATESLGRSYELLRGRVFLTLILQVVLWVVSLIPGVVVGQITGYF  236

Query  287  -----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
                 +VG A +L  + L  P   +   + Y  ++   
Sbjct  237  SGQLWFVGSALSLIGTALAAPIGPIALTVWYYSMRTRE  274


>MBI2922678.1 hypothetical protein [Planctomycetes bacterium]
Length=233

 Score = 56.7 bits (133),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 30/174 (17%), Positives = 56/174 (32%), Gaps = 4/174 (2%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
              +         L      +  A     + +++  LS           + +      +  
Sbjct  15   FAAQWQTWVVMSLVCFVCIFAAAFTCVALPFVVGPLSAGMYVAAFKQLRGERPELADLWA  74

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
            G    G    + IL  L +  G LLLIIPGL+  +W+ F          GG +A+     
Sbjct  75   GFNFFGPTLAITILGGLAIFCGYLLLIIPGLVLAIWWLFSIPAAVATGCGGFEAMSA---  131

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
                     F   V  +++ + +  +   IP+ G A  +    L     +   +
Sbjct  132  -SRARVTQGFWSVVGFILVLMVVQMIAGFIPFGGVAIGMPLQTLAIAIVYRNLF  184


>PHQ62737.1 hypothetical protein COC10_09890 [Sphingobium sp.]
Length=219

 Score = 56.3 bits (132),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 24/154 (16%), Positives = 48/154 (31%), Gaps = 0/154 (0%)

Query  145  ATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT  204
                        + +L A   Y ++  + +            V            +GS  
Sbjct  1    MFSAANILIPVFFLLLTALFYYGVMYGTVLGYIRLYIENHGTVDQEVLKSEVRSKLGSLI  60

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
             L  L+ ++V  G +L ++PG+   +       +L  +N      +     L+ G WW  
Sbjct  61   SLTFLIAIIVFFGLMLCVLPGIYLAIPLSLGWAMLVFENKPVGDVISDCFKLIKGEWWMT  120

Query  265  FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
            F    +L +I    + + A    +        S 
Sbjct  121  FATIFVLSLILGVANVVFAMPATIYGIIKGFTSA  154


>WP_142094151.1 hypothetical protein [Propioniferax innocua]TQL58352.1 glycerophosphoryl 
diester phosphodiesterase family protein [Propioniferax 
innocua]
Length=411

 Score = 58.2 bits (137),  Expect = 4e-06, Method: Composition-based stats.
 Identities = 30/265 (11%), Positives = 70/265 (26%), Gaps = 25/265 (9%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
             +L  +L                +  +   + +   +     + + +  +   + + +  
Sbjct  150  VMLPGLLTAFVSPMVSTATHGGAIGLRTGPFAYLRTIIGRGLVGIVVGLLVYVIMVVVLG  209

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
              VG+       +           L + V     +L +   +    W  F  YV+  + +
Sbjct  210  IAVGV-------VVLTHIAGDSTALTVAVGIVMGILAMAAIITVSTWMLFAPYVVYREGV  262

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE--------------  290
            G + A+ +S  LV   +W   G + LL +I   +S + + + ++                
Sbjct  263  GPITAMRRSYRLVRHAFWRTLGIYFLLNIIGNIISSVLSYVIHIPATVVAAVGEMAGLQV  322

Query  291  ----AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPG  346
                        LLTP       ++Y D +    G      ++             L   
Sbjct  323  ASTVLVTGIGYALLTPLLAAGVTMLYLDRRIRTEGFDVELGRQAQQVAADPASEQWLTSP  382

Query  347  LLLVSLSRQNLSAEQLLSAGKDIQQ  371
                                     
Sbjct  383  APSSPAPPNPAPPSPAPPNMGQAPW  407


>WP_114575805.1 hypothetical protein [Saliphagus sp. LR7]
Length=269

 Score = 57.1 bits (134),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 23/117 (20%), Positives = 46/117 (39%), Gaps = 7/117 (6%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
            +  ++ +  VG G   L++PGL+    F F    +A D +G ++A+ +S  L  GH   +
Sbjct  132  VASMVGLFAVGIGLAALVVPGLVVATLFAFTHPYIATDRLGVVEAMGRSYELTKGHRIRV  191

Query  265  FGRFVLLLVISL-------TLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
            F    + ++            +     +P   E  N+AF  +    +       +  
Sbjct  192  FAILAVTVLAFYAVTTVGALFAVAVGGLPIAAELVNVAFGAVGWLVALSILAAAFDR  248


>WP_152391887.1 hypothetical protein [Paenibacillus guangzhouensis]
Length=238

 Score = 56.7 bits (133),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 27/163 (17%), Positives = 61/163 (37%), Gaps = 1/163 (1%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
            +    ++  + ++          ++G+  + + G ++     L +I++ ++   G LLLI
Sbjct  68   SAILTIIPAAIISMIFLKIEHGEEIGIGAAFRRGRKYWFKLMLYMIIVRIMTSIGLLLLI  127

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            IPG++      F QYV+  +   G   + +S  +V G     F     + +  +  ++L 
Sbjct  128  IPGIVIGTRMLFVQYVIMLEGTYGNNPIGRSNDMVKGRTGQFFLIAAGIGLAQMGCNYLF  187

Query  283  A-RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
               +       NL   +L   F      L +       +    
Sbjct  188  QTYVHLDSWLLNLVTGMLSELFFEFMTVLFFIAYLYIRKYENP  230


>MBI9051909.1 hypothetical protein [Anaerolineaceae bacterium]
Length=239

 Score = 56.7 bits (133),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 49/239 (21%), Positives = 87/239 (36%), Gaps = 15/239 (6%)

Query  92   EFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQ  151
                  +    I  +L +SW+LF +    +L I L   +     +    +       N  
Sbjct  1    MDHTIKTEKYGIKHILNESWKLFRQNLNDILKIILCIHLPINIILALLQVYISTLKWNEF  60

Query  152  NQNWQWAILLAT-VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
            N      I       +  +    +   +   + + D     +++ G     +      L 
Sbjct  61   NTQIYRLIDSGLDTLFGTIATIAIVLLINRSLEEEDTSWHINLRNGFSKWSAVIATFFLK  120

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
             +++ G  L+L IPG++F +++FF   V+   N+  + ALE SR LV G WW I G   L
Sbjct  121  SILLAGLILMLGIPGIVFAIYYFFVIQVVVLRNLSWVDALEYSRKLVLGQWWHIAGITFL  180

Query  271  LLVISLTLS--------------FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
            L +I + +               ++      +    N  F +  T F     YL  SD 
Sbjct  181  LGIIYIFIYAAILFPITQISHNPYINILPNTMYSIINGFFIVASTIFFLNTDYLRQSDQ  239


>MBR97464.1 hypothetical protein [Dehalococcoidia bacterium]
Length=340

 Score = 57.9 bits (136),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 43/323 (13%), Positives = 99/323 (31%), Gaps = 16/323 (5%)

Query  6    CPHCGAERNTPSSKLPAKKSS-ARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
            C +C  E    +S   A  +      + C +L F+     +     + A        + +
Sbjct  6    CKNCDFENLEEASYCGACGNELGNACKNCGSLNFNTNVCTKCGRVISSALKEEEFEWQAV  65

Query  65   PSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI  124
              +       T        S     E     S S     S   +             + +
Sbjct  66   IPENNPTMEHTKVVTEPRYSLPRPGEISNLISTSFNAYKSNASSFLTLALISSIPATIFL  125

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
             +   +  +         + +  ++  N    + +L   +   ++  + +      ++ K
Sbjct  126  AVGDDLFNYFSGLMEDTTEKSPTIDRPNWALIFPLLTLAILGEIVSTASIIFGSAQHMNK  185

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG-----LLFCVWFFFCQYVL  239
              VG+ + +      +     + I+  LV+     L I        +   V F+F    +
Sbjct  186  EKVGVSKCVSYSFSSIFRLISVXIIFALVLIIPGXLSIFFIGIPLLIFLIVKFWFVSCFV  245

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP----------YVG  289
              +N G  +A   S  LV   WW  FG  +++++I+   S L + +            + 
Sbjct  246  IIENSGITEAFRGSWNLVKSKWWXTFGTGLIIIIITALASALMSFVTGQAGSLLDNEIIT  305

Query  290  EAANLAFSLLLTPFSFLYYYLIY  312
                   +  + PF  +   + +
Sbjct  306  HVLRGIATTXIAPFQAISTGIYF  328


>HAJ90839.1 hypothetical protein [Rhodospirillaceae bacterium]
Length=76

 Score = 52.9 bits (123),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 10/39 (26%), Positives = 14/39 (36%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M    CP C A  N P+  +       RC +C      +
Sbjct  1   MIL-TCPSCSASYNVPNEAIGPDGRQVRCKKCKHEWFQE  38


>SMF15505.1 hypothetical protein SAMN02745866_01010 [Alteromonadaceae bacterium 
Bs31]
Length=1113

 Score = 58.6 bits (138),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 41/300 (14%), Positives = 86/300 (29%), Gaps = 27/300 (9%)

Query  356   NLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGL  415
              +S   ++                + +    L            K         +S  G 
Sbjct  808   MISNPMVIFPADGFTFLPKEPEAYSTEQWDKLLAFSVDFKPDQEKRSWYGDPVASSRSGP  867

Query  416   SLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQ  475
                 +         D        +L +     L       +    + + DD  R     +
Sbjct  868   VDLKLYAPQVLRNIDGPVVSAKFELNMPYARILDEKSNLVSVRVKEILFDDGNRYTEQSE  927

Query  476   HSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAE-----------QVHSILGKLELT  524
             H        +    +  E       ++IYL      E           ++  I+G+L   
Sbjct  928   HPVAMSFIGFGNKARVAELPEERLRKNIYLSGTADIEIPLEGEEYLDKRILLIVGELVFN  987

Query  525   LPLAIESLQLTRNDIGKTLQIGGK-QLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEP  583
             +P AI+++  +  ++GK  Q+    ++ +  L S  ++L   GD + L  +   N     
Sbjct  988   IPTAIKTMDFSEIELGKNYQLNDVLRIEVIELSSTGISLATHGDPSALAALQVLNEKNRM  1047

Query  584   L---------------REIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFEL  628
             L               +       +    +     ++G   ++ V +A    T  YP+ L
Sbjct  1048  LGSGVNFTKEQKRYGVKPNYAGEAEPPVRYVASYHYNGRAVALRVALAESKETFIYPYGL  1107


>MYE37998.1 hypothetical protein [Candidatus Spechtbacteria bacterium SB0662_bin_43]
Length=419

 Score = 57.9 bits (136),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 30/346 (9%), Positives = 92/346 (27%), Gaps = 15/346 (4%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            I  +  ++     + A+            Q+   A+ +     + +    +  ++   + 
Sbjct  63   ILAVVSIIGNILFYYAIWSSFINITKGVKQSIGDALRVDVARLLKVLFVNILLALIYVVP  122

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
               + L   +   + ++     + +LL                +F + + F  +V+ ++N
Sbjct  123  VGVLALGFLVLPDITYIQVLYGITLLLTFFATA----------IFAIRYSFVPFVIIEEN  172

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
                +A  +S  +       +F  F  ++ I + +  +   +        +  S+   P 
Sbjct  173  CSIFEAFPRSAQITKTSRTGVFILFWFVVPIVILIGLVGFLVGI--FITAIIGSI--IPI  228

Query  304  SFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLL  363
                        + +  G             T  +    +  G++   +         + 
Sbjct  229  VAYRVLRAQELGEQDNTGIVQTTNYESSKKSTRIVSMLCISVGVVFFGVLSLIGFLALIN  288

Query  364  SAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLF  423
            +  +        + +             +   S+          +  S          L 
Sbjct  289  ADNEANDFVKTIEVRSIQTALSLHQALNKEYPSSIELNRCGTIDRLISIIQYDPHDSDLL  348

Query  424  ADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDAR  469
             D F+   +NP    K++             S  +    +     +
Sbjct  349  DDMFYGASRNPETG-KIDTYVVGTKVKEPIFSGTLHSSDIDAVFGQ  393


>PIR67629.1 hypothetical protein COU50_02265 [bacterium CG10_big_fil_rev_8_21_14_0_10_33_18]PIU76388.1 
hypothetical protein COS74_04295 
[bacterium CG06_land_8_20_14_3_00_33_50]PJA71940.1 hypothetical 
protein CO152_04030 [bacterium CG_4_9_14_3_um_filter_33_26]
Length=253

 Score = 56.7 bits (133),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 45/248 (18%), Positives = 98/248 (40%), Gaps = 20/248 (8%)

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
            +L        I   +       +          ++++ +A +L+        ++  + + 
Sbjct  1    MLFGPYLILFILGLIPTLFLGSMVSDPMPALVLMIISGIASVLISFFIQVALLYAVVDRE  60

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG  245
             +      K+ L  + S++ ++  + LV  GGS  LI PG++F +WF F  +V A+++  
Sbjct  61   HISAGEIYKIALARMFSYSWVVFFVGLVTFGGSFFLIAPGVIFSIWFSFSVFVFAEEDKR  120

Query  246  GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-------------------  286
            G+ AL KS+    G W +I  R++   ++ +   F+   I                    
Sbjct  121  GMNALLKSKEYTKGLWGSILWRYIAFALVIIGPLFIFFAIFGIIMAVTSAMMGSTGQIAY  180

Query  287  -YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIP  345
             ++     +A  +L+ PF   Y Y+++ ++K           + +  P         +I 
Sbjct  181  QFISSGFQIAIQILMAPFVLTYSYMLFMNVKELKGNQVAEDPQGKKWPYIIIGVVGTIIG  240

Query  346  GLLLVSLS  353
              L+V+L 
Sbjct  241  IALMVALP  248


>MBI3887958.1 hypothetical protein [Candidatus Microgenomates bacterium]
Length=178

 Score = 55.5 bits (130),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 23/139 (17%), Positives = 45/139 (32%), Gaps = 0/139 (0%)

Query  134  APIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM  193
              IF  +                          I+  + ++       +         ++
Sbjct  30   WLIFFIVAALTVVNYVIVASKSFHLQNFLFSLLIVFLIVFLGYVYIAIVGGAAKNFKEAI  89

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
            K     +        L + ++ G   L+  P L F +WF F  + +  +N  G +AL  S
Sbjct  90   KFVFNKLPKILWTGFLSMFLMWGAFYLMFFPLLAFSIWFEFASFTVLLENKYGFEALLLS  149

Query  254  RLLVSGHWWAIFGRFVLLL  272
            R    G +  IF R +++ 
Sbjct  150  REYTRGFFGKIFKRRLVIG  168


>HFG18269.1 DUF3426 domain-containing protein [Deltaproteobacteria bacterium]
Length=284

 Score = 57.1 bits (134),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 11/77 (14%), Positives = 19/77 (25%), Gaps = 1/77 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP C  +      ++P + +  RC  C          S               G 
Sbjct  1   MI-ITCPECLTKFRLDDDRIPEEGAKGRCTRCQHVFEIRKPASPEDSFFSQGENLAEFGG  59

Query  61  QRRIPSDRLEIQSKTVN  77
                  R + +     
Sbjct  60  IDGQEGSRRKWRFPWKW  76


>OLP76399.1 Galectin-3-binding protein B [Symbiodinium microadriaticum]
Length=2816

 Score = 59.0 bits (139),  Expect = 5e-06, Method: Composition-based stats.
 Identities = 31/246 (13%), Positives = 55/246 (22%), Gaps = 8/246 (3%)

Query  76    VNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAP  135
                                    G       +      +         +  L   L    
Sbjct  789   WLAGWLVGWLAGWLVGWLAGWLVGWLVGWLFVVGWLVGWLVGWLVGWLVGWLVGWLVGWL  848

Query  136   IFSALLLKPATWL--NPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM  193
                 L      WL          W +       +   + W+      ++    VG +   
Sbjct  849   XVGWLAGWLVGWLAGWVVGWLGGWLVGWLAGWLVGWLVGWLVSWCVGWLVGWLVGWWVGW  908

Query  194   KLGL---RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV--WFFFCQYVLADDNIGGLQ  248
              +G      VG F L+  L+I+    G   L+   L + V                    
Sbjct  909   LIGWLVGALVGHFWLVGWLVIIGWLVGCFWLVACLLAWFVACLLACLLGCFWLVGWLVGW  968

Query  249   ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYY  308
                   L+V G W  +FG  V++    +  S        VG          +        
Sbjct  969   LFWVGWLVVFG-WLVVFGWLVVVFGWLVGASVGWLVRWLVGWWVGSLVGGFVGWLGGWSV  1027

Query  309   YLIYSD  314
              ++   
Sbjct  1028  GVLLPR  1033


>NUN14493.1 FHA domain-containing protein [Myxococcales bacterium]
Length=479

 Score = 58.2 bits (137),  Expect = 6e-06, Method: Composition-based stats.
 Identities = 40/226 (18%), Positives = 69/226 (31%), Gaps = 11/226 (5%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                   +    +V+    I  A+L      L P        + L     + L  +    
Sbjct  257  WNVLQPHVIPAAMVVGILTIPVAVLGWLFLSLIPALAFVYSLVSLVQSLLMPLAFAAAML  316

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             M        +    + K  +    +  + + +  LVV  G +LLIIPG+       F  
Sbjct  317  FMLRVRLGKPISPMDAWKSVMAQPVNLWVNMFVSGLVVLVGFILLIIPGIALG---MFST  373

Query  237  YVLADDNIGGLQALEKSRLLVSG----HWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
             V   +    +    +S  L       +        V L V+S  +  +   IP++G   
Sbjct  374  VVYFLEGRRMVGMNLRSVQLFVHDGLRYVLVFLLLGVGLAVLSTVVGIVFGFIPFIGSLF  433

Query  293  NLAFSLLLT----PFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
               FS +LT    P        +Y        G       R+ + L
Sbjct  434  GGIFSAVLTAVGAPLFASLIIHLYFQAIGEAEGKNAEAEARRAMSL  479


>NYI57516.1 hypothetical protein [Cellulomonas soli]
Length=364

 Score = 57.5 bits (135),  Expect = 6e-06, Method: Composition-based stats.
 Identities = 19/162 (12%), Positives = 50/162 (31%), Gaps = 28/162 (17%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH----VGSFTLLLILLILVVGG  216
                    +    +  S+   +    +GL   ++             + L  L++L V  
Sbjct  146  PVVSVATTVLSGLVILSVSRSVIGRTIGLKEVLRSWRVWYVLGFTVLSGLAQLVVLAVWI  205

Query  217  GSLLLIIPG---------------------LLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
            G+++ +                        + + V        L  +  G   A+ ++  
Sbjct  206  GAIVPLAVNDSAGAAVAVALIGGLAVAVAAVWYSVRTLLAPAALMLEGGGFWAAVARAWR  265

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            L  G +W + G ++   ++S+ +S + + I           +
Sbjct  266  LTRGSFWRLLGIYL---LVSILVSIVVSIISAPATVIAGLAT  304


>KTT97616.1 hypothetical protein NS355_11030, partial [Sphingomonas yabuuchiae]
Length=78

 Score = 52.5 bits (122),  Expect = 6e-06, Method: Composition-based stats.
 Identities = 11/44 (25%), Positives = 16/44 (36%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M    CP C      P S +  +  + RC  C  +    P E +
Sbjct  1   MIL-ECPECRTRYLVPDSAIGLEGRTVRCANCRHSWFQSPPELE  43


>WP_011194601.1 hypothetical protein [Symbiobacterium thermophilum]BAD39452.1 
conserved hypothetical protein [Symbiobacterium thermophilum 
IAM 14863]
Length=308

 Score = 57.1 bits (134),  Expect = 6e-06, Method: Composition-based stats.
 Identities = 32/203 (16%), Positives = 70/203 (34%), Gaps = 31/203 (15%)

Query  160  LLATVAYILLGLSWMTGSMFIYICKTD-VGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
             +A +    L    +  +    +     V +  S ++G    G+      LL+L+     
Sbjct  96   WVAILLLYPLYKGALLDAATRAVLHMPPVSVGESFRVGATRYGAMLGSHALLVLMWIAAV  155

Query  219  LLLIIPG-------------LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
             LL + G             +    +  F  + +  +  G   A+ +S  LV   +W + 
Sbjct  156  PLLALAGLLVLAFLTIPVGLIALATFTVFTGHAVVVEQKGAGSAIGRSFELVRSRFWPLL  215

Query  266  GRFVLLLVISLTLSFLT----------------ARIPY-VGEAANLAFSLLLTPFSFLYY  308
            G  ++  ++S  LS++                 + +P+ +        +  +TPF  +  
Sbjct  216  GTGIVFWLLSTLLSYIVVGPSSFAAGIVTAITGSFLPFTLLTLVEGLAATFITPFMAVGL  275

Query  309  YLIYSDLKANYRGPQHPPIKRQW  331
             ++Y D +    G     + RQ 
Sbjct  276  TVVYFDTRVRREGYDLEWMARQQ  298


>MBC8095534.1 protein kinase [Akkermansiaceae bacterium]
Length=678

 Score = 58.2 bits (137),  Expect = 6e-06, Method: Composition-based stats.
 Identities = 34/238 (14%), Positives = 71/238 (30%), Gaps = 1/238 (0%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            +   +V   A + + + +  + +             +  +       + +       I  
Sbjct  407  HFWPLVGMTALLLALIGVAASAFSYQTADGKSVDTSIIALLLNGPLFAGLNFYFLKKIRG  466

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
                +  +     +      L   ++  +V  G L L++PG+   V + F   ++ D  +
Sbjct  467  QRTSVETAFAGFSKRFLHLFLANFVVTALVIIGLLCLVLPGIYLFVAWVFALTLVLDKGL  526

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR-IPYVGEAANLAFSLLLTPF  303
                ALE SR  V+ HWW +F    +L ++S          I   G  A  A        
Sbjct  527  DFWAALELSRKAVNKHWWKVFFLLAVLALLSFVGLLAVGIGIFITGPIALAALMYAYEDI  586

Query  304  SFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQ  361
                         A    P+  P         A     +L+   ++  ++        
Sbjct  587  FNSPSPAAADLATAQAGNPKIVPPSSGVGWKAAIGTAAVLLVVGIVAYIAIYGAKRNH  644


>GDX17703.1 hypothetical protein LBMAG05_09990 [Actinobacteria bacterium]
Length=310

 Score = 57.1 bits (134),  Expect = 6e-06, Method: Composition-based stats.
 Identities = 22/199 (11%), Positives = 59/199 (30%), Gaps = 39/199 (20%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLI----------  211
                   +     T  +   I    + +  S K     +     L I+            
Sbjct  101  VLFIVQAVTAGMFTHVVGNAIIGKKINVTESWKKTRPQLMRVIGLSIISFLLPTSAIFIG  160

Query  212  ------------LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
                        ++V  G  + +   +   +  +     L  ++   L A ++S  L   
Sbjct  161  LFIGVALTGINSILVFVGLGIGLAAAIYCWIGLYVSIPTLVLEDSKFLVAFKRSFYLART  220

Query  260  HWWAIFGRFVLLLVISLTLSFLTARIP-----------------YVGEAANLAFSLLLTP  302
            + + + G  ++ ++ S  +S + +                    ++    ++    ++ P
Sbjct  221  NTFRVLGIGIMGIITSQAISIVVSTPFALFAQSNAREDPTTSSIFMSSMGSILGYTVMLP  280

Query  303  FSFLYYYLIYSDLKANYRG  321
            F   +  L+Y+DL+     
Sbjct  281  FIAAFTTLLYTDLRIRKEN  299


>MBI5844290.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=290

 Score = 56.7 bits (133),  Expect = 7e-06, Method: Composition-based stats.
 Identities = 27/178 (15%), Positives = 59/178 (33%), Gaps = 2/178 (1%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW-  173
              + + L     +        +F  + +         ++ W    +   +  IL      
Sbjct  37   FWQIFLLPIFLNVVCPFIVLVLFKNIGMNINFDDFDFSKLWPIFAIFIVICIILPLAIIL  96

Query  174  -MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
             +       I    + +  S+K G   +        L  +      +L +IP  +F V+F
Sbjct  97   SIIYLTEQTIQGKIITIGASLKYGFYRLLPGVWTGFLFFIAFYPLLILFMIPAFIFFVFF  156

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            F C + +      G  A + S+ +V G+WW++ G    +  + L        +  +  
Sbjct  157  FACIFAVCLRGKSGFAAFKYSKSIVKGNWWSVAGNLFAIHFLFLLCYVPFIVLLLIAA  214


>WP_166292691.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Leucobacter sp. HDW9C]QIK64358.1 hypothetical 
protein G7068_14955 [Leucobacter sp. HDW9C]
Length=393

 Score = 57.5 bits (135),  Expect = 7e-06, Method: Composition-based stats.
 Identities = 21/171 (12%), Positives = 49/171 (29%), Gaps = 2/171 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +    +L          +  +        P  +     I       +   L       F 
Sbjct  109  IYLFVVLVTAAFVPAFAAENIRSRFLGHRPSARQIWTGISRVMWRILGYSLLMGLIVGFG  168

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                  V +F  +   L  + +     +L+ L++    +   +  + F     F   V+ 
Sbjct  169  L--GIVVFIFAMIFGVLFTIAAGGPAALLIGLILAAVVIAGFVFLIWFFTKLTFVPSVMV  226

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
             +    + A+ +S  L  G  W  FG   ++ +     + L   +  +   
Sbjct  227  FEGATIMAAVSRSWQLTKGRVWRTFGTLFVVSLAFTAATGLIGLLMALVFF  277


>PIP97860.1 thioredoxin, partial [Rhodobacterales bacterium CG18_big_fil_WC_8_21_14_2_50_71_9]
Length=39

 Score = 51.3 bits (119),  Expect = 7e-06, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 12/37 (32%), Gaps = 0/37 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
            V CP CGA        +PA     +C  C       
Sbjct  2   RVTCPACGARYAVDDGAIPAGGRMVQCSACQAEWRAQ  38


>WP_003536493.1 MULTISPECIES: DUF975 family protein [Erysipelotrichaceae]EDS19219.1 
hypothetical protein CLORAM_01216 [Erysipelatoclostridium 
ramosum DSM 1402]EHM91699.1 hypothetical protein HMPREF1021_01637 
[Coprobacillus sp. 3_3_56FAA]EHQ48025.1 hypothetical 
protein HMPREF0978_00731 [Coprobacillus sp. 8_2_54BFAA]QMW75057.1 
DUF975 family protein [Erysipelatoclostridium ramosum 
DSM 1402]QPS11671.1 DUF975 family protein [Erysipelatoclostridium 
ramosum]
Length=522

 Score = 57.9 bits (136),  Expect = 7e-06, Method: Composition-based stats.
 Identities = 18/190 (9%), Positives = 60/190 (32%), Gaps = 1/190 (1%)

Query  101  RSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAIL  160
             +  +  ++ W ++            L  +          +   + +      +    + 
Sbjct  9    WAKEKTRSNKWNIWKGFLAIFAASLGLSFIALLLFSVMIDIGGSSYYDTFTFMDLAVTVG  68

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
            +  + ++++G S         I + D+     ++  +         ++ + L+   G+L 
Sbjct  69   VFALYFLVIGFSVNIYRYIKKIVQEDIADLNELRAPIGQYFKQGFGVLAVGLICVLGTLA  128

Query  221  LIIPGLLFCVWFFFCQYV-LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
             ++PG++  +      Y+     ++   +A+  S  ++ G     F  F       L  +
Sbjct  129  FVVPGIILGLGLSMTPYLLANYPSLSIFEAITTSWKMMQGKKMKCFVLFFSFYGWILLST  188

Query  280  FLTARIPYVG  289
                 +    
Sbjct  189  VTLGILFIWL  198


>NLY05342.1 hypothetical protein [Candidatus Atribacteria bacterium]
Length=223

 Score = 55.9 bits (131),  Expect = 7e-06, Method: Composition-based stats.
 Identities = 31/218 (14%), Positives = 71/218 (33%), Gaps = 3/218 (1%)

Query  99   GLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWA  158
                              R   ++ + L+  +L     +               +     
Sbjct  1    METFDFWRELKKNWNLIIRYPQIILVTLIPSLLIATSNYLTWQGSLTWVSVFFQEAGSAI  60

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            +L+  V    + LS +    + Y  +  + L R+ ++  + +    ++ +++  + G  +
Sbjct  61   LLILGVFLSFIALSLVIAIAWDYRYRDVIDLRRAWRIVQKRLPDIVMVSLIMGFIEGFFA  120

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS---GHWWAIFGRFVLLLVIS  275
            +  +  GL+F     F   +   +      A+  S  LVS   G  +  F   +  L+I 
Sbjct  121  IWFMFLGLVFAFLLLFTIPLTVVEGDNPFSAIRNSFHLVSENLGECFTFFIIALFFLLIG  180

Query  276  LTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
              L +L   +   G   N     L+  ++ L     Y 
Sbjct  181  YLLFWLLGFLGIAGLILNTVVGALILAYASLLLSSFYF  218


>NLA27476.1 hypothetical protein [Firmicutes bacterium]
Length=273

 Score = 56.7 bits (133),  Expect = 7e-06, Method: Composition-based stats.
 Identities = 30/207 (14%), Positives = 67/207 (32%), Gaps = 9/207 (4%)

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI  167
                 ++   GW  L I LL        +  +         +    +           Y 
Sbjct  22   RRFLHIYIAMGWIYLPIALLYSYYYNKTLGWSFAGALGQTGSDVGFDILLQTYALLGIYT  81

Query  168  L-------LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL--LILLILVVGGGS  218
            L       L  + +   + + +   +       +    +     LL    ++ + +  G+
Sbjct  82   LVQQILRPLTAAGVVKVVTMTLRGEETSPGDIFRSIFENWNWLKLLALGAVITVTLFMGA  141

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            L L++P L F V F     VL  +     +AL +S  +V      +   F+L+ +++   
Sbjct  142  LALLLPALFFSVTFSLVTQVLMVEGGSVWRALSRSWSMVLKDLGRVALVFLLMGLLTYFA  201

Query  279  SFLTARIPYVGEAANLAFSLLLTPFSF  305
            + +      +G    +AF +    +  
Sbjct  202  ASVVTTPITLGMGLLIAFEVENIIWFM  228


>WP_067390131.1 DUF975 family protein [Enterococcus canis]OJG19460.1 hypothetical 
protein RU97_GL001031 [Enterococcus canis]
Length=249

 Score = 56.3 bits (132),  Expect = 7e-06, Method: Composition-based stats.
 Identities = 37/200 (19%), Positives = 65/200 (33%), Gaps = 5/200 (3%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                L G+ +   + A   IFS   L   T  +  +     A++   +    L  S    
Sbjct  18   HYNSLAGLTVFWGLFALLQIFSHYKLVGLTKESESSMISAVAMIGMLLLLPFLADSLYKS  77

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
                     D         GL    +   L +  ++     S LLIIPG++    +    
Sbjct  78   LQICREGGIDSTWRPVWVKGLPEYCALLGLTLAHVIFTFLWSCLLIIPGIIKSYGYSQAF  137

Query  237  YVLADDN-----IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
            Y+  +       +   QAL++S  L+ GH W  F      +  +  LS   +   ++   
Sbjct  138  YIFMEGRAQGTKVTFRQALKQSNQLMKGHKWTYFKIQFSFVGYTFLLSSFASLPIFISSF  197

Query  292  ANLAFSLLLTPFSFLYYYLI  311
               A S        ++  L 
Sbjct  198  FTEAASGFRLLICLIFGILF  217


>HAC1236277.1 DUF975 family protein [Listeria monocytogenes]
Length=244

 Score = 56.3 bits (132),  Expect = 7e-06, Method: Composition-based stats.
 Identities = 41/257 (16%), Positives = 78/257 (30%), Gaps = 14/257 (5%)

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             +++ I + +      M  G +  G   L  +L+ +     SLLLI+PG++    +    
Sbjct  1    WVYLAISRREQPDVAYMFSGFKQFGRTFLAYLLISIFTFLWSLLLIVPGIIKTYSYSQTF  60

Query  237  YVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
            ++L D+ NI  L A+ +SR +++GH   +FG  +  L+           IP     A   
Sbjct  61   FILRDNPNISALDAITESRHMMNGHKGRLFGLSLTFLLWY--------LIPLAVAIAGTV  112

Query  296  FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQ  355
                             +D                 + + A+    + I   +   L   
Sbjct  113  IVAGGMA-----TTSYTADPAEVISALAAGATFGGLVLILASWLITLGISLYVYPYLITS  167

Query  356  NLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGL  415
                   L A  +      T   +         E          +       K      +
Sbjct  168  IAVFYDDLYAATEGTFTEETVIVEEEVNPFGATEADPFAEDTHPEGFGPDAAKEPETPVV  227

Query  416  SLGPVTLFADRFWADDQ  432
               P T  A +   + +
Sbjct  228  PEAPETPEAPKDNNEPK  244


>NMB76397.1 hypothetical protein [Myxococcales bacterium]
Length=58

 Score = 51.7 bits (120),  Expect = 7e-06, Method: Composition-based stats.
 Identities = 10/34 (29%), Positives = 13/34 (38%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            V CP C         K+PA  +  +CP C    
Sbjct  2   KVTCPQCSTSYRVGDEKVPAGGAQIKCPRCSHLF  35


>MBI2551190.1 hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=294

 Score = 56.7 bits (133),  Expect = 8e-06, Method: Composition-based stats.
 Identities = 20/114 (18%), Positives = 40/114 (35%), Gaps = 0/114 (0%)

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
            ++        +     + +      S   L  L+ +    G L  +   L   + +FF  
Sbjct  125  AIMFLARYLPLVPSLVLAVIGTVFPSLNGLTPLIYIFEFIGRLAAVFIVLAVAIDYFFSL  184

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            +    +   G  A   SR +V G  +A+FGR V+  +I    +     +  +  
Sbjct  185  FTFMLEGKKGWSAFLHSREIVKGLRFAVFGRLVIPSLIFFFGAAAVGVLLLLSA  238


>NBO29941.1 hypothetical protein [Synechococcaceae bacterium WB6_1A_059]
Length=113

 Score = 53.2 bits (124),  Expect = 8e-06, Method: Composition-based stats.
 Identities = 15/108 (14%), Positives = 25/108 (23%), Gaps = 1/108 (1%)

Query  1    MPTV-RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCG  59
            M  +  CP C A       K+P + +   CP C    +F   +                 
Sbjct  1    MIEIFTCPACSARYKIQEDKIPGRGAKITCPRCGHKFVFYREDGVADDDKVPDNVGALDF  60

Query  60   LQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLL  107
                I     +   K         +     +    A    +    Q  
Sbjct  61   STMGITWRVRKGPGKHTYEFHDLNTLREFIQDGQVAQWDQISLDDQDW  108


>HAV61890.1 hypothetical protein [Verrucomicrobiales bacterium]
Length=483

 Score = 57.5 bits (135),  Expect = 8e-06, Method: Composition-based stats.
 Identities = 38/324 (12%), Positives = 80/324 (25%), Gaps = 27/324 (8%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             V C  CG     P  ++     +  C EC    +    E           +     L  
Sbjct  187  QVVCRECGKLF--PPEEVLKFGDAIVCGECKPVYLQRMREGAGW-------SGAGATLSE  237

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                 R        +  +   +F         AS      +                   
Sbjct  238  DELLQRDYDVDIGESISQSWEAFKRNAGLVIGASVVAYLVLIACNVIPILSMILPLIISG  297

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             +     +     + +  +     +     +     +       + +          +  
Sbjct  298  PLLGGLWLFYIKNVRNEEVAFGDAFSGFGPRFGGLLVTYVVSTILSMVAFIPAVICAVIF  357

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                + +  +            +  I+L +             +   V + F   ++AD 
Sbjct  358  VFIPIQMAANSGGNPEFSTPGIVATIVLAVPAFL-------ISIYLSVVWMFALPIVADK  410

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             +G   A+  SR +V+ HWW  F   ++  ++S            +G  A L   L+  P
Sbjct  411  GLGFWAAMNLSRRMVNKHWWLTFALMLVCGILSS-----------IGALACLVGVLVTAP  459

Query  303  FSFLYYYLIYSDLKANYRGPQHPP  326
             +F      Y  +          P
Sbjct  460  VAFGALAWHYQRVFGELAPQNQNP  483


>PIS11650.1 hypothetical protein COT73_02845 [Bdellovibrio sp. CG10_big_fil_rev_8_21_14_0_10_47_8]
Length=219

 Score = 55.5 bits (130),  Expect = 8e-06, Method: Composition-based stats.
 Identities = 27/195 (14%), Positives = 63/195 (32%), Gaps = 14/195 (7%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGL---RHVGSFTLLLILLI  211
            W  +++      +   +++        + + +V  F     G      +   ++L  + +
Sbjct  33   WVVSLIPFFGLIVSSPMTFGYIRCLDRLRRGEVFEFADFFWGFTSLNRLIQISILGAIHL  92

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            L    G  LLI+PG+ + +   +        N  G++++  S  +  G WW++FG  +  
Sbjct  93   LGSLVGFCLLILPGVWWGIATSWASSYFVLKNQDGMESIRASLQITKGRWWSMFGLML--  150

Query  272  LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
                     L   +   G        L+  P +FL +  +                   +
Sbjct  151  ---------LIGILNLAGVLMLGLGILITMPLTFLVFLTVVDSYADRPPPAVVVXXXEGF  201

Query  332  LPLTAAIFGWMLIPG  346
               +  +     I  
Sbjct  202  CKPSQWVTAEPWIQW  216


>KAF5813155.1 hypothetical protein HanXRQr2_Chr03g0094911 [Helianthus annuus]
Length=269

 Score = 56.3 bits (132),  Expect = 8e-06, Method: Composition-based stats.
 Identities = 24/225 (11%), Positives = 71/225 (32%), Gaps = 5/225 (2%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +     +  ++ F  +            +       +++    +            S+++
Sbjct  46   IFLGIEVAYMVIFFFVSLFAQTTVIIIASCYYTGNDFSLKELVLKVSKTWTRPFVTSLWV  105

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +       F  +   +  +  F    IL+ +++    +  I   +   V +     V  
Sbjct  106  QLLALGYTSFFLLPFLVPSLVLFDHRTILITILIFLA-IFFITFYIYLSVVWGLAIVVSV  164

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI----PYVGEAANLAF  296
             ++  G  +L K++ LV+G     F   +  +++ + ++ + +++    P VG    +  
Sbjct  165  VEDSYGYSSLGKAKELVNGKRVHGFLLNLFFILVLVVIAIIGSKLSPAMPIVGGVIQILL  224

Query  297  SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
               ++ F  + Y   Y   K +          R         +  
Sbjct  225  MGTISMFQSMAYTGFYFQCKNDMTKSGGLEYSRIPAAPVLDQYIP  269


>KYK30526.1 hypothetical protein AYK19_18230 [Theionarchaea archaeon DG-70-1]
Length=308

 Score = 56.7 bits (133),  Expect = 8e-06, Method: Composition-based stats.
 Identities = 25/183 (14%), Positives = 60/183 (33%), Gaps = 8/183 (4%)

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG  215
                    +    + ++     +   +    V +F ++       G     +  ++L++ 
Sbjct  124  FCLRNWFKLGSTNVIITIALVIIMAIVLFIPVAVFVAVFAYAYATGPSLGAVAGILLILL  183

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
               L+ ++    F   +     ++ ++    ++A+ +S  +V G         V + +I 
Sbjct  184  VFMLIALLVAGFFWARWVAAFPIMVNERTFLMEAMSRSWNMVKGKTIRTLFVMVAVFLIP  243

Query  276  LTLSFLTARI-------PYVGEAANLAFS-LLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
              L + +A +         V        S  LL P       +IY +L+A   G      
Sbjct  244  TILQYSSAFLEFSLGRSLVVLTVVFGIVSQGLLIPLVDCTRVVIYFELRARKEGFDLEKR  303

Query  328  KRQ  330
              Q
Sbjct  304  AEQ  306


>HBB03643.1 hypothetical protein [Patescibacteria group bacterium]
Length=153

 Score = 54.4 bits (127),  Expect = 8e-06, Method: Composition-based stats.
 Identities = 15/76 (20%), Positives = 26/76 (34%), Gaps = 3/76 (4%)

Query  558  NAVTLRFLGDRTD---LLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESITVL  614
              + L   G  +      +++      E             +   L   FD  I  I ++
Sbjct  1    MQIKLIISGGNSSFDVQDHINFFTPEGEEASPSNTILSYINNDGVLTYDFDTGINYIKII  60

Query  615  VAGDSMTQSYPFELTR  630
             +  S  +SYPF LT+
Sbjct  61   YSDTSTIKSYPFVLTQ  76


>MBI3091688.1 hypothetical protein [Candidatus Tectomicrobia bacterium]
Length=243

 Score = 55.9 bits (131),  Expect = 8e-06, Method: Composition-based stats.
 Identities = 38/227 (17%), Positives = 69/227 (30%), Gaps = 3/227 (1%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             L  +  + + L          L   T L               + +  L       ++F
Sbjct  15   HLGTLAPIILTLTVPVELVRSYLMLQTELLRTPGLNLLIQYGTELPFHALVEPAFFMAVF  74

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                   + +  +    LR       + + L ++   G LL +IP +   V F F     
Sbjct  75   ALPTGRRLTVLDAYGEALRWWPRMFGVYVKLWVLTMLGFLLFVIPAIWIMVVFAFADLAA  134

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA---NLAF  296
              D    L     S  LV+G    I G  +L L  S+    + +  P + +      LA 
Sbjct  135  LLDPERQLNPFSTSNRLVAGFQLRILGVVLLQLAASVLFDQILSLNPALLQVWWARALAG  194

Query  297  SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWML  343
            S       +    L+   ++A     +  P   +  P  A  +  + 
Sbjct  195  SAFAVVVRWFSIVLVIFYVRARLAHGERVPSFEELFPRLAERYPGVA  241


>WP_191474837.1 hypothetical protein [Candidatus Neoanaerotignum tabaqchaliae]
Length=265

 Score = 56.3 bits (132),  Expect = 8e-06, Method: Composition-based stats.
 Identities = 29/148 (20%), Positives = 68/148 (46%), Gaps = 1/148 (1%)

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
             ++ L    +  +   Y+    V   +S+   +     F    I+ +++   GS+  +IP
Sbjct  95   LFLPLVTMAVADATGDYLKNEAVSAKKSILNSVSKGTVFIAAAIINLVLCLVGSMFFVIP  154

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL-SFLTA  283
            GL+F VWF+F  Y +  +N G +++L +S+ L+ G +  +    + L ++S+ + +F++ 
Sbjct  155  GLIFTVWFYFFTYEIIYNNSGVIESLARSKALIKGSFLKMAVYVLFLNILSVMIDNFVSL  214

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYYLI  311
             + ++   A     +         Y + 
Sbjct  215  ILGFLTYLAAGEVFVRTIMIFGETYVVC  242


>WP_175475016.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein, partial [Curtobacterium sp. MCBA15_005]
Length=229

 Score = 55.9 bits (131),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 14/100 (14%), Positives = 33/100 (33%), Gaps = 0/100 (0%)

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
                   +G+        +L +    L + +  +     F      +A +      A  +
Sbjct  7    FFGVFAAMGAADSGTGWFVLALVLFVLGIGVVVVWLATRFVLTVPTIALEGRPVFAAAAQ  66

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
            S  L  G +W  FG  +L+ ++    + + +    +G   
Sbjct  67   SWRLTRGAFWRTFGILLLVQLMFGFAASIASIPLTIGAVL  106


>WP_124954075.1 hypothetical protein [Halomarina oriensis]RRJ32173.1 hypothetical 
protein EIK79_05240 [Halomarina oriensis]
Length=286

 Score = 56.3 bits (132),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 26/172 (15%), Positives = 56/172 (33%), Gaps = 4/172 (2%)

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
            T  +      +   +     L   ++ S   L +L  ++    +L+  +  L+  +  FF
Sbjct  115  TRWIGWATLSSLAFVAFFNFLLFFYISSVVFLGVLTPVLGLFWALIGGVGTLVLSLLLFF  174

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP----YVGE  290
             +  +A  ++G ++A+  S  L   +   +FG  V+L++I     FL   +         
Sbjct  175  TRQEIATRDVGPIEAMTGSWSLARTNEIELFGLGVVLILIEAAQRFLVLALGSVRQLFTI  234

Query  291  AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWM  342
             A+     +   F        Y  L+                P        +
Sbjct  235  VASSLLGAVALVFFSAVVAQAYRQLRLEREDDGSETTADSLDPNDEWDDPPL  286


>KIE04164.1 hypothetical protein NF27_JF00420 [Candidatus Jidaibacter acanthamoeba]
Length=243

 Score = 55.9 bits (131),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 31/205 (15%), Positives = 73/205 (36%), Gaps = 9/205 (4%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
              +F    W +  I ++  +++   I+       A   NP+       + +   A  +L 
Sbjct  39   INIFRLNFWQIGIIVIIVSMVSHLAIYPVTEYLTAFAKNPEALKEAKFLPIIFTALTILL  98

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLR-----HVGSFTLLLILLILVVGGGSLLLIIPG  225
               M       +  +           +R        +  L   L  +++  G  L   PG
Sbjct  99   NQTMFVIAITLVVLSRPQNIFQCFERIRAFVKNKFITLYLACTLKSILINIGLTLYFFPG  158

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            ++  V F   ++++  +  G  +++ KS  +V      +F + V+ + +   + F+   +
Sbjct  159  IIVVVAFSMVEFIILIEGKGIKESIYKSIEMVR----PLFFKLVVCISLFYFILFIVLGL  214

Query  286  PYVGEAANLAFSLLLTPFSFLYYYL  310
            P   ++      + L   + L  YL
Sbjct  215  PIYFKSVIYFGLIALYIINMLVLYL  239


>WP_184342077.1 hypothetical protein [Prosthecobacter vanneervenii]MBB5034351.1 
hypothetical protein [Prosthecobacter vanneervenii]
Length=265

 Score = 56.3 bits (132),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 30/210 (14%), Positives = 64/210 (30%), Gaps = 13/210 (6%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            +       + I L  +V+          +    +     +       L    + ++G + 
Sbjct  43   WFIYRKHFVLIALTVMVVWVPCELLVSYMDAFVFDENDTRRSFKFAQLVGNVFGVIGTAG  102

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +      +   T  G+ +++  GL +         L  L +    LLLI+P         
Sbjct  103  VIHVAMNHSAGTPAGIGQALAAGLSNWFRMWWTQFLSTLFLFLSFLLLILPFFYLAPRLA  162

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY------  287
                V+  + + G  A+ +S  LV GH+W + G   L L+++         +        
Sbjct  163  LVDNVVVCEGLTGTAAMRRSEELVQGHYWQMAGILCLQLLMACLPILFYVGLALFEIEIP  222

Query  288  -------VGEAANLAFSLLLTPFSFLYYYL  310
                   V    ++  +         Y   
Sbjct  223  NWMLEAGVAVVFDIIAAFSTVWLYCAYEAF  252


>TDJ55875.1 hypothetical protein E2O47_03395 [Gemmatimonadetes bacterium]
Length=305

 Score = 56.7 bits (133),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 28/188 (15%), Positives = 57/188 (30%), Gaps = 5/188 (3%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL-  189
                    A ++                     +  + L  ++        + +   GL 
Sbjct  86   HMVPVAVVATVVLSLGLPEWGQGQEATPFTPFAMFGVALVAAYGVLLTGTILTQRYTGLI  145

Query  190  -FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
              ++    L     + L   L+ +    G L+LI+PG++  +   +    +    +G + 
Sbjct  146  RGKAYTYALNRSIPWVLTWFLVAVATSLGYLVLIVPGIIAALRLVWADEFVVAHRLGPVP  205

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL-TARIPYVGEAA--NLAFSLLLTPFSF  305
            AL+KS  L  G    +F    L  +    +       I  +           L LT F+ 
Sbjct  206  ALKKSWELTRGALGEVFIFQFLAGLFGWVIFMAGLIGIMGLSRVTAPMGPLGLPLTIFTG  265

Query  306  LYYYLIYS  313
                L+  
Sbjct  266  SMAGLLGY  273


>NLE88109.1 hypothetical protein [Myxococcales bacterium]
Length=158

 Score = 54.4 bits (127),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 11/37 (30%), Positives = 17/37 (46%), Gaps = 0/37 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V CP C A+      ++P      RCP+C ++  
Sbjct  1   MFEVSCPSCRAKYPVDERRVPPTGLKMRCPKCGESFQ  37


>MBI4733040.1 hypothetical protein [Chloroflexi bacterium]
Length=343

 Score = 56.7 bits (133),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 35/292 (12%), Positives = 83/292 (28%), Gaps = 10/292 (3%)

Query  1    MPTVRCPHCGAE-RNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCG  59
            M  V CP C            P+ +   R        +    E+++ +     A      
Sbjct  1    MDIVVCPKCKMRVIPKADGTCPSCQGRIRGAAKTYFSLEGVGEAKKQRPEVIGAASSRQI  60

Query  60   LQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGW  119
             +     + L+   K     +    F + P   +    SG   +S     S     R  +
Sbjct  61   KRELHYWNVLKRAWKFFWQFKILWLFFIIPVCFYSLFSSGYFLLSMNPVSSTGGAARSLY  120

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             L+ + L+ ++                +   + Q                G  +    + 
Sbjct  121  LLIALMLVTVLYLAGWFLGTAAYIEGIFQADKGQEKLSFSSQLKKGLSFFGKIFGANLLI  180

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                         + +    +     +  L +L +    L  +   ++        Q  +
Sbjct  181  SV---------GFIAVLALLLVLVIFMGDLALLCIIPFVLAFLPAMIIVQCLMEQSQAAI  231

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
              D +G L  L++   ++  ++W +   F++L   +  +S + +   Y    
Sbjct  232  VVDKLGILAGLQRGWEVLKNNFWKVILMFLILNGGAYVISMVISMPLYFAWL  283


>WP_176476741.1 zinc-ribbon domain-containing protein, partial [Yangia sp. SAOS 
153D]PBD16796.1 hypothetical protein CLG85_23630, partial 
[Yangia sp. SAOS 153D]
Length=111

 Score = 52.9 bits (123),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 13/65 (20%), Positives = 19/65 (29%), Gaps = 1/65 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP+CGA+   P   +PA     +C  C  T                        L
Sbjct  1   MRLI-CPNCGAQYEVPVDAIPAGGRDVQCSSCGHTWFQLHPLDAPLTEEAPAEHSQDDEL  59

Query  61  QRRIP  65
              + 
Sbjct  60  WEEMD  64


>RLB74892.1 hypothetical protein DRH06_09025, partial [Deltaproteobacteria 
bacterium]
Length=45

 Score = 50.9 bits (118),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 15/36 (42%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + CP C    N  S ++P + +  RC  C    
Sbjct  1   MIII-CPECSTRFNINSDRIPDQGAKVRCARCKHVF  35


>RMG69868.1 hypothetical protein D6710_08015, partial [Nitrospirae bacterium]
Length=123

 Score = 53.2 bits (124),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 28/113 (25%), Positives = 52/113 (46%), Gaps = 3/113 (3%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
            +  +++ L+   G++  IIP LL    F F    + ++ +  L AL++S   V  +  A 
Sbjct  1    IATLIISLLAALGAMFFIIPSLLVFCVFMFTYVAIMEEGLSALDALKESYRTVRANLSAT  60

Query  265  FGRFVLLLVISL---TLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
               F++LL I+L    +    A   ++G   N+  S  L  F+ +   L Y +
Sbjct  61   VTLFIILLGIALSVQLIEIFFAMFRFLGVIINVVLSSTLIAFTSIALLLSYRE  113


>OGC51151.1 hypothetical protein A2982_03115 [candidate division WWE3 bacterium 
RIFCSPLOWO2_01_FULL_39_13]
Length=408

 Score = 57.1 bits (134),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 39/238 (16%), Positives = 78/238 (33%), Gaps = 3/238 (1%)

Query  132  AFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFR  191
                           + +          +L  +            +         + +  
Sbjct  167  WAVQTLPENFPVENIFSSATGLVIALLTVLLMLLISYYYTLLTIKTAANIGDSEYLRMGD  226

Query  192  SMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALE  251
             +K   + +GS  +L+IL++L+   G LLLIIPG++F   F F   +L  +N+G ++AL+
Sbjct  227  LLKYPFKKLGSMLVLMILMMLIYSLGFLLLIIPGIIFMTMFMFAPIILVKENVGAIEALK  286

Query  252  KSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
            +S+ L SG+ + +F + +   ++ +          Y      +  S     F +   Y +
Sbjct  287  RSKALTSGYRFNLFLKGIGFTLLMILCFIPMVIFMYA---TMMIGSFAFNVFIYEVLYTL  343

Query  312  YSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDI  369
            Y DLK                    A         L          +     +   + 
Sbjct  344  YLDLKRIKSTEIELNTPPAVPVEPQAESARTETASLSGSPGDAFTPNPIPSPAIRPEP  401


>NJO07805.1 hypothetical protein [Chloroflexaceae bacterium]
Length=170

 Score = 54.4 bits (127),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 28/137 (20%), Positives = 54/137 (39%), Gaps = 18/137 (13%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
            +L ++++LV+  G + +I+  L     F F   V+  +  G + AL +S  LV+G +W +
Sbjct  19   VLGLMIMLVLLVGVVFVIVMSLFVFAVFLFSTQVIVIEGYGPIGALRRSWNLVTGSFWRV  78

Query  265  FGRFVLLLVISLTLSFL------------------TARIPYVGEAANLAFSLLLTPFSFL  306
             G  V+  ++   L +L                     +      AN    ++  P    
Sbjct  79   VGILVVTWLLVGVLQWLPAYVVQLVLQILFSAPEEFFMLQSFSTIANYVILIVFLPIQMT  138

Query  307  YYYLIYSDLKANYRGPQ  323
               L+Y D++    G  
Sbjct  139  ALTLLYYDVRVRKEGLD  155


>KEP22454.1 hypothetical protein DA06_21770, partial [Georgenia sp. SUBG003]
Length=134

 Score = 53.6 bits (125),  Expect = 9e-06, Method: Composition-based stats.
 Identities = 18/64 (28%), Positives = 27/64 (42%), Gaps = 0/64 (0%)

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
                    L  +  G L++L +S  L  GH+W IFG   L +VI   +S+       +G 
Sbjct  58   RLTLSAPALMLERTGVLESLRRSWSLTRGHFWRIFGSLALAVVIVSVISYALMIPLSLGM  117

Query  291  AANL  294
            A   
Sbjct  118  AFTG  121


>HHB81612.1 thioredoxin [Aliiroseovarius sp.]
Length=60

 Score = 51.3 bits (119),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 12/38 (32%), Positives = 16/38 (42%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  V CP+CGA+    S  +P      +C  C  T   
Sbjct  1   MRLV-CPNCGAQYEVDSRVIPENGRDVQCSNCGHTWFQ  37


>MBC7537647.1 hypothetical protein [Bacteriovorax sp.]
Length=199

 Score = 55.2 bits (129),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 29/175 (17%), Positives = 61/175 (35%), Gaps = 0/175 (0%)

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
            +LA+A       L     L+  +    +   ++  +  +    +      +     D   
Sbjct  18   MLAYAWDLLKKNLPLYIGLSLISMILNFIPYISIFSIFINIGFFNCCYKLMKNETIDFND  77

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQA  249
            F           +  + ++L+ + +  G  LLIIPG+   +   F   V+  +   G+ A
Sbjct  78   FFFSFQSFSRFLNILVAVVLMTIAIIIGYALLIIPGIYLSIALLFTTIVMVTEKKVGIDA  137

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
            L++S  +V G WW +F     L ++++           V     +        F 
Sbjct  138  LKRSMEIVDGKWWNVFMFCGFLFLLNIAGLLCLLVGLIVTIPLTIILLFEYYFFL  192


>NNL27683.1 hypothetical protein [Acidimicrobiia bacterium]
Length=243

 Score = 55.9 bits (131),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 19/181 (10%), Positives = 51/181 (28%), Gaps = 15/181 (8%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             ++  +   L  M              +  +++  LR   +  ++  L+      G +  
Sbjct  60   ISLLTVPFALLVMCVVALDTYFGRTATVASAIRSSLR--PTALVIGYLVFGASLMGLVAG  117

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            ++PGL+   W      ++ +++      + ++  L  G    I   ++   +I   L+  
Sbjct  118  VLPGLVILAWAGIAVPIVIEEHGRLFDTIARTWRLTRGVRGTILAFYLWFALILGVLTAA  177

Query  282  TAR-------------IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
                               V        +    P   +   ++Y D +            
Sbjct  178  VWITAFGLEALGVGLDPSVVLWPLAFVTAAAAIPIFPVGLAVMYVDARVRNEAFDLQQRL  237

Query  329  R  329
             
Sbjct  238  E  238


>WP_152604154.1 hypothetical protein [Vibrio tubiashii]
Length=160

 Score = 54.4 bits (127),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 18/129 (14%), Positives = 46/129 (36%), Gaps = 7/129 (5%)

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
            +          L+     + V  G + LI+PG+     + F  + +  ++     AL +S
Sbjct  20   RFSWSLWWRLLLVYTFYGIAVMVGFIALILPGIYLAAKYAFADFEVVLNDKPVFSALHES  79

Query  254  RLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-------YVGEAANLAFSLLLTPFSFL  306
                 G    +     ++ V    + F+   +        ++ +      S  +  F+ +
Sbjct  80   WNDTKGIAGRLMLVTTVIAVFQFAIGFIIGAVGELSSVMYFITDVVGGIVSSSVMVFTSV  139

Query  307  YYYLIYSDL  315
             Y+ +Y++ 
Sbjct  140  VYFRLYTEN  148


>NNM90246.1 hypothetical protein [Bacilli bacterium]
Length=302

 Score = 56.3 bits (132),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 33/210 (16%), Positives = 68/210 (32%), Gaps = 6/210 (3%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            LL +  +  +        AL++K      P+  N    +  A   +     ++       
Sbjct  93   LLSLIAINTIYPLMQTVYALMIKDHFEEKPKETNLFTYLTRALPYWGRYVSTYWLLLGLS  152

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +    + +   +   L      +   +  IL+V   ++  ++  + F +     Q  + 
Sbjct  153  ALAIVILMIIGIILGILFAAIGHSSASVFGILIVIVFTIAAMVAIVFFLIRLSMTQLTIV  212

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFV----LLLVISLTLSFLTARIP--YVGEAANL  294
             +      A+++S  L    +W IF   V    ++  IS  L F    IP   + E  + 
Sbjct  213  LEERKNWSAIKRSFFLTRKAFWRIFLISVIAASVISAISSGLIFAIQIIPSTALSEFISS  272

Query  295  AFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
               L+  P        +Y D K        
Sbjct  273  MIQLVWMPILPFLMINLYFDQKTRREPSHQ  302


>WP_166977408.1 MULTISPECIES: hypothetical protein [unclassified Actinomyces]
Length=367

 Score = 56.7 bits (133),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 36/229 (16%), Positives = 66/229 (29%), Gaps = 36/229 (16%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             I  +           +L+   +T    +    Q      T+  I +            I
Sbjct  105  IIQFIFSADESLLTTPSLIENLSTSALFELFRNQLIATFITLLCISIITGAAIYVTADLI  164

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG---------------------------  215
                      +K  LR +    +L IL  L+V                            
Sbjct  165  VGDKKPAAFYLKKTLRSLHKIVILYILSALIVLSVVFIGIITITFISFMLFSYDLDMIFT  224

Query  216  ---GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
                 S L++I  L   +        +  + IG L+A+++S  L  G    I G  +L  
Sbjct  225  VVAISSPLIVIFSLYLYLKLGLTTQAMLLEEIGILKAIKRSWTLSKGFLLKILGISILAG  284

Query  273  VISLTLSFLTARIPYVGEAANL------AFSLLLTPFSFLYYYLIYSDL  315
            ++   L+   A I  +             F +++   S +   L+Y   
Sbjct  285  ILLYVLNMALAAITGIIFYLTAFSSNAYIFFVVIQILSTMTMALLYPIW  333


>WP_102216465.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Gleimia hominis]PMC86044.1 hypothetical protein 
CJ187_05175 [Gleimia hominis]
Length=427

 Score = 57.1 bits (134),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 21/146 (14%), Positives = 46/146 (32%), Gaps = 26/146 (18%)

Query  214  VGGGSLLLIIPGLLFCV---WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
            V    LL I+  L+  +    F      + ++ +  L+A+ +S  L  G+ W + G  +L
Sbjct  282  VWVIPLLAIVGILITALLNARFALTMPAVVEEKLSPLKAISRSWRLSKGNTWRLAGILIL  341

Query  271  LLVISLTLSFLTARIPYVG-----------------------EAANLAFSLLLTPFSFLY  307
             ++    +  + +    +G                         +     +L TP     
Sbjct  342  TMIAVSVVVGVISLPLSIGLGVVVGLTSTTAATLSALSAAVTAISYAISFVLTTPIQAGV  401

Query  308  YYLIYSDLKANYRGPQHPPIKRQWLP  333
              ++Y D +    G     +      
Sbjct  402  VTMLYVDQRIRKEGFDMQLLNESQTH  427


>WP_101552891.1 hypothetical protein [Bacillus sp. UMB0728]PLR70128.1 hypothetical 
protein CYJ37_25740 [Bacillus sp. UMB0728]
Length=241

 Score = 55.9 bits (131),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 23/193 (12%), Positives = 61/193 (32%), Gaps = 0/193 (0%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
                +     F    +                 LL  +             +   + + +
Sbjct  26   FVFPIQLVFTFYINYITAPFQYFGIPLWTSLLQLLFILILFPFIQIPYISLIKYDMLEDE  85

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            V + + +    ++     +L I+  ++   G LL I+PG++  ++F         +N   
Sbjct  86   VSMKKIIGDIFKNGFHVYILGIITAILSLIGFLLFIVPGIVLMIFFLCVPQTAVLNNAKW  145

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFL  306
              A++KS  L    + ++    +  +++   +S  T  +      +    + LL   +  
Sbjct  146  GSAMKKSIHLGWKKFLSLTLLVLFFILVDSIISGATFFLSAGLTNSFFIINALLIFINCF  205

Query  307  YYYLIYSDLKANY  319
               +    +   Y
Sbjct  206  IIPVFIFSISYIY  218


>PIE56355.1 hypothetical protein CSA34_04545 [Desulfobulbus propionicus]
Length=475

 Score = 57.1 bits (134),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 58/385 (15%), Positives = 121/385 (31%), Gaps = 12/385 (3%)

Query  97   GSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQ  156
                  + +                  I LL I      +   +      ++  Q     
Sbjct  13   CREAWQVYRRRVWVLLGLLFLPSLFFSIVLLAIGGLVVTLSGGVNAFMGDFVGHQLDREV  72

Query  157  WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG  216
               ++   +   L   WM  +  I     D    +++  G +++  F  + ++   +V  
Sbjct  73   IIGMVILGSIGTLFGIWMVCTQLIAALDDDCTFNQALIAGWQNLLPFGFVCLIYTGIVMT  132

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
            G +L ++PG+LF  WFFF  + L D+N  G+QAL+ SR+ + G +W +  +  LL +I +
Sbjct  133  GIVLAVLPGILFACWFFFAFFFLLDNNCRGIQALQASRMALRGRFWNVLFKLFLLWLIKV  192

Query  277  TLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTA  336
             L      IP+VG+   +       PF   +   +Y DL         P           
Sbjct  193  LLFS----IPWVGKPLAILA----APFFLFFLVGLYRDL---KETSPLPTRSGSMAWGVL  241

Query  337  AIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSS  396
            +  G +++   ++ ++        ++L    +I         Q   +     E    +  
Sbjct  242  SAVGVVVVFLGMVGAVVTVGPQLPEILFNNTEITSLSKGAATQLKPIKSKALEVSTGVVW  301

Query  397  ADYKLLLSKQRKTTSEGGLSLGPVTLFAD-RFWADDQNPHLWLKLELSDFPNLSLAQKGS  455
             D                  +  + +  D         P     +   +       +   
Sbjct  302  RDPVGDSVDNGARRLLDIQGVTLLHIDEDLEVSVHLAEPLNKYFVAAQEGGEEGHDELVK  361

Query  456  ARIEIDKVLDDDARDLYDRQHSFEH  480
              +++D           D +     
Sbjct  362  FLLDVDMDRSTGGAATADSERVGYD  386


>MBI1309030.1 DUF3426 domain-containing protein [Proteobacteria bacterium]
Length=257

 Score = 55.9 bits (131),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 17/36 (47%), Gaps = 0/36 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M TVRCP C A      +K+  +    +C +C +  
Sbjct  4   MLTVRCPKCDAAYQVDETKIGPQGRRLKCAKCGEIW  39


>OFX33191.1 hypothetical protein A2Z07_03805, partial [Armatimonadetes bacterium 
RBG_16_67_12]
Length=169

 Score = 54.4 bits (127),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 25/118 (21%), Positives = 46/118 (39%), Gaps = 0/118 (0%)

Query  154  NWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV  213
             +   +       +   L+     + +            +  G      +    IL  L+
Sbjct  52   PFLSFLFGLVSLVVSQVLAIGITRISLRFADQQKAEIADLYTGYPLFFRYLFASILYALI  111

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            V  G +LL++PG+   V F    +++ D  +G ++AL KS  L  G  W +F   +LL
Sbjct  112  VAIGLVLLVVPGVYLAVRFSQYGFLVVDKGLGPVEALRKSAALTEGARWQLFLFGILL  169


>EKE01476.1 hypothetical protein ACD_21C00122G0007 [uncultured bacterium]
Length=246

 Score = 55.9 bits (131),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 35/196 (18%), Positives = 67/196 (34%), Gaps = 0/196 (0%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
             A    L+C+   G+  +  +   L    +F +L     T +          + L  V  
Sbjct  14   WAALLRLYCKVFSGVWFLGGIVGALVNVSLFLSLFYLCKTNIPIVGNVVCVVVNLIIVFL  73

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
             +  ++ +              L  S+    +   +  + + L   +   G+ L ++PG+
Sbjct  74   NIYLMAVILHRANAIDEVQGGALMTSLSFVNKKYSTIVIGVFLASFLGVLGTGLFVVPGI  133

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
               + FF  Q ++  DN     AL  S  LV G+WW        ++ I+  L F      
Sbjct  134  YLTIAFFMLQPLILFDNKDSFSALRDSCGLVWGNWWRTCAVIFPVMFINYLLGFAVQFAV  193

Query  287  YVGEAANLAFSLLLTP  302
              G A  +     +  
Sbjct  194  IRGTAWYVIMGANMLV  209


>QDU63139.1 hypothetical protein Pan216_40140 [Planctomycetes bacterium Pan216]
Length=243

 Score = 55.5 bits (130),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 24/191 (13%), Positives = 58/191 (30%), Gaps = 21/191 (11%)

Query  146  TWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTL  205
              +          +++      L  +                   +     L ++    +
Sbjct  57   FAVGRGVWGLIQTLIMLFFYLGLDLMLLKVVRGERTDIGELFSGGKYYLTALVNMIVIYI  116

Query  206  LLILLILVVGGGSLL----------LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
               +L +V+   S +           +    +  + F+   Y++ D+   G++AL+++  
Sbjct  117  AFGILGIVIALCSFVPILLIPLALAAMFFSFMVTLVFWPFLYIVIDEKPQGIEALKRANE  176

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
            L  G++ A+   F+ +L            + +VG  A         P  F      Y  L
Sbjct  177  LTKGNYGAMVIIFLYML-----------GLGFVGALACGVGLFYTLPLCFTTLTTAYVKL  225

Query  316  KANYRGPQHPP  326
            +      +  P
Sbjct  226  RGEATVMETAP  236


>WP_073005725.1 hypothetical protein [Clostridium amylolyticum]SHI93990.1 hypothetical 
protein SAMN05444401_1844 [Clostridium amylolyticum]
Length=318

 Score = 56.3 bits (132),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 19/166 (11%), Positives = 58/166 (35%), Gaps = 17/166 (10%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
               +    +  S +    +  I      +   +      +  +  ++I+ +L++     +
Sbjct  115  FVILLLASIVFSPVIFVFWKLITWMMESVDLYLFEFGIEIVKWMSIIIIPLLMLLISLFI  174

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL----  276
            +    L++   F F   V   +  G L ++++S +LV  ++W + G  ++ ++  +    
Sbjct  175  V----LIYYTLFSFSFQVATIEKKGPLASIKRSFILVKKNFWKMVGYNLIFILTVMGIRY  230

Query  277  ----TLSFLTARIPYVGEAANL-----AFSLLLTPFSFLYYYLIYS  313
                 +  +   I  VG+  ++          L         ++  
Sbjct  231  SLQSLVGLILGIIYLVGKLFSINQDYNILLASLYGVLSWPINILTW  276


>PYQ09707.1 hypothetical protein DMH00_12530 [Acidobacteria bacterium]
Length=270

 Score = 55.9 bits (131),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 24/168 (14%), Positives = 59/168 (35%), Gaps = 1/168 (1%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            +    +         L L     +   +    +     ++      +  +  ++   +  
Sbjct  82   WEFIGLAWTLVKPHWLPLGLMFLILTLSGAVPYIGPCISLLLSGTLMVGIYRAILGLLAG  141

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
                +   M  G    G   L  ++  ++VG G +  I+PG++  + + F   +LA+ ++
Sbjct  142  RAPTV-EMMFNGFDRFGQAFLASLVYTVLVGLGLMACIVPGVILAIMWTFVSPILAETDL  200

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
                A++ S  L  G+ W +F   +  + + L           V +A 
Sbjct  201  EFWPAMKASADLTKGYRWELFCLILASIPVLLLGLLCCCIGVVVAQAV  248


>OGS54086.1 hypothetical protein A2Y20_10305 [Firmicutes bacterium GWF2_51_9]OGS59405.1 
hypothetical protein A2Y19_09420 [Firmicutes 
bacterium GWE2_51_13]HAM63105.1 hypothetical protein [Erysipelotrichaceae 
bacterium]
Length=277

 Score = 55.9 bits (131),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 33/185 (18%), Positives = 72/185 (39%), Gaps = 4/185 (2%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
            +    +  A     A  +              T A     + ++       +    VGL 
Sbjct  76   MFATILNLAAFPMTAGLIKLGLTGENVTFNDFTTALSENIVKYLKLIFGGILLGLAVGLV  135

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
              + + +  + +F+    L +  + G  LLLI+   +  V+F F    +  D++  + A+
Sbjct  136  SVIYVIVTFMVTFSGDT-LNLFALIGLLLLLILVLAVGAVFFTFWFAAMVLDDLTVMNAI  194

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFL---TARIPYVGEAANLAFSLLLTPFSFLY  307
            +KS   V   +W + G  +L+ +++  LS +      IP VG   +   +   T  +  +
Sbjct  195  KKSFDSVKRCFWTVVGVTLLIQILTNILSGIAGGFGGIPLVGALLSSVVTTGSTVLTMAF  254

Query  308  YYLIY  312
             +++Y
Sbjct  255  TFILY  259


>HBB65106.1 hypothetical protein [Candidatus Vogelbacteria bacterium]
Length=222

 Score = 55.5 bits (130),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 41/184 (22%), Positives = 63/184 (34%), Gaps = 1/184 (1%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
             +F  R   LL + L  +  +               L    +N    +    +   +  L
Sbjct  12   RIFRERWAPLLSLSLGLMFFSALMRGIFFGEGFFHSLALPGENTALLVASVLLMLAVHIL  71

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
            +       I    +      ++       G    L  L  L+  G   L IIPG++F VW
Sbjct  72   TAGAVLRVITEGDSAASPRNALAYAWGRRGDLFSLFFLNFLLG-GAYALFIIPGIIFQVW  130

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
            F     VL  +   G +AL  SR  V GH WA+F R   L+  +L +S     +P     
Sbjct  131  FSLSVIVLIAEGRSGTEALLASREYVRGHDWAVFSRIGFLVFFALLISSAADLLPLPSPV  190

Query  292  ANLA  295
                
Sbjct  191  RQGL  194


>TRW99263.1 hypothetical protein FNJ84_00865 [Paracoccus sp. M683]
Length=291

 Score = 56.3 bits (132),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 12/63 (19%), Positives = 17/63 (27%), Gaps = 0/63 (0%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              + CP CG E   P   +P    +  C  C Q        +  T+   N          
Sbjct  63   IRLTCPKCGTEYRLPEDAIPVAGRNVECSTCGQVWHQPGIAAGTTRGDSNQRPDDSEPTP  122

Query  62   RRI  64
               
Sbjct  123  MPR  125


>RME69568.1 hypothetical protein D6778_00320 [Nitrospirae bacterium]
Length=234

 Score = 55.5 bits (130),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 28/186 (15%), Positives = 69/186 (37%), Gaps = 3/186 (2%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
               +  AL +          +   +   +  +A   +     T   + ++      L++S
Sbjct  41   IYLLTVALFMDSPLPQRGLPRGQIFIAFVIGIALQSMAFGMSTLMAWNHVKGIPDDLWQS  100

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
            ++  +          +LL      G    ++PGL    +F F  + +        +A+++
Sbjct  101  LRATIAQAFQLITAGLLLGTATAVGIFFFVLPGLFLAFFFMFTFFFIMVKRQHIFEAMKE  160

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSF---LTARIPYVGEAANLAFSLLLTPFSFLYYY  309
            S +LV+ ++      F+L++ +S+ ++F   +  R   VG+  N     +L   +     
Sbjct  161  SFVLVNRNFLQSLKLFLLIVALSILVAFTGVVLMRASLVGQFLNALVVSVLFGLTATMLV  220

Query  310  LIYSDL  315
            + Y   
Sbjct  221  VFYKIW  226


>MBD3250178.1 hypothetical protein [Candidatus Pacebacteria bacterium]
Length=231

 Score = 55.5 bits (130),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 35/210 (17%), Positives = 75/210 (36%), Gaps = 11/210 (5%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F           +     +   +  A+++  +        +    +      +  + L  
Sbjct  33   FFISQVFKQAWKVFKQKWSTILLLFAVMIGVSFLYGLVAGDRPSLLFSLLSVFGQMLLGM  92

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +   +F+ + + +   F  +   L   G + +  +   L+V GG +LLIIPGL + + + 
Sbjct  93   VFLQVFVRLYREEEVSFDLVGQLLPRFGHYFIGTLFYALIVLGGLVLLIIPGLYWAIKYA  152

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
               +++ D  +   +A ++S  L  G  W + G             F  A + Y G  A 
Sbjct  153  LVPFLIVDQKLKFTEAFKESARLTKGAKWDMIG-----------FYFAAAVLAYSGFLAL  201

Query  294  LAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
            L    +  P ++L +  +Y  L        
Sbjct  202  LVGVFVTAPVAYLAFAGLYVKLVERGEQSN  231


>2NB9_A Solution structure of ZitP zinc finger [Caulobacter vibrioides 
CB15]
Length=49

 Score = 50.9 bits (118),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 11/36 (31%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M    CP C +      SK+       RC  C    
Sbjct  1   MIL-TCPECASRYFVDDSKVGPDGRVVRCASCGNRW  35


>PIR44495.1 hypothetical protein COV10_04550 [Candidatus Vogelbacteria bacterium 
CG10_big_fil_rev_8_21_14_0_10_51_16]
Length=199

 Score = 54.8 bits (128),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 37/183 (20%), Positives = 66/183 (36%), Gaps = 15/183 (8%)

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            +L +IPG+L  VWF    YV   ++  G+ A  +S+  V G +W + GR + + ++ L +
Sbjct  1    MLFLIPGILATVWFSLAVYVFVVEDKRGMAAFFQSKAYVEGRFWGVLGRMLFVALLLLVI  60

Query  279  SFLTARIP---------------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                  I                 V  AA     L+L P ++ Y Y +Y  LK+ +   +
Sbjct  61   YVPLITIFSFLLVGFNLSEQVFELVMGAAFYVLFLVLAPMTYCYLYELYVALKSIWTPTE  120

Query  324  HPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDL  383
                  +            +I  LL  ++    L   +       I +    +  Q    
Sbjct  121  IVNTGGRKAKFIVPALVGWVITPLLFGAVIATLLYGLRGEFPEGLIPENESGELSQEEQE  180

Query  384  NRS  386
               
Sbjct  181  QFE  183


>OUW89139.1 hypothetical protein CBD86_00810 [Gammaproteobacteria bacterium 
TMED226]
Length=278

 Score = 55.9 bits (131),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 41/176 (23%), Positives = 68/176 (39%), Gaps = 12/176 (7%)

Query  139  ALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLR  198
              L+     L     ++   I +  +  +   ++    S+          +      G  
Sbjct  19   NFLVIWVNGLICIMTSFLAGITIIGLIAVPAIMAGYLESLLRVRRGEKAEIGDFFTFGFN  78

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLV  257
            H GSF  L ILL L +   SLLLIIPG+   + ++F  Y+  D+ NI   +AL  S  LV
Sbjct  79   HFGSFLGLAILLFLGIFFASLLLIIPGIFLFIAWYFAWYIKIDNPNITVTEALSMSMSLV  138

Query  258  -SGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
                WW +F   +L+ +     +                  LL+ PF+++     Y
Sbjct  139  LKIGWWKLFALVLLVSIAGGIXNL----------FTFNLAGLLIYPFTYMITVEAY  184


>RMG72764.1 hypothetical protein D6710_04330, partial [Nitrospirae bacterium]
Length=185

 Score = 54.8 bits (128),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 28/158 (18%), Positives = 59/158 (37%), Gaps = 0/158 (0%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            I +   ++A        +       N         + L       L +  +       + 
Sbjct  19   ILIAPSIIATLITSILGVSLTGMRFNEHMYGRFMLVGLVGFILHALSVCIILSMAMDSLG  78

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
             +     R++K  L       +  +++ L+   G++  IIP LL    F F    + ++ 
Sbjct  79   GSQPLFSRALKKSLSRFFDILIATLIISLLAALGAMFFIIPSLLVFCVFMFTYVAIMEEG  138

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            +  L AL++S   V  +  A    F++LL I+L++  +
Sbjct  139  LSALDALKESYRTVRANLSATVTLFIILLGIALSVQLI  176


>TFH65499.1 hypothetical protein E4G91_02245 [candidate division Zixibacteria 
bacterium]
Length=255

 Score = 55.5 bits (130),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 32/201 (16%), Positives = 64/201 (32%), Gaps = 1/201 (0%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
              L          I+L   V        AL        +           +      +  
Sbjct  34   MRLLKNGVRQFAMIFLAIQVPLMLVQMLALPASDQPQGDLGGMGNMILAAIFFGFLGIFA  93

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
               +  +   ++       +      +    +    +++L +VV  G L LI+PG+LF +
Sbjct  94   TLNIMTAARHHVAGAPKPFYEVFLNAMVRFPTAVFTVLILGIVVFTGLLALIVPGVLFYI  153

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            +       +A  +     +L +S   V GHWW +F    +L    L +S     +  +  
Sbjct  154  FGCCSLQAIALTDRRIFASLIESYESVRGHWWQVFALQFMLFATLLVISLPLILVDALLT  213

Query  291  AANLAFSLLLTPFSFLYYYLI  311
              +L   + +  F      +I
Sbjct  214  -PSLLVKIPIYLFINALEIVI  233


>TSC58475.1 hypothetical protein Greene041662_736 [Candidatus Peregrinibacteria 
bacterium Greene0416_62]TSC98753.1 hypothetical protein 
Greene101449_856 [Candidatus Peregrinibacteria bacterium 
Greene1014_49]
Length=193

 Score = 54.8 bits (128),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 25/145 (17%), Positives = 50/145 (34%), Gaps = 6/145 (4%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +      L      +     +    L    + G   +    L  IL   +    SLLL
Sbjct  25   IMLWGAACVLLVGKKLVHSPAGRNRTSLAAVAREGRFFIVPLLLTGILRSCIGLLWSLLL  84

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I+PG+++ +   F   ++  ++I    AL +S +LV GH W +  R      +++  + +
Sbjct  85   IVPGIIYAIRTTFYTIIIVAEDISYRAALRQSIVLVRGHTWQVLWR------LAVIFTVI  138

Query  282  TARIPYVGEAANLAFSLLLTPFSFL  306
                  +            T    +
Sbjct  139  YGPPNLLATLGYWVVGAYETGMLVM  163


>OQX68453.1 hypothetical protein B6A08_09975 [Sorangiineae bacterium NIC37A_2]
Length=270

 Score = 55.9 bits (131),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 17/161 (11%), Positives = 51/161 (32%), Gaps = 3/161 (2%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +   +     +     I      +    + +    H     +   ++ L+   G    
Sbjct  96   LLIGTAVSQGVTIVQLQSIQARGEPLPPLEAWREFKPHAWPLLVTSFVVSLLGALGFAFF  155

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            ++PG++   +      +   +N  G+ AL++S  L+       F   +   +    +  +
Sbjct  156  VLPGIVIMAYLQLAAPLTILENRRGIDALKESVSLILRSAKGFFVVMICAGIAQWVVGVI  215

Query  282  TARI---PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
               +     +   A+      L+P   +    +Y +++   
Sbjct  216  LNALLTNAALNHLAHNIAMAALSPLYAMVLVALYREVRERE  256


>NVK36973.1 hypothetical protein [Gammaproteobacteria bacterium]
Length=253

 Score = 55.5 bits (130),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 33/225 (15%), Positives = 66/225 (29%), Gaps = 27/225 (12%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
                  Y       F    ++ L     +          A +L  +   +   +++   +
Sbjct  8    NQSFHFYSRLFNKVFWLSVASSLSPLLMFFAVGAGQPSLAGMLFVMMLSMFFSAYIMVLI  67

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG-----------------------  215
              Y    D  L  +  L L+ V   T   I+  L                          
Sbjct  68   HQYSQDQDDSLSSAFSLTLKKVLPITGTSIVFGLFAVVVAIPAAVIATLLAAGIEDQQLQ  127

Query  216  --GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW--WAIFGRFVLL  271
                +L++ IP        FF  Y    D    + AL+ S  LV G+   +  F    ++
Sbjct  128  AGLIALIVSIPVCYVLYRCFFAVYFTLVDGASPIDALKASNQLVKGNKLVFRSFMLLSVV  187

Query  272  LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
            ++  + +  L +++  VG  A       +      Y+ +    + 
Sbjct  188  MLAYVAIILLISQMIAVGSMAQAILEFAVNVIVLPYFTITIYRIF  232


>WP_084613712.1 zinc-ribbon domain-containing protein [Roseibacterium elongatum]
Length=221

 Score = 55.2 bits (129),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 9/37 (24%), Positives = 13/37 (35%), Gaps = 0/37 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
            + CP+CGA        +P      +C  C  T    
Sbjct  2   RLTCPNCGARYEVDDGLIPPDGRDVQCSNCGSTWFQP  38


>HGX21135.1 DUF4339 domain-containing protein [Verrucomicrobiales bacterium]
Length=292

 Score = 55.9 bits (131),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 27/206 (13%), Positives = 68/206 (33%), Gaps = 5/206 (2%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
                   + +  + +++          +    +   +        +       ++  + +
Sbjct  78   QDYFRCFVPLAWITVIVWLPCCLFTSWMDYHFFGEEELGKSFRMHMQVENLIGIIASAGI  137

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
               +  +      G++ S++ GLR  G   L   L  +    G L L++PG+   V    
Sbjct  138  LCWLRDHHQGGSPGVWGSLRNGLRLWGRIFLANFLSGICFVLGLLALVLPGIWVAVRLSL  197

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT-----LSFLTARIPYVG  289
             + V+  D +   +++++S  L  G +W + G ++L  +  L      L F      +  
Sbjct  198  IEAVVVMDGLKTTESIKRSWKLTHGQFWPLLGLWLLAGLPGLLSLAPYLIFAGLLPQFDV  257

Query  290  EAANLAFSLLLTPFSFLYYYLIYSDL  315
               +   SL            ++   
Sbjct  258  WWIDALASLAGECVFMFCTIFMFQAW  283


>KKP69379.1 hypothetical protein UR67_C0007G0084 [candidate division CPR3 
bacterium GW2011_GWF2_35_18]OGB62607.1 hypothetical protein 
A2X44_04475 [candidate division CPR3 bacterium GWF2_35_18]OGB65858.1 
hypothetical protein A2250_01725 [candidate division 
CPR3 bacterium RIFOXYA2_FULL_35_13]OGB76675.1 hypothetical 
protein A2476_03530 [candidate division CPR3 bacterium RIFOXYC2_FULL_35_7]OGB78834.1 
hypothetical protein A2296_05235 
[candidate division CPR3 bacterium RIFOXYB2_FULL_35_8]
Length=244

 Score = 55.5 bits (130),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 44/197 (22%), Positives = 77/197 (39%), Gaps = 8/197 (4%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             + I + G       +                       L   V    + +  +T  +  
Sbjct  41   YMLIGVPGSYYFEGTLGMKTDEWGLMTPEMGQIGKLLLPLGFIVFAGGMIILLITFFLLQ  100

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                  + +        +H+  F +++IL+ +   GGS+LLIIPGL+F  WF F      
Sbjct  101  KTENQKLKVLEIYHKAFKHLFHFLIIMILIGISFIGGSILLIIPGLIFLTWFIFAPLAYI  160

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI--------PYVGEAA  292
             +   G QALE SRL V GH+ ++    +++L+I +  +  TA I         +     
Sbjct  161  IEGKTGYQALEDSRLKVKGHFVSVVLTLIMMLLIFIIPNATTAMISEALKLPLFFEQLMF  220

Query  293  NLAFSLLLTPFSFLYYY  309
            N   +L++ PF+    Y
Sbjct  221  NGINALIIFPFNISVLY  237


>WP_125216393.1 zinc-ribbon domain-containing protein [Rickettsiales endosymbiont 
of Stachyamoeba lipophora]AZL15988.1 DUF3426 domain-containing 
protein [Rickettsiales endosymbiont of Stachyamoeba 
lipophora]
Length=220

 Score = 55.2 bits (129),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 8/46 (17%), Positives = 16/46 (35%), Gaps = 1/46 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRT  46
           M  + CP C  +     + +P +    +C +C     F     +  
Sbjct  1   MI-ISCPSCHTDFEVDDALIPPQGRKLQCSKCKHLWFFKIKSDEDY  45


>HID26986.1 hypothetical protein [Methanosarcinales archaeon]
Length=309

 Score = 56.3 bits (132),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 24/166 (14%), Positives = 58/166 (35%), Gaps = 11/166 (7%)

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
                +    + +  +  +  +++    + +  +     +   SF L        +  G+ 
Sbjct  141  NFINLLLAEIVIGLIILAGIVFVVPGFLAIDFTHGFSEKDFTSFAL--------IFIGAF  192

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            +  I  L+  +     QY L  +N+G + A+         H   +F  ++ ++ IS+ L 
Sbjct  193  IWFIYALIISIALSVVQYALVIENLGPIDAISTGFKFFRNHKLDVFLVWLFIIAISIGLG  252

Query  280  F---LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGP  322
                +   IPY+         L+     +    L ++ L  +  G 
Sbjct  253  IIGQIVGLIPYLNVIWFFVNMLISVVVIYPLMTLWWTRLYMSRTGM  298


>HDH96468.1 tetratricopeptide repeat protein [Proteobacteria bacterium]
Length=500

 Score = 56.7 bits (133),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 11/70 (16%), Positives = 17/70 (24%), Gaps = 0/70 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            +RCP+C      P  K+P +    RC +C         E                   +
Sbjct  20  EIRCPNCKTGYQVPDEKIPEEGMKVRCSKCKHVFEVKKGEVIYELEELQKPEPQPEPTAK  79

Query  63  RIPSDRLEIQ  72
                     
Sbjct  80  EAEPTPQPQP  89


>MAX26220.1 hypothetical protein [Phycisphaeraceae bacterium]
Length=720

 Score = 57.1 bits (134),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 25/292 (9%), Positives = 62/292 (21%), Gaps = 26/292 (9%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
             CP CG +             +  CP C Q+          T        C        +
Sbjct  17   TCPECGKDFKPSEFDYVPSSVAFCCPHCDQSYFGTDERGHLTPRAFECVGCKRHIDMDEM  76

Query  65   PSDRLE-IQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
                    + +  +          + +  F A             D  + F  R      
Sbjct  77   VLRPGPGYEERQAHQSENPWVERKKSQSSFLAWIKTSWRAMMRPNDLAKTFGDRNDCFPA  136

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWA--------------ILLATVAYILL  169
                            L +           +                  +     A+   
Sbjct  137  SISFLFFNNALITLLGLGVIIVMVGMETLTDSYRNNHLETAAGLVALLGLCAMGGAFAAF  196

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             +  +   +     +       + +     +     +  +  + + G   L  I G+ + 
Sbjct  197  IVVSIWAMLTHGALRVLAKPEGNFRETFDCLCFSNGVTFITAVPICGLYGLSWIVGIWWP  256

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            +   F         +   +A+  +          +F    ++ +  L    +
Sbjct  257  ISAGFMLAKRH--KVSVGRAMMAAW---------VFPIIAVIGLGGLIGYMV  297


>GFZ94078.1 hypothetical protein CYANOKiyG1_04740 [Okeania sp. KiyG1]
Length=174

 Score = 54.0 bits (126),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 18/94 (19%), Positives = 37/94 (39%), Gaps = 0/94 (0%)

Query  210  LILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV  269
            +IL     ++L I+  +L   +F     +  ++  G  + L +S  L  G+   IF   V
Sbjct  24   VILGSTFLAILFIVLAILLARFFILEIPLAVEEVAGATKTLGRSWELTKGYVRRIFIILV  83

Query  270  LLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
            +  +I+L +  +      + +   L       P 
Sbjct  84   IAGLITLPVGIILQIFTTILQGILLLAISPTQPI  117


>NJQ97971.1 hypothetical protein [Hydrococcus sp. CSU_1_8]
Length=126

 Score = 52.9 bits (123),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 21/118 (18%), Positives = 38/118 (32%), Gaps = 12/118 (10%)

Query  218  SLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
             + L    +         +  +A +       A+ +S  L  G    +   F +  +I+L
Sbjct  1    MIGLFFGYIWLYSRLSIVELPIAIESETDASAAIGRSWNLTKGFVVRLQLIFFVAFLITL  60

Query  277  TLSFLTARIPYVG-----------EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
             LS +   I +              A ++     L PF      +IY DL+    G  
Sbjct  61   PLSLVVNLIGFFLPQDSAIAVLINLALSIVLGAFLIPFWQAIKAVIYYDLRTRKEGID  118


>NTV33791.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=213

 Score = 54.8 bits (128),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 22/183 (12%), Positives = 56/183 (31%), Gaps = 23/183 (13%)

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            L        I +         +  + +  +    L  ++  ++     LL +       +
Sbjct  30   LIGTLILWSILVFGVPALFGIAAAIAVPSLAMMGLGGVVAGIIAFVVILLAVWIFTTLFL  89

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG---------RFVLLLVISLTLSFL  281
             +     V+  + +  ++AL +S+ L+ G     F            V+ ++I L +  L
Sbjct  90   NWLLADKVVVLEELAWMKALRRSKELMKGRTEPGFWKSIKTKASLIIVVGVLIGLGIHLL  149

Query  282  TARIPYVGEAAN--------------LAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
                  +                   +  + L T ++ +   L Y D++    G     +
Sbjct  150  FQLPGVLLGLVFSQGLVVTTVQGVLNMVATSLATAYTAIAMILFYYDIRVRKEGFDLKMM  209

Query  328  KRQ  330
              +
Sbjct  210  AEK  212


>KSW17292.1 hypothetical protein ATM99_18135 [Cellulomonas sp. B6]
Length=294

 Score = 55.9 bits (131),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 17/127 (13%), Positives = 37/127 (29%), Gaps = 18/127 (14%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              G + +++  +            L  +       + ++  L  G +W + G ++L  V+
Sbjct  157  LLGGVGMVVATVWLTARLLLVPPALMLEGRRFWPTVARAWRLTRGSFWRLLGIYLLANVL  216

Query  275  SLTLSFLTARIP------------------YVGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
               L +L                        +     +    L T F      L+Y D++
Sbjct  217  VSVLMYLFVLPASFVAGLVTVATGSQAATVIITALGQVVGLTLSTTFMAAVVALLYVDVR  276

Query  317  ANYRGPQ  323
                G  
Sbjct  277  IRREGLD  283


>HAJ00054.1 hypothetical protein [Dehalococcoidia bacterium]
Length=135

 Score = 53.2 bits (124),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 23/130 (18%), Positives = 45/130 (35%), Gaps = 6/130 (5%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            +    L+ I   L+  V  +F    +  +N   + A  +S  LV G+W  +FG   +   
Sbjct  6    LLSLVLVGIPMLLVLLVVLWFYPQTIMVENRDPVSAFRRSIFLVRGNWMRVFGIGAVYWA  65

Query  274  ISLTLSFLTA------RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
            + + L+              +    +     + TP+  +   L Y DL+    G     +
Sbjct  66   LPIVLAMAVLPFSNDPAHSTLVGIYSAFVGTVTTPWILIGSTLTYLDLRVRKEGYTVESL  125

Query  328  KRQWLPLTAA  337
            +      T  
Sbjct  126  EADLNTPTPP  135


>MBA2627168.1 zinc-ribbon domain-containing protein [Gemmatimonadales bacterium]
Length=93

 Score = 52.1 bits (121),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 12/67 (18%), Positives = 18/67 (27%), Gaps = 0/67 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           V CP+C        +K+P     ARC  C          +        + T      Q  
Sbjct  3   VTCPNCATTYRVDPAKVPEAGVRARCAVCSAVFAVRRDMADSPPEAQGVETPAPRPRQPE  62

Query  64  IPSDRLE  70
             +    
Sbjct  63  TATAGQP  69


>XP_001639125.2 inner centromere protein [Nematostella vectensis]
Length=1120

 Score = 57.1 bits (134),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 39/314 (12%), Positives = 78/314 (25%), Gaps = 22/314 (7%)

Query  2     PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                 CP C         KL                I   +  +R +        P     
Sbjct  812   IEYNCPKCVKAIKVEQDKLSPG--------ESNKEISPTSRPRRHRRLKIPKDFPEHAGN  863

Query  62    RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                 S   E +  + +  +          +  +  G   +        +          L
Sbjct  864   TSTKSKLGEKRKHSKDEEKTKPKSSETGFKIPKKVGKYKKYFFVQTCLTLLEKSLCPSML  923

Query  122   LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
             L  + L   +        L +    W    +    +  L  T+     GL       +  
Sbjct  924   LPYWGLCSSMLLPYWGLCLTMLLPYWGLCSSMLLPYWGLCLTMLLPYWGLCPSMLLPYWG  983

Query  182   ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
             +C + +  +  +   +          +LL       ++LL   GL   +   +  + L  
Sbjct  984   LCPSMLLPYWGLCPTMPLPYWGLCPSMLLPYWGLCLAMLLPYWGLCLAMLLPYRVFCLTM  1043

Query  242   DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
                     L     L        +G    +L+    L   T  +PY G   ++       
Sbjct  1044  --------LLPYWGLCPSMLLPYWGLCPSMLLPYWGL-CPTMLLPYWGLCPSMLL-----  1089

Query  302   PFSFLYYYLIYSDL  315
             P+  L   ++    
Sbjct  1090  PYWGLCLTMLLPHW  1103


>RMG15829.1 hypothetical protein D6731_07500 [Planctomycetes bacterium]
Length=270

 Score = 55.5 bits (130),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 31/224 (14%), Positives = 68/224 (30%), Gaps = 8/224 (4%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F      +L    L              + PA +           +  A + Y  L  + 
Sbjct  33   FVCNLPQVLVQAFLVPDPTPLDPALGSPVDPAAFFRDFFAAMGVWVPAALIPYPFLQTAA  92

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
               +   +  +     +   +    +  +  +L  + +L V G    + +   +F  WF+
Sbjct  93   TLVAAQAFAPEERPLSWVLRRSLRLYPRALLVLAAIGLLNVVGVCPGMFVGYFVFAAWFY  152

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHW--------WAIFGRFVLLLVISLTLSFLTARI  285
                V   ++ G L+AL++SR L  GH+           F    ++   +    F     
Sbjct  153  VALPVAVLEDRGWLEALQRSRALARGHFGVLLALFLALHFAVPFVIAPFAALFQFAFMNS  212

Query  286  PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
            P+          +++  F  +   + Y  L+          +  
Sbjct  213  PWAIALLQGFLGMVVGVFPIVGPVVAYHHLRVVREHYDIQRLAE  256


>NOU39593.1 DUF975 family protein [Ferruginibacter sp.]
Length=204

 Score = 54.8 bits (128),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 24/131 (18%), Positives = 51/131 (39%), Gaps = 1/131 (1%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
                  I   L+      ++ I +   G+F +M  G ++ G+  +   L+ L+ G   +L
Sbjct  43   PFGGLLIAGPLALGIAGFYLKISRNGDGVFNNMFDGFKNFGNALIANFLVGLLTGLAFIL  102

Query  221  LIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            LI+PG++    +     ++ D+  + G  A+  S  L+ G     F   +  +  ++   
Sbjct  103  LIVPGIVVACGYSQVNRIMHDNPQMNGTDAMRASWKLMDGKKMDFFMLNLSFIGWAILCI  162

Query  280  FLTARIPYVGE  290
            F          
Sbjct  163  FTLGIGFLFLS  173


>MBE9520865.1 hypothetical protein [Proteobacteria bacterium]
Length=283

 Score = 55.5 bits (130),  Expect = 1e-05, Method: Composition-based stats.
 Identities = 35/241 (15%), Positives = 81/241 (34%), Gaps = 8/241 (3%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
                           + + +       F  +  G +          +L+  +  G +L I
Sbjct  36   LSILYGPLFGGYLLLVILLLRDDKKPAFNDLFNGFQAFRLLIPYFFILLAKI-IGFMLFI  94

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS--GHWWAIFGRFVLLLVISLTLSF  280
            +PG+LF  W+ +   ++ D  IG  +A+  S   V+  G +  +    ++ ++  + L  
Sbjct  95   VPGVLFATWWIYVLPLMIDRKIGFGKAMRISSDKVTEAGFFMHMVFFLLVYVIPVIVLEM  154

Query  281  LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFG  340
            L++ +P++     +  +LLL PF       +Y D          P  + +    T  I  
Sbjct  155  LSSFMPFL-----MVLTLLLMPFQVGCLVSLYLDQFKEQELATAPEKQHESAEATPMIPP  209

Query  341  WMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYK  400
               I    +   +    + + +    +  +Q       +  DL     +        + +
Sbjct  210  PTEIIESSVEEETESPQTGQSVSKVPETSEQPTDVSMPEGNDLEEKPDQGTDADQHQEDE  269

Query  401  L  401
             
Sbjct  270  G  270


>TAK07581.1 hypothetical protein EPO38_12535, partial [Rhizorhabdus sp.]
Length=131

 Score = 52.9 bits (123),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 10/39 (26%), Positives = 14/39 (36%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  + CP C      P + + A     RC  C  +   D
Sbjct  1   MILI-CPACQTRYLVPDTAIGAPGRQVRCASCKHSWFQD  38


>OYD09572.1 hypothetical protein CHM34_00725 [Paludifilum halophilum]
Length=298

 Score = 55.9 bits (131),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 27/181 (15%), Positives = 53/181 (29%), Gaps = 13/181 (7%)

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
                  ++I I    + L   + +             +  L             L     
Sbjct  110  IAAWLFLWILIAGGWIILAGLLFVPAVLARMAGAGEWIFGLGAFIFFPAAGAFFLFIMTR  169

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA------RI  285
            FF    V+ ++      A+++S  L    +W      ++L +++ +  F+ +       I
Sbjct  170  FFLIIPVIVEEKTAIFGAMKRSWQLTRSSFWRTMSLMIVLGLLTFSYQFVVSILSQSLFI  229

Query  286  PYVGEAANLAFSLLLTPFSFLYY-------YLIYSDLKANYRGPQHPPIKRQWLPLTAAI  338
            P      +   SL +T F  +          LIY D +  Y           W   +A  
Sbjct  230  PEWPWVLHGPISLFMTVFYCIPQVLPPILATLIYFDRRCRYEAMDLQAEIDGWNGESALP  289

Query  339  F  339
             
Sbjct  290  P  290


>MBA3895303.1 zinc-ribbon domain-containing protein [Gemmatimonadales bacterium]
Length=44

 Score = 50.2 bits (116),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 10/33 (30%), Positives = 13/33 (39%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           V CP+C        +K+P     ARC  C    
Sbjct  3   VTCPNCATVYRVDPAKVPEAGVRARCAVCSAIF  35


>TFG50702.1 hypothetical protein E4H37_08895, partial [Gemmatimonadales bacterium]
Length=61

 Score = 50.9 bits (118),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 12/33 (36%), Positives = 15/33 (45%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           VRCP+C        +K+P     ARC EC    
Sbjct  23  VRCPNCQTVFRVDPAKVPEAGVRARCSECDAVF  55


>AMB93864.1 hypothetical protein AWM72_03370 [Aerococcus sanguinicola]OFT95474.1 
hypothetical protein HMPREF3090_04330 [Aerococcus sp. 
HMSC23C02]PKZ21403.1 hypothetical protein CYJ28_05710 [Aerococcus 
sanguinicola]
Length=465

 Score = 56.7 bits (133),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 35/293 (12%), Positives = 77/293 (26%), Gaps = 21/293 (7%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLAD-DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
                L+L +  L+F   F F  ++  D +       L  SR ++ G+ + +F      +V
Sbjct  187  FALWLVLTLIYLVFIYGFAFTPFLPYDSETASASALLGISRQMMRGNKFKLFRIHFFYMV  246

Query  274  ISLTLSFLTARIPYVGEAAN----LAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
            +   L+FL   +              F LL    +F+   +        +          
Sbjct  247  VPYLLAFLFGLVLMATSLLFNLDAGLFGLLAGVIAFILVVVYVVFGLRAFTATAVFYRNY  306

Query  330  QWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPE  389
                       +  +        S   +  ++      +                     
Sbjct  307  IKQYRLELNEAFPELNLTTWGQGSETEIYQQEDRPQVPEETMAF---------------A  351

Query  390  EPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLS  449
             P  L   +         +      +     +        ++  P     ++    P L 
Sbjct  352  VPDELKGDEASEEDLSPSERAGAAAVFTEAESRSDADPSVEEDLPETTQVVDPQHDPELF  411

Query  450  LAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRS  502
                 S   +I    D++A+ ++D +    +     +  N T   D      S
Sbjct  412  DHMDQSFSDDIRGE-DEEAQAIFDPEDDLTYDEKTQLRDNPTLYVDQSDDQSS  463


>MBE0636037.1 hypothetical protein [Candidatus Bipolaricaulota bacterium]TFH08264.1 
hypothetical protein E4H08_08105 [Candidatus Atribacteria 
bacterium]
Length=232

 Score = 55.2 bits (129),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 37/196 (19%), Positives = 66/196 (34%), Gaps = 9/196 (5%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
            G       L        ++L     L+    +    I L     +L  L      + +  
Sbjct  10   GFDAFTERLPLLLGAWLVILGCQQLLDLLIPDTWLFIQLIASMVVLAPLYAGQHLLALKA  69

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ-----Y  237
             + +   FR +  G+  +G      +++ L+   G+L LIIPG++  + + F        
Sbjct  70   VRREPVAFRELFAGMHQLGPIIGAYLVVSLLTILGTLALIIPGIIITLMYSFVLIRFLDP  129

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR----IPYVGEAAN  293
             L D        L +S  +  G+   IFG  +LL +  + L  L+       P       
Sbjct  130  KLGDRRARVTDTLSESSHITKGYRGTIFGIGLLLAIPYMVLGILSWISMYHTPIPSWTIE  189

Query  294  LAFSLLLTPFSFLYYY  309
            +   L  T F      
Sbjct  190  IVAILSGTLFLGPVQA  205


>RKH91544.1 hypothetical protein D7Y13_38620, partial [Corallococcus praedator]
Length=39

 Score = 50.2 bits (116),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 9/35 (26%), Positives = 13/35 (37%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            V CP C    N    ++P   +  +C  C  T  
Sbjct  2   KVSCPSCQTNYNIDDRRIPPGGAKLKCARCQTTFP  36


>MAY79321.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=242

 Score = 55.2 bits (129),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 30/201 (15%), Positives = 61/201 (30%), Gaps = 3/201 (1%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
               L+    L I   F    +A      +       +W+   LLA V    + ++    +
Sbjct  32   MGTLIASIDLVIFNQFQIPMNAGAAANQSGFIKIMLSWEAVRLLAEVFLGPIVVAMTIFT  91

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
            +  +       L+++    L           +  L +  G       G+LF + + F   
Sbjct  92   VRTHTHGGKATLYKAFNFALARYSRIFKWHAITWLAINIGMSF-CFVGILFLLQYAFVDA  150

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            +L  ++      L +S  L  G    IF     L+++   +S     +    E       
Sbjct  151  ILCLEDEEW--PLARSAKLTRGRRGRIFALAAPLILVQSVISIADLMVLGWAEPLFALAG  208

Query  298  LLLTPFSFLYYYLIYSDLKAN  318
            L    F    +  +   +   
Sbjct  209  LKTILFMANIWLFMAFYMFYE  229


>MBK0399387.1 zinc-ribbon domain-containing protein [Limibaculum sp. M0105]
Length=440

 Score = 56.3 bits (132),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 15/36 (42%), Gaps = 0/36 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + CP C A+   P S +  +     C +C +  
Sbjct  1   MSEIICPSCEAKYRVPDSAIGPEGRRVHCSKCGEVW  36


>RVX03270.1 hypothetical protein CK203_020024 [Vitis vinifera]
Length=364

 Score = 56.3 bits (132),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 32/263 (12%), Positives = 73/263 (28%), Gaps = 12/263 (5%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
            AT+          T     Y     + L  S    L    +F        L++    +  
Sbjct  89   ATLCLFKAAYLLFTLIFSFYPLPPVLRLSHSCSTFLISWIAFMAPSNTGFLILFFLVIFY  148

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            ++  +   + +     V   ++  G++A+++SR L+ G        F  L   ++ +   
Sbjct  149  LVGLVYMSIVWQLANTVSVLEDSYGIEAMKRSRELIKGKVGVAVFIFFKLGFFNIIIQAA  208

Query  282  TARIPYVG-----------EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
              R+   G                   ++L  F    + +IY   K+ +           
Sbjct  209  FQRLVVHGESLDMINRAEYAIICFLLHVMLVLFGHALHTIIYFVCKSYHNENIDKLALSD  268

Query  331  WLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEE  390
             L   +     + + G   V   +      +L      + + +               + 
Sbjct  269  HLEAYSDHQDHVPLKGED-VKPGQLLGEITELQQEKNGVFRAVWHLQGSLQAHLLKEEDF  327

Query  391  PQRLSSADYKLLLSKQRKTTSEG  413
                   D+  +L       + G
Sbjct  328  HPNHPYFDHPTILHLLGSGWTYG  350


>WP_180325520.1 zinc-ribbon domain-containing protein, partial [Rhodobacter sphaeroides]
Length=233

 Score = 55.2 bits (129),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 9/40 (23%), Positives = 15/40 (38%), Gaps = 1/40 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M  + CP+C A+       +P +    +C  C       P
Sbjct  1   MRLI-CPNCDAQYEVSDEAIPPEGRDVQCSNCGHGWFQRP  39


>WP_187707700.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Sphingomonas sediminicola]QNP44742.1 glycerophosphoryl 
diester phosphodiesterase membrane domain-containing 
protein [Sphingomonas sediminicola]
Length=216

 Score = 54.8 bits (128),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 26/98 (27%), Positives = 42/98 (43%), Gaps = 3/98 (3%)

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
                ++ +  L+  V F     V + +NIG L  +++S  L SG++W + G  +LL+V +
Sbjct  87   LMFGIIGLAALIISVRFTLVSPVASAENIGPLAIIKRSWRLTSGNYWRLLGFILLLIVAT  146

Query  276  LTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
            L L         VG      FS  + P S     L   
Sbjct  147  LILMMAAG---VVGGLLARMFSPSIEPMSIGALILASF  181


>MBF1305591.1 hypothetical protein [Oribacterium sinus]
Length=445

 Score = 56.3 bits (132),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 37/375 (10%), Positives = 96/375 (26%), Gaps = 21/375 (6%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
                  + I       FS   +        +  N              L    +  S+  
Sbjct  71   FALYVFVCIFYFIVLGFSFRKMLLEMSRGEKKVNAAMIFYGFHPNSFPLLGRMLGFSILA  130

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
            ++  T + L + +       G        +   V           + F ++F    + + 
Sbjct  131  FLFSTVISLGQEVIDYFFGEGLIYWSYYGISFAVL----------IFFSLFFDMTFFTVW  180

Query  241  D-DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG-------EAA  292
            D +     Q ++KS  ++ G  ++ F + +   +++    F+     ++           
Sbjct  181  DGEGNSLWQNMKKSAGVMKGQKFSYFRQILYFSLVTSVAGFIAVMWSFILKSPALPIFVL  240

Query  293  NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSL  352
             L  S+ L P+      LIY +              R+                  L ++
Sbjct  241  FLYLSIYLIPYLGFVQALIYRNG---AGDFSTISSARREAISPEQARDSETASIEQLTTV  297

Query  353  SRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSE  412
               ++     + A    +      P          P   Q  +         K+++    
Sbjct  298  EASSIEQPTTVEAASIEESATVETPSTEEPATLEAPSIEQPAAEEVPSTEEPKEKEGGFS  357

Query  413  GGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLY  472
              +S             + +      +++       + A +       +   ++D     
Sbjct  358  VVISEDQEERSLSGLSDNAEEGTEAGEVKQDTEGAGTEATETQQETATENKPENDGESKP  417

Query  473  DRQHSFEHPAFHWVG  487
            +     +    H + 
Sbjct  418  EADAEGKQDPNHKMS  432


>KEY99875.1 membrane protein, partial [Sphingomonas sp. BHC-A]
Length=113

 Score = 52.5 bits (122),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 15/70 (21%), Positives = 26/70 (37%), Gaps = 1/70 (1%)

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            +   V       V+ D   G + +L     L  GH+W +FG  + L ++S  +      +
Sbjct  7    IFASVRLLLLNPVVIDGTEGVMASLRHGWALTRGHFWRLFGFILALTLLSAIVGGAAQAV  66

Query  286  P-YVGEAANL  294
               VG     
Sbjct  67   FGLVGALIGG  76


>MYD36770.1 hypothetical protein [Dehalococcoidia bacterium]
Length=330

 Score = 55.9 bits (131),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 29/196 (15%), Positives = 60/196 (31%), Gaps = 12/196 (6%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
                +           ++   +++       I   ++         +            L
Sbjct  121  YMFVIGAHYVVGRVLISEALTFSLQRVGSLIITTLMALGVVFAVWTLLFLAP------FL  174

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
                    TL+  L  L+   G +  ++      + + F   V+A + + G+ AL +S  
Sbjct  175  IANATIDATLIAALFGLLSFLGFIGAVVITAYVAIRWTFIWPVIAFEGLIGVAALRRSWE  234

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG------EAANLAFSLLLTPFSFLYYY  309
            +    WW  FG  V+  +    + F        G         NL    +  P   +  +
Sbjct  235  ITENFWWRTFGVVVMTTLAVFAIGFPGLIATGAGLDTVGNWYTNLISPAISGPIQTIMIF  294

Query  310  LIYSDLKANYRGPQHP  325
            L+Y+DL+     P   
Sbjct  295  LLYADLRTRKEDPTGY  310


>HAH45700.1 hypothetical protein [Planctomycetaceae bacterium]
Length=250

 Score = 55.2 bits (129),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 38/207 (18%), Positives = 84/207 (41%), Gaps = 4/207 (2%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
              +     + +  I +  ++ A     + ++ +    L        + I +  +  ++  
Sbjct  28   IHIIAYLLYVVGAIAVFMVLGAIGFAGALIIGQADQGLIIAGVVLLYIIAILLIYSVVFY  87

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            L        + + + +      +  G   +G   L  I+ IL    G + LIIPGL+   
Sbjct  88   LMLGVLRYLLKVVRNEYPGMGEIFSGGPFLGRMILCSIVFILAYTVGLVALIIPGLIIAF  147

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY---  287
             F+   Y+L D ++ G+ A  +SR + +G+  ++F  ++++  +S+    L A +     
Sbjct  148  MFWPYAYLLIDRDLPGIDAFTESRKITNGNKLSMFLIYLIMTGVSMVPYGLMAAMLASLE  207

Query  288  -VGEAANLAFSLLLTPFSFLYYYLIYS  313
              G  A+    L+L  F   +Y L+  
Sbjct  208  QAGGQASALGFLVLLGFMAFFYVLLIP  234


>NDF13025.1 hypothetical protein [Proteobacteria bacterium]
Length=223

 Score = 54.8 bits (128),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M    CP C  +   P   +P    + +C  C    
Sbjct  1   MIL-TCPACSTQYTVPDEAIPPAGRTVKCTSCSHMW  35


>SUA44950.1 Uncharacterised protein [Nocardia africana]
Length=667

 Score = 56.7 bits (133),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 35/383 (9%), Positives = 84/383 (22%), Gaps = 23/383 (6%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               +  +   +A      + +      L          +    +    +  +    +   
Sbjct  139  FSVLARVVFTVAVDTGDDSSISGFLAHLMLGLVFLAVVLCAIGIPVDAMVNAVCVITADR  198

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC-----------  229
             +    V L   +    R       L+++   V      L+ I   L             
Sbjct  199  AVRGERVRLSEVVGAARRRFWPLCRLMVVFYTVFLVAPWLVEIAAFLVAGFGTGMAALPF  258

Query  230  ---------VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
                     + F     V+  +  G +++L +S  LV   +  + G  +L  V+ +    
Sbjct  259  VFIAIYVLGIVFSLAPVVMTLEGTGVVESLSRSAALVKPAFLRVVGLQLLWSVLVVGALM  318

Query  281  LTARIPYVGEAAN---LAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAA  337
            L+     +           ++ +  +  +   + +   +A           R       +
Sbjct  319  LSGLPFGLISLLVPSDAVVTVFVPFYLTIAVVVAFPLFRAVQTLIYTDLRLRSGTYGGES  378

Query  338  IFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSA  397
             +                 +S    +   +                   LP  P  +  A
Sbjct  379  DWRSGKDGIGEHPVTGDDGISTSNAIVTLRPYHVCRIVAAFALFQFFFDLPGIPWLIIVA  438

Query  398  DYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSAR  457
                     R      G  +G        + +    P      E                
Sbjct  439  GCVAGEWFIRSRGLSWGPEIGATLAALRLYPSGTAQPDQATHDEPPQPDRDPANDPKPPA  498

Query  458  IEIDKVLDDDARDLYDRQHSFEH  480
             E          +  +     E 
Sbjct  499  AEAADPPGAPEPEGVNTIRLGEP  521


>WP_133494935.1 hypothetical protein [Stakelama pacifica]
Length=280

 Score = 55.5 bits (130),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 21/150 (14%), Positives = 49/150 (33%), Gaps = 5/150 (3%)

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
               +    +  ++ + +    V          +       +  L +  V   ++L  +  
Sbjct  102  LPAVIGILVLVTIVLMVLALPVLFVFWRAGMGQGAMHAPDVSPLYLAFVVLYAILYAVAL  161

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF----L  281
            ++          V+  +   GL AL +S  L  G+  AI G  +L  +++L        +
Sbjct  162  VVLIARLIVTTPVIVAER-NGLSALRRSWQLTRGYGAAIIGIVLLFGIVALIAQLAVKTV  220

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
               I  +      +FSL     +     ++
Sbjct  221  LGSILLLFLGGEGSFSLSAILIACASGIVL  250


>MBD3160354.1 hypothetical protein [Candidatus Lokiarchaeota archaeon]
Length=308

 Score = 55.5 bits (130),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 22/222 (10%), Positives = 75/222 (34%), Gaps = 19/222 (9%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY--ILLGLSWMT  175
                + +  + I+       +        + +          +  + ++      +    
Sbjct  94   MDQFIVLVPMTIISMVIYAVAGGAAIKLAFDDYGEPGRGDVDMSLSYSFGKAWSLIGAQI  153

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
                + +      L   + +    +    +  +L ++ +   + +      +  V     
Sbjct  154  IVGLVLLILQIPTLLTFVFMLTGDIELIAIASLLSLVGMVLSAYIGTRLTPVSAVVIA--  211

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL---------TARIP  286
                 ++N G   A++++  L SG++W IFG  +LL ++   ++ +          +   
Sbjct  212  -----EENTGAFGAVKRAWGLTSGNFWHIFGGQLLLGLVVGIITIIVELVIGMLTFSIAG  266

Query  287  YVGEAAN-LAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
            +VG   + +   +L +   +++  ++Y DL++     +    
Sbjct  267  FVGIVISTIIVGILFSSVDYIFQTVLYRDLESRAAESEEDWW  308


>MBA3394167.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=244

 Score = 54.8 bits (128),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 32/216 (15%), Positives = 64/216 (30%), Gaps = 18/216 (8%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
              R +    +    +        + + L     +     +     + A V    L    +
Sbjct  17   WFRNFIPFTLIAAVLYSPVILWLATIDLSATRSVEDLLNDVFVRPIYALVGLSALLAPML  76

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
            T  +   +  T V +  S+K G+R +    LL +L  +        +I   +     +F 
Sbjct  77   TYRVIQELNGTKVSMMASVKFGMRGILPAILLAVLTNVAQLVPMGGIISAIITCI--WFV  134

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY-------  287
                   + +G + A  +S  L  G  W IFG   L+ +  + L        +       
Sbjct  135  AAPAAVAEQLGPVTAFTRSSELTRGRRWGIFGLTFLIGLALVALLMAWVIPMFEKSEADL  194

Query  288  ---------VGEAANLAFSLLLTPFSFLYYYLIYSD  314
                     +       F +       + Y L+  D
Sbjct  195  MSTMRQSAIMFVVTMGIFQMFTGIVQAVSYALLRLD  230


>OUO25217.1 hypothetical protein B5F87_18145 [Eubacterium sp. An3]
Length=314

 Score = 55.5 bits (130),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 43/313 (14%), Positives = 89/313 (28%), Gaps = 33/313 (11%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             VRC  CG +R+    +    K++ R  +  +T I      Q  ++              
Sbjct  24   RVRCQRCGTQRSFHHRRQWRGKATGRKMDGRRTHIVPAGSGQNERSFRYTMKRAADFRGI  83

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
               S      S                 +    S +          D   +  +   GLL
Sbjct  84   ARGSLSGHWASAVGTTLLAGLLGANITMQGSAVSLTSNLYNQYAGEDGSRVSIKLPPGLL  143

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
                  ++++   IF                             I   +S       + +
Sbjct  144  LTIAAILMVSALVIFIIA---------------------VVQYIIGSFVSLGLAIYNLNL  182

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                      +      +G    L + + +     SLLL+IPG++    +    +++A++
Sbjct  183  IDRKEARVGQIFCHTSIMGKAVWLRLRMSIFTFLWSLLLVIPGIIKSYSYSMSGFIMAEN  242

Query  243  -NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
              +   +A+E S  +++G+ W +F      +   +   F                 L L 
Sbjct  243  PEMSAKEAMEVSMRMMNGNKWRLFCLQFSFIGRGILCLFTFG-----------IGYLWLN  291

Query  302  PFSFLYYYLIYSD  314
            P+        Y +
Sbjct  292  PYMNAATAAFYDE  304


>NJS13976.1 hypothetical protein [Sphingopyxis sp.]
Length=174

 Score = 53.6 bits (125),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 27/156 (17%), Positives = 56/156 (36%), Gaps = 8/156 (5%)

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
            +   LL        M +++      +  ++      +    LL I+  + +G  +LL I+
Sbjct  1    MLVALLSSFAAVTVMRLWLSPGGTSVGEALAFAASLIPVIVLLFIIQAISMGFAALLFIL  60

Query  224  PGLLFCVWFFFCQYVLAD-DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL-  281
            P L     +     +LA  +  G + AL KS  +  G+ W I     L+ ++   ++ + 
Sbjct  61   PALYLTGRWAPMLALLAAGETRGPIDALAKSWAMTRGNGWRIALMLFLVQLVVAIVTLIL  120

Query  282  ------TARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
                        +G A     +  +     L  Y +
Sbjct  121  DSTGSLFGARGTIGHAVASVINAAMAALGALVAYAL  156


>MBI4017641.1 hypothetical protein [Candidatus Aenigmarchaeota archaeon]
Length=261

 Score = 55.2 bits (129),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 22/183 (12%), Positives = 57/183 (31%), Gaps = 4/183 (2%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
                  +     A +++  +Q  +      + A+ +   + +       I    +    +
Sbjct  75   AILFLVSGAFLQAFYVDVTSQFAKHKKPDISSAFYVAKRNLLPVLWVQAIIGILIFAVVA  134

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
              + +  V    L      L        +    L      +     +  +   G  A+++
Sbjct  135  AVVIVSVVFRGILASTADNLAFLASLFAV----LYVQTKTWMAVTSVVLEKKRGWSAVKR  190

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
            S  L  G +  I    + + + +  ++ +   IP+ G    L  SL  T +++      Y
Sbjct  191  SFALSKGRFAEILFIILTISIATSIVNTVFDTIPFAGAVLTLLASLFFTVWTYTAPAAFY  250

Query  313  SDL  315
             + 
Sbjct  251  FEY  253


>CDE11622.1 uncharacterized conserved membrane protein [Clostridium sp. CAG:354]
Length=310

 Score = 55.5 bits (130),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 31/203 (15%), Positives = 77/203 (38%), Gaps = 6/203 (3%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +  IY   + +       ++           N    W +    +  ++  L  +  ++ I
Sbjct  59   IPIIYGFMMTMMKLKRGESVGYFDFFKDGFTNFKRAWCLTGRLLLKMIAPLILIIVAIII  118

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLIL----LILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             +      L  +  +G+    S     I+    L ++     L   I   +  + +    
Sbjct  119  MVVSIIASLATAFAVGISTSTSTLGGAIVGVSALTIIAYIVVLAAYIWLFVKSLSYGLGI  178

Query  237  YVLADDN-IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
            Y+  D+  +  L  + KS+ L+ G+   +F  ++  +   + L+ + + IP++G  A +A
Sbjct  179  YLAIDNPTMKPLDCVNKSKELMDGNKGRLFCLYLSFIGW-IALAVVVSLIPFIGWIAAIA  237

Query  296  FSLLLTPFSFLYYYLIYSDLKAN  318
             +++LTP+      + Y +    
Sbjct  238  GTIVLTPYIGFASIVFYENRAGI  260


>TGU41990.1 hypothetical protein EN829_072995, partial [Mesorhizobium sp. 
M00.F.Ca.ET.186.01.1.1]
Length=92

 Score = 51.3 bits (119),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 20/92 (22%), Positives = 48/92 (52%), Gaps = 0/92 (0%)

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
                    S+++ +R +     + +L+ +      + LI+PG++  + +     VL  + 
Sbjct  1    GRRPSFGDSVQIAIRFLLPTLAIGLLVGIGSALAMIALIVPGIILWLGWSMSVPVLIQEQ  60

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
            +G   ++ +SR L  G+ W++FG F++L++I+
Sbjct  61   LGVFGSMSRSRALTKGNRWSLFGLFLILVIIA  92


>NLD60403.1 DUF975 family protein [Clostridiales bacterium]
Length=218

 Score = 54.4 bits (127),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 24/167 (14%), Positives = 59/167 (35%), Gaps = 14/167 (8%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +      L          +          +   LR  G    L IL         LL 
Sbjct  42   LVLLLFGPALRLGLYESISSLYSGGHPRASQLFSKLRFFGKALWLGILEAFFTFLWMLLF  101

Query  222  IIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLL---VISLT  277
            I+PG++    +    Y+L  +  +  + AL +S+ +++G+   +F  +   +   +++  
Sbjct  102  IVPGIIASFRYAMAFYILWKNPEMRAIDALRESKRMMNGNKGRLFCLYFSYIGWELLAAV  161

Query  278  LSFLTARIPY----------VGEAANLAFSLLLTPFSFLYYYLIYSD  314
             SF    +P+          +     +A  + ++ + ++  +  + D
Sbjct  162  PSFALILLPFAILPELSLHALAWVLTIAGGMFVSSYVYVGEFEFFKD  208


>WP_129120554.1 hypothetical protein, partial [Deinococcus metallilatus]RXJ07507.1 
hypothetical protein ERJ73_19705, partial [Deinococcus 
metallilatus]
Length=146

 Score = 52.9 bits (123),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 29/144 (20%), Positives = 56/144 (39%), Gaps = 2/144 (1%)

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
             LF       +++G +  L+ L+ L +      +IIPG++  + +    Y+L D  +   
Sbjct  2    PLFIFDGKYRKYMGEYFTLIGLMYLAIIPALFFMIIPGIIISIGWSLAIYILLDKGVAPG  61

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP--YVGEAANLAFSLLLTPFSF  305
            +A+ +S     G+ W IFG   LL +    L  +   I             +++ T  + 
Sbjct  62   EAMIRSNKATYGYKWTIFGVSFLLGLAFYVLMMIIFNIASGGFAMLLLFIIAIVYTVAAL  121

Query  306  LYYYLIYSDLKANYRGPQHPPIKR  329
                +IY +L A  +         
Sbjct  122  GCTAVIYRNLTAEAQPEATETAAE  145


>WP_073996011.1 hypothetical protein [Arcanobacterium urinimassiliense]
Length=392

 Score = 55.9 bits (131),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 24/184 (13%), Positives = 56/184 (30%), Gaps = 23/184 (13%)

Query  158  AILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG  217
             +      ++     +    +   I    +       + +   GSFT       +     
Sbjct  207  PLPCLGWNFLRYIGLYAIYYLAFIIGGGIL-----FAILIATGGSFTATDFTSNIADSIF  261

Query  218  SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
              LL +  + F +        L  ++IG L A+++S  L   ++W + G   L  ++ + 
Sbjct  262  LTLLCLLLIFFSIRLSVAGPALVAEDIGPLAAIQRSWKLTKNYFWRLLGVVALTAILLIA  321

Query  278  LSF----------------LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
             +                 +   I +           + TPFS     ++Y +++     
Sbjct  322  ATLVIIISFSVIIIAIDSSVATLISFFAILL--ITMAVFTPFSMAVTNMMYVNMRFAKEN  379

Query  322  PQHP  325
                
Sbjct  380  FAQQ  383


>NOZ34534.1 hypothetical protein [Chlorobi bacterium]
Length=167

 Score = 53.2 bits (124),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 18/109 (17%), Positives = 40/109 (37%), Gaps = 10/109 (9%)

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
             F   ++  +N    +++ +S  ++ G WW  FG  ++  +I  ++S++     Y     
Sbjct  1    IFIFLIIIYENKSATESISRSFEIIKGKWWQTFGLILVFGLIIGSMSYIFIIPIYAVVIV  60

Query  293  NL----------AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
                           L L  F +   YL +  ++      Q+  I+   
Sbjct  61   AFLSGTQIAAGSVILLSLFIFLYFTAYLFFMSMQQIMVAFQYFNIRSGK  109


>KPQ16331.1 zinc-ribbon domain [Rhodobacteraceae bacterium HLUCCO18]
Length=431

 Score = 55.9 bits (131),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 16/36 (44%), Gaps = 0/36 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
            + CP+CGA      + +P +    +C +C  T   
Sbjct  20  RLTCPNCGARYEVDDALIPPEGRDVQCSDCATTWFQ  55


>WP_116689066.1 hypothetical protein [Pelagibaculum spongiae]PVZ63547.1 hypothetical 
protein DC094_20920 [Pelagibaculum spongiae]
Length=265

 Score = 55.2 bits (129),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 35/249 (14%), Positives = 82/249 (33%), Gaps = 1/249 (0%)

Query  70   EIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGI  129
              Q    +       +          +   + ++        ++  R          +  
Sbjct  1    MFQRYKESWFFYRGHWQSMVRLVLLINFPLILAVGWFAPTEQQVIDRFTMLNQVAVSVQQ  60

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
                + I      K  +        +   + +       L L+ +T  M   I    V  
Sbjct  61   DSQQSAIEQQSTDKLLSQYQNSEPLFTPGLGVLQTICWALSLAVLTLFMQQRIAGQPVNE  120

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQA  249
                + GL+ +G   +  +L+ ++V  G  L I+PG+   +   F  +++ +D +  L+A
Sbjct  121  SALFRQGLKLLGMVGVATLLINMLVVMGLQLFILPGVWLMMKTTFVPFLIVEDRLSPLKA  180

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVI-SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYY  308
              +S ++  G    +F   VL  +I  L  +F    + ++   A L  +  L   +  + 
Sbjct  181  FRQSLIMTRGIASQMFLGMVLTGIIGLLVFTFTAGLLGFIAFPARLMVATTLMLLAMSFN  240

Query  309  YLIYSDLKA  317
             + +     
Sbjct  241  TVFFYRYFC  249


>MAR30385.1 hypothetical protein [Candidatus Marinimicrobia bacterium]
Length=259

 Score = 54.8 bits (128),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 26/184 (14%), Positives = 59/184 (32%), Gaps = 17/184 (9%)

Query  129  IVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG  188
             +      F  +L      +      +   +    +  +   +  +  ++   I  T   
Sbjct  85   NMGMLFSSFDLILKSFNASILFSLAIFLMILPGLIIVLMACNIDSLFMAIISSIDLTASP  144

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
               +    L ++  +   L +L + V   ++      +   +   F QY + D+     +
Sbjct  145  PSLNFGNDLSNIEIYNKPLFILGIAVAIINV------IWCAIRLQFYQYFIVDEQHSAFK  198

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYY  308
            +L+ S +L   H             I L  + L   I ++G        ++  PFS L  
Sbjct  199  SLKSSYVLTDNHID-----------ILLQFAILILGINFLGLLCFGIGLIITIPFSLLAM  247

Query  309  YLIY  312
              +Y
Sbjct  248  TKLY  251


>RNJ62560.1 thioredoxin [Porphyrobacter sp. IPPAS B-1204]
Length=306

 Score = 55.5 bits (130),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 12/44 (27%), Positives = 21/44 (48%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  + CP CG     P + + ++  + RC +C  +   DP E +
Sbjct  1   MI-IACPACGTRYAVPDAAIGSEGRTVRCAKCKHSWFQDPPELE  43


>WP_124054136.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Arcanobacterium ihumii]
Length=443

 Score = 55.9 bits (131),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 16/121 (13%), Positives = 40/121 (33%), Gaps = 20/121 (17%)

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
             +        +  +  G ++A+++S  L  G ++ I G  VL +   + +  +   I  +
Sbjct  319  SIRVMLAPTAVVIEGAGPIEAIKRSWNLTRGSFFHILGLIVLTIFFGIAVGIIFMIIFAI  378

Query  289  --------------------GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
                                   + L  S L+ PF+     ++Y +++          + 
Sbjct  379  VGGITASTGSSGAQIATLISFSLSMLIGSALIVPFTTALTNMVYINMRFRRENFHQQLLM  438

Query  329  R  329
             
Sbjct  439  E  439


>NLE65142.1 hypothetical protein [Elusimicrobia bacterium]
Length=273

 Score = 55.2 bits (129),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 29/158 (18%), Positives = 58/158 (37%), Gaps = 4/158 (3%)

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
                  + +L        L +S     + + + +        +      V  + L  +  
Sbjct  107  PFPRAVYYLLAFLFWAAGLLMSVGFTKVHLMLMRDQEPEVSELFTNGNLVVPYLLGSLCY  166

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
             L V GG +LLI+PG++  +      Y++ D  +G L AL +SR++  G  W +      
Sbjct  167  GLAVLGGFILLIVPGIILSIMLGLYAYLIVDKGLGPLAALRRSRVITRGQRWRLAN----  222

Query  271  LLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYY  308
               + L L+        VG       S++ + + +   
Sbjct  223  FWGMLLLLNLAGLLCLLVGAIVTGWISVIASAYVYEKL  260


>MBE9555140.1 zinc-ribbon domain-containing protein [Proteobacteria bacterium]
Length=99

 Score = 51.7 bits (120),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 10/56 (18%), Positives = 15/56 (27%), Gaps = 1/56 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP  56
           M  + CP C  +     ++L       RC +C       P               P
Sbjct  39  MI-ISCPSCKTQFRVDEARLTPDGKKVRCSKCGHVWQAMPDGQSAPSVEQPAPVMP  93


>WP_042268190.1 DUF975 family protein, partial [Clostridium perfringens]
Length=225

 Score = 54.4 bits (127),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 22/165 (13%), Positives = 61/165 (37%), Gaps = 1/165 (1%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
            W+  I       +      +  S+   +    +      +       ++  +L   I + 
Sbjct  28   WRNFIKKFLALILFELPISLIASIIAVVSFISIISNHFYEYLFMASMNYEDILSQYIGIF  87

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
                L++    ++  ++FF  +Y++ ++  +G  +A+ K+  ++ GH W +F   +  + 
Sbjct  88   IIIILIVATYNIIVSLFFFPVKYIIVEEPELGIWEAVGKAFKMMKGHKWELFVLILSFIG  147

Query  274  ISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
             ++           +    NL+  +L      L ++ +Y D    
Sbjct  148  WAILAVLPIVLGSIIIVLMNLSVYILPIFAIGLIWFFVYRDTIYR  192


>WP_126643587.1 hypothetical protein [Embleya hyalina]GCE02021.1 membrane protein 
[Embleya hyalina]
Length=564

 Score = 56.3 bits (132),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 44/342 (13%), Positives = 85/342 (25%), Gaps = 43/342 (13%)

Query  21   PAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRR  80
            P + +                 +Q           P+    +        +    +    
Sbjct  202  PHQGAD----HPGAAPDPRWGGTQPLWGAPPGWNGPYVQAAKPGVIPLRPLSVGEILDGA  257

Query  81   CNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSAL  140
                          A    L +    +  +W L      G L     G  L      ++ 
Sbjct  258  FAAMRTHWKVMIGIAVVIALLTQCLEVPATWLLNREFSPGDLSDEPTGEELWRYLRDTSA  317

Query  141  LLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHV  200
            +L     ++   Q     +L   V+  ++  S      +  +      L           
Sbjct  318  VLIVPIVVSTLGQIAATGMLTVVVSRAVIAKSMSAAQAWKAVRPLLPRLLGVTFATWLVP  377

Query  201  GSFTLLLILLILVVGGGSLLLIIPG-------------LLFCVWFFFCQYVLADDNIGGL  247
                L+ IL  L +       +                +   V       VL  +     
Sbjct  378  VGTLLVAILPGLALLAVGADGLGALLLLPGLIGGVVAAIYLYVCLTLAGPVLMLEKQTVR  437

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP---------------------  286
            +ALE+SR LV+G WW + G  +L+ +I   +  +                          
Sbjct  438  KALERSRKLVTGSWWRVCGILLLIWLIMAIVGGIIQMPFLLVSDGFTTLTASKASDIPDP  497

Query  287  -----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                  +     +  + LL PF+     L+Y D +       
Sbjct  498  TFVDLLITGVGAVIAAALLYPFAAGATALLYIDQRIRREALD  539


>WP_101754290.1 zinc-ribbon domain-containing protein [Paracoccus zhejiangensis]AUH66313.1 
hypothetical protein CX676_11560 [Paracoccus zhejiangensis]
Length=210

 Score = 54.0 bits (126),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 15/61 (25%), Positives = 19/61 (31%), Gaps = 0/61 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP CGAE    +S +PA      C  C        A   +   T    T P      
Sbjct  2   RLTCPECGAEYRVDASAIPADGRDVECSSCGHGWHEPGAAVPKGPMTAAPETGPRLNRPL  61

Query  63  R  63
            
Sbjct  62  P  62


>MBI4023262.1 hypothetical protein [Candidatus Berkelbacteria bacterium]
Length=235

 Score = 54.4 bits (127),  Expect = 2e-05, Method: Composition-based stats.
 Identities = 25/136 (18%), Positives = 53/136 (39%), Gaps = 0/136 (0%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
             L G     G +     K  +  +    +   +  +   L I+L + +  G    ++PG+
Sbjct  89   WLAGAGAAAGILVTQKQKKVLRPWTPYAVAWANYPALFALSIVLGITLWIGYWAFVLPGI  148

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            +         YV+  + +  L+AL  S  L   H+  I    + L++I   ++     +P
Sbjct  149  ILSTLLALSSYVVIGERVTMLEALTTSWNLGIAHFSTILVTGLALVLIGWLIALTAGLVP  208

Query  287  YVGEAANLAFSLLLTP  302
             VG+   +  S+  + 
Sbjct  209  VVGQYLLVLVSVFASV  224


>NOU34840.1 hypothetical protein [Polyangiaceae bacterium]
Length=51

 Score = 49.8 bits (115),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 11/39 (28%), Positives = 15/39 (38%), Gaps = 0/39 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  V C  C A       ++P      RCP+C  T +  
Sbjct  1   MLRVECESCKAPYQVDEKRVPPTGLKMRCPKCGHTFMVQ  39


>WP_166885743.1 hypothetical protein [Massilia sp. CCM 8734]NHZ95063.1 hypothetical 
protein [Massilia sp. CCM 8734]
Length=229

 Score = 54.4 bits (127),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 42/198 (21%), Positives = 69/198 (35%), Gaps = 9/198 (5%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
                  A             Q W   ++ A +      ++ +   +     +  V    +
Sbjct  29   VLLFALAGSGFLLGQTEYLGQAWALYLIGAFLVLCPALMAPLVVRVHASAHERSVSAGEA  88

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
            ++ GLR          L +L VG GSL+ I+PG    +   F    L  D +  L+AL  
Sbjct  89   LRRGLRCFLPCLAGAALFMLAVGIGSLMFIVPGTYLFIALAFWWIALVVDGLPVLKALMS  148

Query  253  SRLLVSGHWWAI-FGRFVLLLVISLTLSFLTARIPY--------VGEAANLAFSLLLTPF  303
            S  LV GHWW + FG   +  V+    +F    +          V     +    L+  F
Sbjct  149  SLRLVRGHWWHVAFGFSYVYPVVFCVNAFDVNMVSLSNQVVEAVVATVIAVVLCALVPMF  208

Query  304  SFLYYYLIYSDLKANYRG  321
            +     +IY DL    + 
Sbjct  209  ASANMVVIYHDLTLRRQR  226


>HBH00242.1 hypothetical protein [Rhodobacteraceae bacterium]
Length=56

 Score = 50.2 bits (116),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 10/34 (29%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP CGA        +P       C  C    
Sbjct  2   QLTCPECGAIYEIDDEAIPPGGRRVECSACWHVW  35


>MBI2303021.1 hypothetical protein [Armatimonadetes bacterium]
Length=246

 Score = 54.4 bits (127),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 29/239 (12%), Positives = 53/239 (22%), Gaps = 3/239 (1%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              VRC HCGA    P  K        RC E  Q      A++   +  +          +
Sbjct  3    IQVRCAHCGAVYQVPDDKAGQSG-KCRCGELMQVPAKPAAQAAPARPAEVRRPAAPTLAE  61

Query  62   RRIPSDRLEI--QSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGW  119
                     +                    +    A                 +  R   
Sbjct  62   AAGHERAGPMVECRHCGMKTHEGMECEWCHQPLGTAPPVPTLHRPAGEQMHERVHERHSG  121

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             + GI ++ +VL    + +  +L  A             +         +        + 
Sbjct  122  AVPGIVVVVLVLIGLSLLAIPVLAVAMRGAMLGFAAAMGVPPQAGIANFVLAVAGVTWLL  181

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
                   +         +  V     LL    L +    +  +    L  V F+     
Sbjct  182  SLGLFIGILKRSPAAYWIYLVLQILALLGGFGLAIFVHRIPYLSLINLTIVRFWCMPNC  240


>TNC74117.1 hypothetical protein FHG71_02655 [Rubellimicrobium roseum]
Length=374

 Score = 55.5 bits (130),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 9/53 (17%), Positives = 17/53 (32%), Gaps = 0/53 (0%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATC  55
             + CP+C A+   P   +P +    +C  C        A +       +    
Sbjct  82   RLTCPNCNAQYEVPPDAIPPEGRDVQCSACSHRWHEPGAAAAPAPEPWDEVPR  134


>RME92470.1 DUF4339 domain-containing protein [Verrucomicrobia bacterium]
Length=324

 Score = 55.2 bits (129),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 30/145 (21%), Positives = 46/145 (32%), Gaps = 0/145 (0%)

Query  135  PIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMK  194
                AL       L+   Q    A  +A +       S +   M                
Sbjct  107  FPLFALGSLILWLLSFLGQTVVCAGPIAALLLTGPLTSGLCLVMLKRARNEPAAPRDQFT  166

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR  254
                    F LL ++  + VG G +L ++PGL+    + F   + AD   G   AL  S 
Sbjct  167  FMGPLFVPFMLLGVVHQVAVGLGMVLCVVPGLVLAWLWSFAFVIAADRQCGFWDALRASA  226

Query  255  LLVSGHWWAIFGRFVLLLVISLTLS  279
             LV     A      +  + S+  S
Sbjct  227  GLVRARPLATLVLVTVAWLPSILAS  251


>CCX41659.1 uncharacterized protein BN454_00973 [Clostridium sp. CAG:1024]
Length=302

 Score = 55.2 bits (129),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 27/158 (17%), Positives = 58/158 (37%), Gaps = 2/158 (1%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
            L  A + + L+   A  L+         ++     +++  L+     + + + +      
Sbjct  67   LCLAWLLTILVSGLAGMLSVVMPLLGVLMVPMATVFVIQLLNGGQRIVGLLVYRQKTFET  126

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQA  249
              + L  R    +   L   +L     +LL  I G++    + F  Y+L D   +   QA
Sbjct  127  NDIFLCFREYSRYLAGLSWQLLWTSLWNLL-PIIGIVKAYSYSFVPYILYDHPELTAKQA  185

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
            L++S  L  GH   +F   +  +  +L        + +
Sbjct  186  LKQSMRLTDGHKMDLFVLDLSFIGWNLLNYMTVGILGF  223


>WP_131283476.1 DUF975 family protein [Alloscardovia theropitheci]TCD54461.1 
DUF975 family protein [Alloscardovia theropitheci]
Length=329

 Score = 55.2 bits (129),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 44/323 (14%), Positives = 82/323 (25%), Gaps = 22/323 (7%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
                +G+ +   ++             A     Q       I +      +        +
Sbjct  21   WGLAVGLLITLSIIPVILAAIQFSSIAAAAAARQLPTASPFIGILIAIAGIYTAITGIVA  80

Query  178  MFIYICKTDVGLFRSMKLGL--RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
                   TD G  + +         G+  +  IL I+     S+L  IPG++    +   
Sbjct  81   FLHIFDGTDQGYGKELTSAFTDGKTGANLVTYILQIVFTFLWSMLFWIPGIVASYSYSMS  140

Query  236  QYVLADD-----NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
             Y++ D         G +A+  S+ ++ GH   +F   +  L   +   F    +     
Sbjct  141  LYIVDDWAKQGYKAQGTEAITASKNMMKGHKGELFVLDLSFLGWYILSGFTFGLLN----  196

Query  291  AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLV  350
                   L +TP+        Y  L A        P+                  G    
Sbjct  197  -------LYVTPYHMATRAAFYRALVA----QSATPVAPVNPMAPTGAPYGAPQAGYGQP  245

Query  351  SLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTT  410
                          A     Q    QP Q      ++P   Q ++    + +   Q    
Sbjct  246  VQGYTPAQGYTQAPAQPVQAQAAPMQPNQPAQPYNAVPVYSQPVAPQPAQPVQPVQPTAP  305

Query  411  SEGGLSLGPVTLFADRFWADDQN  433
             +   S        D        
Sbjct  306  VQPEQSTENPAEPTDPTAQPPYQ  328


>HHB83669.1 hypothetical protein [Devosia sp.]
Length=86

 Score = 50.9 bits (118),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 11/62 (18%), Positives = 20/62 (32%), Gaps = 0/62 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           + CP+C A  +   + L A     +C +C Q+         ++Q        P       
Sbjct  18  ITCPNCQARYDVDPATLGAHGRQVKCAKCHQSWKARIEGPAKSQVKPLDNQAPRSDKNLM  77

Query  64  IP  65
             
Sbjct  78  PD  79


>KYC30039.1 hypothetical protein A0J57_22825 [Sphingobium sp. 22B]
Length=86

 Score = 50.9 bits (118),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 14/78 (18%), Positives = 19/78 (24%), Gaps = 1/78 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V CP+C      P S +       RC  C  +   + A     +     A  P    
Sbjct  1   MILV-CPNCATRYIVPDSAVGPNGRQVRCAACKHSWFQEGAALAPREEALTTAGVPEVAA  59

Query  61  QRRIPSDRLEIQSKTVNC  78
                      Q      
Sbjct  60  PSWNQLCLQAAQRTWRPF  77


>PKN03838.1 hypothetical protein CVU75_00255 [Candidatus Dependentiae bacterium 
HGW-Dependentiae-1]
Length=254

 Score = 54.4 bits (127),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 37/200 (19%), Positives = 72/200 (36%), Gaps = 6/200 (3%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            +++ +        +  L   +            A+    + ++++  +  T  + +    
Sbjct  49   FVIMVPPQALAHAAIQLQVISEVSGSYLIMGLRAVSFFVLFFLMVRFAGGTLKVLLDYWD  108

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
            T    FR++ L            +L  L+   G +  I+PG+     FF C  ++ D N 
Sbjct  109  TKELHFRNLSLSNSFAWRVFCATVLFFLLSALGLVFFIVPGIYIMGRFFLCILIMFDKNT  168

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT------LSFLTARIPYVGEAANLAFSL  298
               ++L+ S  L  G+    F    L L I +        SF+   +  VG     A  L
Sbjct  169  TIRESLKISWDLTKGYERIFFVLAALQLGIWIVSNYLTRFSFMEPLVGVVGAFIGSAVGL  228

Query  299  LLTPFSFLYYYLIYSDLKAN  318
            +L+   FLY   I+  +   
Sbjct  229  VLSIALFLYTSFIWVYVYRK  248


>GAX20217.1 hypothetical protein FisN_12Hu062 [Fistulifera solaris]
Length=279

 Score = 54.8 bits (128),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 25/220 (11%), Positives = 62/220 (28%), Gaps = 12/220 (5%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +  +        + + ++   A +   +    +  ++ A   +  +    +  S+  Y
Sbjct  51   YMVERVLCYPIAIFVQAGIIHVVANFYTQKIATLKRCMMFALSRFRAVFCFALLYSVLFY  110

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            I    +        GL        L  + +L +   S L+    +           +   
Sbjct  111  IFFIGIVGLAIFLYGL--FEDIPHLSYIRVLPLITASALI----IYAMTSLTITLPIFVI  164

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT------LSFLTARIPYVGEAANLA  295
            +      A+++S  L SG+   IF  F+L             L         +    +  
Sbjct  165  EKRSPCDAIKRSFELFSGYRRYIFWIFLLFFTSYFVSLMYQRLIGAIFGFSTLAVVLSRL  224

Query  296  FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLT  335
              ++  P   +   ++Y  L+    G     +        
Sbjct  225  PGIVTLPLQTIIITVLYISLRVQTEGLDMEILINDVQMPI  264


>TNE61745.1 hypothetical protein EP335_15095 [Alphaproteobacteria bacterium]
Length=371

 Score = 55.5 bits (130),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 11/37 (30%), Positives = 15/37 (41%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V CP C A    P   +P +    RC +C  +  
Sbjct  1   MILV-CPSCSANFKIPDGAIPPEGRKVRCAKCKHSWH  36


>MBI2344541.1 hypothetical protein [Candidatus Dependentiae bacterium]
Length=240

 Score = 54.4 bits (127),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 39/213 (18%), Positives = 75/213 (35%), Gaps = 14/213 (7%)

Query  99   GLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWA  158
             +    + L     + C   +  +G+  +G++ A        +      +          
Sbjct  18   FINEQFKRLDIWVMISCISYFAGIGLIAVGVLFACIGFGGFYITNFVFNIFSIFNLQATP  77

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM-----------KLGLRHVGSFTLLL  207
            +        +L   ++   +F         L  +                    SF L  
Sbjct  78   LNFIVSIAFMLLFIFIAYKIFQKAIGILNALMLNSLAATEDKELPRFKQRDERLSFLLYS  137

Query  208  ILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
            +L  L+   GS+ LIIPG++F V F F   ++ D+     +AL+KS  +  G +W IF  
Sbjct  138  LLYFLIFILGSIFLIIPGIIFFVRFAFGYLIMLDEKCSPFEALKKSWNITEGVFWQIFIF  197

Query  268  FVLLLVISLTL---SFLTARIPYVGEAANLAFS  297
             V + +I+L +   S +   +P         +S
Sbjct  198  IVPVFLITLFIPLSSIIFLFVPLNFWIFAYLYS  230


>NKX48573.1 hypothetical protein [Rhodobacteraceae bacterium R_SAG8]
Length=60

 Score = 50.2 bits (116),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 12/45 (27%), Positives = 18/45 (40%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M  + CP+C A+   P   +P+     +C  C QT      E   
Sbjct  1   MRLI-CPNCDAQYEVPDEVMPSSGRDVQCSNCGQTWFQHHPEFPP  44


>KPJ72821.1 hypothetical protein AMJ52_05145, partial [candidate division 
TA06 bacterium DG_78]
Length=76

 Score = 50.5 bits (117),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 12/34 (35%), Positives = 16/34 (47%), Gaps = 1/34 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M  V C HC  + N   +K+PA     RC +C  
Sbjct  1   MI-VECSHCHRKYNVDDNKIPAAGVKVRCKQCQN  33


>WP_153653054.1 hypothetical protein [Aeromicrobium sp. MF47]QGG41788.1 hypothetical 
protein GEV26_10665 [Aeromicrobium sp. MF47]
Length=253

 Score = 54.4 bits (127),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 32/251 (13%), Positives = 74/251 (29%), Gaps = 9/251 (4%)

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
            + PS                      P+    +       +  LLA              
Sbjct  4    QPPSHGTPYPYAYGPPSYAAPPPFRIPDAFGWSWDMLRVHLKVLLAAMIPTVVVGLVLYA  63

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYIL---LGLSWMTGSMF  179
              +   + +                           +  A +   +   L    +     
Sbjct  64   MYFWRLVPIWIDMAEPTTSSTVQLDRFQDLWATVGLLYAAALVVAIPLSLFHGNLVRMCL  123

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
            +            +    R  G   +   +L++    G++L ++PG+ F  +  F   +L
Sbjct  124  LIADGGTPSYRMILST--RRAGRVMVTTAVLLVAGVLGTILCLLPGIAFAAFSMFTLPLL  181

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
             D ++G   A+  S  LV  H     G  +L   + + +++    + +VG AA++    L
Sbjct  182  LDRDLGTFAAIRGSFALVRAH----LGLCLLTFGLLIAVAYAGTMLCFVGIAASMPLGTL  237

Query  300  LTPFSFLYYYL  310
            +  +++    +
Sbjct  238  ILVYAYRSLQV  248


>WP_157151108.1 hypothetical protein [Brachyspira sp. SAP_772]
Length=256

 Score = 54.4 bits (127),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 21/216 (10%), Positives = 60/216 (28%), Gaps = 6/216 (3%)

Query  103  ISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLA  162
                  +   +        +   +    + F P     +     W   +     +  L  
Sbjct  26   FRHNFINFLLVGFLCAMPTIITTIYFPPVMFDPTKIETVKDLVDWFQNEVNEGFYINLFL  85

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
            +    ++    +   +   I         ++    + +       I+ +++   G    I
Sbjct  86   SWFLDIISAVSIALLVEGLIYDKIRTASYAIVKTFQMIIPILATSIITMIIYFFGLSFFI  145

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
             PG++  + F F   + A  +  G+ A+  S  LV   ++        +++    L    
Sbjct  146  FPGIILMILFMFTTNICALRHTWGIDAIRYSFSLVKPKFFKSLSMLAFIVLFQNVLVITF  205

Query  283  ARIP------YVGEAANLAFSLLLTPFSFLYYYLIY  312
               P       +    ++        +  +   L +
Sbjct  206  PSAPMDTREGVLYYILSMIILYFFDTYFKILISLYF  241


>MBG6121300.1 hypothetical protein [Corynebacterium aquatimens]
Length=317

 Score = 55.2 bits (129),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 26/246 (11%), Positives = 54/246 (22%), Gaps = 11/246 (4%)

Query  261  WWAIFGRFVLLLVISLTLSFLTARI---PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKA  317
            W  I G   L  +       L   +   P++   + L   L+         Y + +    
Sbjct  75   WPRILGAGALSGLWLGLFVGLLFMLFGEPFLSLISALLIGLIFGVVFAAMSYWLTN---G  131

Query  318  NYRGPQHPPIKRQWLPLTAAIFGWMLIP-----GLLLVSLSRQNLSAEQLLSAGKDIQQR  372
                     I      +                  L  + +    S            Q 
Sbjct  132  KRDFSSATAIVAGRYDVLCEPSHAPAARDAIASMGLGTAGAVGMHSQPASEQPQTSETQS  191

Query  373  LGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQ  432
               Q  +        P+                Q +      ++              + 
Sbjct  192  SHAQSFEAQSFEAQQPQAQSFEVQQSQATPAQHQPQPGRSTQVNQASSAQVQSAQAPQNH  251

Query  433  NPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTD  492
             P    + E    P     Q   A  + D   ++    +  R  +  + A   V  N + 
Sbjct  252  QPVQPNQAESYQAPVQRAEQPHYAEPQNDAQPNEGHWSVVQRNDAQPNEAQPNVAPNSSP  311

Query  493  ENDLFS  498
            +++   
Sbjct  312  QSEDNQ  317


>WP_012626308.1 RDD family protein [Cyanothece sp. PCC 7425]ACL43210.1 RDD domain 
containing protein [Cyanothece sp. PCC 7425]
Length=590

 Score = 55.9 bits (131),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 31/199 (16%), Positives = 59/199 (30%), Gaps = 5/199 (3%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
            L     F  +               ++ W +A     V   L  +      +   +    
Sbjct  101  LISRQMFMVLRQQDEPIATARTELYSRTWVFAKSNLWVGLTLFLVYLGLALVGYVLYLGL  160

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL--LFCVWFFFCQYVLADD-N  243
              +   +   +  +     L+   +L V    LLL             FF   VLA +  
Sbjct  161  WPILTLLFEEVGKLEPQEALIWFALLTVLVLILLLGALLAVSYIAARLFFVDVVLALEPE  220

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR--IPYVGEAANLAFSLLLT  301
            +  LQ+L +S  L   +         +  +I    + L     + ++     L  ++ L 
Sbjct  221  VTALQSLNRSWQLTRNYALQTLTVISIAFIIVTPATILANIANLLFIIPVLGLLINVALF  280

Query  302  PFSFLYYYLIYSDLKANYR  320
            PF      ++Y DL     
Sbjct  281  PFWQGIKAVMYYDLCRYNE  299


>MBI3963363.1 hypothetical protein [Candidatus Kerfeldbacteria bacterium]
Length=113

 Score = 51.7 bits (120),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 21/102 (21%), Positives = 41/102 (40%), Gaps = 0/102 (0%)

Query  210  LILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV  269
            + ++V  G  L I+PG++     F    V+  +N   + AL +S  L  G    +F   V
Sbjct  1    MAVLVALGLALFIVPGVMLLFLAFLAPSVVFLENRSPVSALRRSAELTRGIRMRLFLYAV  60

Query  270  LLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
             + ++      + + +PY+G    +   +          YL 
Sbjct  61   GVYLLLTLFLLVMSAVPYLGTFVEMVVVVPFLIIIVYRIYLA  102


>MAQ70459.1 hypothetical protein [Flavobacteriales bacterium]
Length=204

 Score = 53.6 bits (125),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 34/188 (18%), Positives = 67/188 (36%), Gaps = 0/188 (0%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
              R  + +  I            F  + L     +          I        L   + 
Sbjct  1    MSRIEFSISHIIARSWDRFKENPFFWIGLAFLNIIITPPAGLPPIISFPVTLLGLYISAS  60

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +T     Y+    VG+   + +  +   ++ L  I  ++ V  G  LLI+PGL   +   
Sbjct  61   ITLITIKYMRGESVGVRDLISIDFQVFLNYILFTITALVGVFIGLFLLILPGLYLAIRLI  120

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
            F  +++ D  +G  QA+EKS  + + +   I    +++L++     F+      V  A  
Sbjct  121  FAPFLIVDQKMGFDQAIEKSWQMTAQNVGKITMYILIVLLMVFIGFFVFFFGIIVALAVT  180

Query  294  LAFSLLLT  301
               + +L 
Sbjct  181  GLSNAILY  188


>WP_131156190.1 hypothetical protein [Egibacter rhizosphaerae]QBI21197.1 hypothetical 
protein ER308_17560 [Egibacter rhizosphaerae]
Length=357

 Score = 55.2 bits (129),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 37/178 (21%), Positives = 58/178 (33%), Gaps = 8/178 (4%)

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
             V F      +  +  G L AL +S  LV   +W   GR +LL+++   L+ + + + +V
Sbjct  174  YVLFAIVTPAVMLEGRGPLSALGRSSELVRRRYWPTLGRVLLLVLLYWVLTLVLSPLQFV  233

Query  289  GEAANLAFSLLLT--------PFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFG  340
            G     A S ++         PF  +   L+Y DL+A   G           P       
Sbjct  234  GMFTGAAGSAIVMTVTEVLVTPFLPVALTLVYLDLRARTEGTDLAASYGGGPPRPWWEQP  293

Query  341  WMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSAD  398
                             S +Q  +     Q    +  QQ     R  P E    S   
Sbjct  294  GSAGGWGAHGQQGYGPPSGQQGYAPPSGEQGSGSSPGQQGQGWGRRPPGESGGPSGDP  351


>MBI2410862.1 hypothetical protein [Candidatus Kerfeldbacteria bacterium]
Length=258

 Score = 54.4 bits (127),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 32/206 (16%), Positives = 62/206 (30%), Gaps = 7/206 (3%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            + L        L  Y  G +  F      ++      +         ++   T       
Sbjct  43   FVLINAFTLFALTTYASGRMAYFIATMLVIVTIILIQVWAFIALIYISLHHETSTVAESF  102

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF--  228
               +            V     +     ++    + +IL    +   +       L+   
Sbjct  103  HHALHFFWRFVKLGLAVSGIFFLSFLAGYIVVGIIGIILGHFSLSVLNSTFDWLTLIPLG  162

Query  229  -----CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
                   ++ F  + + D + G + AL+ SR LV GH+W    R VLL V+    ++   
Sbjct  163  VSAVVSTYYLFAPFSIIDTHAGTIAALKHSRHLVRGHFWPTAIRVVLLYVVVGMFTYAFQ  222

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYY  309
             +P VG   +L      +       Y
Sbjct  223  YVPAVGSILSLLLVSPFSVIYLSVLY  248


>TQF77868.1 hypothetical protein FK498_10925, partial [Elioraea sp. Yellowstone]
Length=181

 Score = 53.2 bits (124),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 15/36 (42%), Gaps = 0/36 (0%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            MP + CP CGA  +  +  +     S RC  C    
Sbjct  108  MPRIACPSCGARYDVAAGMIGPAGKSVRCARCGHVW  143


>NOZ34535.1 hypothetical protein [Chlorobi bacterium]
Length=301

 Score = 54.8 bits (128),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 29/178 (16%), Positives = 59/178 (33%), Gaps = 1/178 (1%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
             +L     F P+            +    +    I       +L         +      
Sbjct  43   IVLIRWFGFHPVEDIYTQIQNFQYSFGGTSSFMMIFSVFQNVMLYTFIGSYIKVLHKKGY  102

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
             +V +        R    F   LIL  +++  G ++ +IPG+   V  +    ++  + I
Sbjct  103  RNVEINDIWTEIKRFFWPFAGALILGNIIIVIGIIMFVIPGIYIAVALYPLFAIIIFEEI  162

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVIS-LTLSFLTARIPYVGEAANLAFSLLLT  301
            G   +L +S  L+ G+WW   G  +++ +I     + LT    ++        S    
Sbjct  163  GVKNSLPRSFELIKGNWWLSLGLIIIMFIILTTGTAILTLFFNFLYRTIAAVGSAYFV  220


>HBH89043.1 hypothetical protein [Hyphomonadaceae bacterium]
Length=59

 Score = 49.8 bits (115),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 10/40 (25%), Positives = 14/40 (35%), Gaps = 1/40 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M    CP C A+       + A   + RC  C  +    P
Sbjct  1   MIL-TCPSCSAQYFADDKAIGANGRTVRCAACAHSWFAQP  39


>WP_052433283.1 hypothetical protein [Streptacidiphilus carbonis]
Length=439

 Score = 55.5 bits (130),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 21/130 (16%), Positives = 36/130 (28%), Gaps = 20/130 (15%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
                 L      L  C+        L  +  G   A+ +S  LV G WW + G  V++ +
Sbjct  296  TALLLLPGSAVALWLCISMSLAAPALVLERQGIRAAVARSFRLVRGAWWRVLGVTVVVGI  355

Query  274  ISLTLSFLTARIP--------------------YVGEAANLAFSLLLTPFSFLYYYLIYS  313
            ++   S + A                        +          L  P +     L+Y 
Sbjct  356  LTDMASGIIALPFTVVDFAINGSGSDGTAASSLLLAAIGGFIGGALTFPVTAGSSTLLYI  415

Query  314  DLKANYRGPQ  323
            D +       
Sbjct  416  DQRIRREALD  425


>TET33779.1 hypothetical protein E3J61_03705 [Candidatus Dependentiae bacterium]
Length=222

 Score = 54.0 bits (126),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 28/179 (16%), Positives = 58/179 (32%), Gaps = 2/179 (1%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W +       +  I     +       + L     T L   +       +L  +  +   
Sbjct  34   WIIALLPLAIIGLIMNRAFLALLPMAGANLKQLIVTHLLQTSGMIILTTVLIFLLLLSWI  93

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
                           +  +       L  +  F  ++++L +    G +L I+PG+ FC+
Sbjct  94   SVGFVRIGLRIHDTGEANISTLFPSPLLVLKVFIGMIMILCI-AFIGFILFIVPGVYFCI  152

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
              +F  Y L  +  G  +A   S  +  G  W I    ++  +I+    F  + I  + 
Sbjct  153  RAWFYMYAL-VEGEGIFEAFSTSFRITRGKGWQILPLLIMSSIIAAITIFFGSFICVLS  210


>WP_018632467.1 zinc-ribbon domain-containing protein [Neomegalonema perideroedes]
Length=234

 Score = 54.0 bits (126),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 11/38 (29%), Positives = 13/38 (34%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  V CP C A      S  P +    RC  C +    
Sbjct  1   MI-VSCPECRARFMVDDSLFPPRGRRVRCSSCGKAWFQ  37


>MBC7540212.1 hypothetical protein [Bacteriovorax sp.]
Length=239

 Score = 54.0 bits (126),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 31/186 (17%), Positives = 62/186 (33%), Gaps = 4/186 (2%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
              ++    I   +L++  +          +     ++   +     M   +   +     
Sbjct  25   YKMIYLGLIVVNILIRIPSSHLGYKSVNIFYNTFVSIILQVCISVLMANLIITKVALDKN  84

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
                 +           L   L  +    G +LLIIP  L  ++F     +   +   GL
Sbjct  85   KPKADLVNSSLKSIKGQL---LYYVSATIGFVLLIIPAFLALIFFCLTPTLEILEEHKGL  141

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-PYVGEAANLAFSLLLTPFSFL  306
             +L +S  LV      +F   +++LV  + L  + +R+            S L+T F  +
Sbjct  142  SSLRRSHFLVRKDLPLVFILSMIILVFYIVLELVFSRLRSTGPGIVIYIMSALITTFISI  201

Query  307  YYYLIY  312
               LIY
Sbjct  202  IVTLIY  207


>MBI2252145.1 hypothetical protein [Armatimonadetes bacterium]
Length=220

 Score = 53.6 bits (125),  Expect = 3e-05, Method: Composition-based stats.
 Identities = 22/188 (12%), Positives = 65/188 (35%), Gaps = 14/188 (7%)

Query  146  TWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTL  205
             +   +N   Q   +  +    ++G S  T  +     K ++   +++   L +  +   
Sbjct  14   AFNIFKNNLLQSFFMFLSSIIPVIGDSLHTCIVEQIFKKEEISARKALLKALGYFWNLLE  73

Query  206  LLILLILVVGGGSLLLIIPGL---LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
            ++IL  + +     + I   +    + ++      V+  + + G +A+++ R ++  +  
Sbjct  74   VIILFTISMTLWIFVPIYGWIKAVYYSLYSAMISNVVILEGMAGKKAMQRCREIIMRNKE  133

Query  263  AIFGRFVLLLVISLTLSFLTARIPYVGE-----------AANLAFSLLLTPFSFLYYYLI  311
            +        +   L +S +   I ++                +  S +      +   LI
Sbjct  134  SQSLAIGFFVGFPLIISCVFILIGFISSLHNNFNLKLTNIIMIVLSFIFLSVFEIGNSLI  193

Query  312  YSDLKANY  319
            Y +L    
Sbjct  194  YYELIKRE  201


>MTI96510.1 hypothetical protein [Firmicutes bacterium]
Length=255

 Score = 54.4 bits (127),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 32/187 (17%), Positives = 70/187 (37%), Gaps = 10/187 (5%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
            I+          +  ++        +  +    L    +   +   I  +    F    +
Sbjct  44   IWFYGQSAWIPGVPGEDLWATLLDAIVPLVLSGLMFLSLMVIIAKTIDNSYTDWFAVWGV  103

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
                + ++    ++L + + G SLLLI+P ++F V++ FC + +A      L AL  S+ 
Sbjct  104  VWARLPAYIGTSLMLTVFIFGLSLLLIVPAIIFGVFWLFCLHAVALRKRDFLAALSYSKE  163

Query  256  LVSGHWWAIFGRFVLLLV-------ISLTLSFLTARIPYVGEA---ANLAFSLLLTPFSF  305
            +VSG WW + G  VL ++       +   LSF+   +  +            +       
Sbjct  164  IVSGRWWLMLGYIVLTMISAAVIIWLGAALSFIPTLLLSMFAFMQPLLGLIYVFAIYLLM  223

Query  306  LYYYLIY  312
             +  +++
Sbjct  224  SFIAVVW  230


>HHR82506.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=152

 Score = 52.5 bits (122),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 11/48 (23%), Positives = 18/48 (38%), Gaps = 1/48 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQT  48
           M  + CP C  +      K+P K +   CP C  + +    +   T  
Sbjct  1   MI-IECPECKKKYKIDPEKIPEKGAKITCPACSHSFMVRKKKEPETPQ  47


>WP_123193109.1 DUF975 family protein [Paraeggerthella hongkongensis]RNL38253.1 
hypothetical protein DMP08_11975 [Paraeggerthella hongkongensis]
Length=227

 Score = 54.0 bits (126),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 32/191 (17%), Positives = 61/191 (32%), Gaps = 12/191 (6%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
            G   L  V                  +           +  +         +       I
Sbjct  30   GFRYLAHVPFAFEPAPFAFELAFAGPSGFLGIVAGVYGIVVLIIGGAVRQGLCQFNINLI  89

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
             K     F  +     ++G   LL + + L++   SLLLIIPG++    +    Y++A +
Sbjct  90   KKDAPAEFNVLFSKFSNLGKCLLLNLAMWLLILAWSLLLIIPGIIAAYRYAMAPYIMAQN  149

Query  243  -NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
             +IG + A+ +S+ L+ GH   +F   +  +  +            +         L L 
Sbjct  150  PDIGVMDAIGQSKELMRGHKGRLFWLDLTFVGWA-----------LLSVLTFGIGFLWLN  198

Query  302  PFSFLYYYLIY  312
            P+        Y
Sbjct  199  PYMEAARAAFY  209


>HEN14014.1 zinc ribbon domain-containing protein [Schlesneria paludicola]
Length=225

 Score = 53.6 bits (125),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 39/235 (17%), Positives = 73/235 (31%), Gaps = 21/235 (9%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                C HCG   +T   K      +A+CP C + +      +                  
Sbjct  3    IEFSCDHCGKALSTTDDK---AGRTAKCPGCGEAITVPFPSAADN----------DEDSV  49

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                +    +              C +P ++    G        +L+ SW +F +    +
Sbjct  50   AATSTVTCPMCGAENRGDARECESCGEPLKKTVKRGHERIDAGDVLSASWRIFKQEMGMV  109

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQ--------NQNWQWAILLATVAYILLGLSW  173
            +G  L+  V+ FA      +L     +  Q                 L         ++ 
Sbjct  110  IGGVLVAGVINFAISLPQSVLGAIAGIMQQQGEGETALLLQTLSWCFLPIAYLGQWFITC  169

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
                M + I + +   F  +  G +++       IL  L++  G L LIIPG+ +
Sbjct  170  GLQRMLLNIARGESAQFGDLFSGGKYLWRMAGASILFGLMIAVGLLCLIIPGIFW  224


>TAM99469.1 hypothetical protein EPN39_06515, partial [Chitinophagaceae bacterium]
Length=226

 Score = 53.6 bits (125),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 18/145 (12%), Positives = 46/145 (32%), Gaps = 0/145 (0%)

Query  140  LLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH  199
               +           + +          +          +        G+     +  R+
Sbjct  75   FFGQNLFQHYLSWPFFLFIFFTWLGFTSMNVCVGAYMKYYDQHQGEKPGIEEVWNIFKRY  134

Query  200  VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
                 +L I + ++   G  L ++PG+   V F    +VL  ++    + +++   L   
Sbjct  135  YLKVLILSIPVAIITIIGYFLCLLPGIFLSVVFVPFPWVLMMEDASLGEGMQRCFGLTKE  194

Query  260  HWWAIFGRFVLLLVISLTLSFLTAR  284
             +W  FG +++  +I    S +   
Sbjct  195  FFWISFGIYLVAYLIYSFASGIIGV  219


>MBA3615465.1 hypothetical protein [Rubrobacteraceae bacterium]
Length=166

 Score = 52.9 bits (123),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 28/159 (18%), Positives = 54/159 (34%), Gaps = 8/159 (5%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
            A   Y++     +  +          G+   +      V S  +  ++ + V      LL
Sbjct  1    AYYLYLVYAEGIVRKAHRGEQRLGLRGVLDDLIGAAPFVSSVLVAALISLSVTTIAIGLL  60

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI-------  274
            +IPG+     +     V+ ++ IG L A  +S  LV GH+W +F    +   +       
Sbjct  61   VIPGMWLYTRWSLTTPVIREEEIGPLAATRRSNELVRGHFWLVFMTATVAYYLEGVVIHE  120

Query  275  -SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             +L     T    +         + L  P +     L +
Sbjct  121  GALVAGAFTGSHTWGAWVGGSIVATLAMPLAAFATSLAH  159


>NBP71683.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=299

 Score = 54.8 bits (128),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 8/44 (18%), Positives = 13/44 (30%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M    CP C A      +K+     + +C +C            
Sbjct  1   MIL-TCPSCSARYVVDPAKIGPNGRTVKCAKCGHAWAEPAPPPD  43


>PKL18483.1 hypothetical protein CVV49_05870 [Spirochaetae bacterium HGW-Spirochaetae-5]
Length=317

 Score = 54.8 bits (128),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 30/224 (13%), Positives = 68/224 (30%), Gaps = 1/224 (0%)

Query  86   CLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPA  145
                  E              +  S +           ++  G ++     F  + L  A
Sbjct  52   MGFINAETMGFMFDTMKNPADIQKSNDAMLNMFSSNGVLFGAGYIVIILFSFIIIALMQA  111

Query  146  TWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTL  205
                             T   I+ G       +              +   +        
Sbjct  112  YAYEYMILYQNKDYETITYKDIIAGGKKNGLKIAFTNLGIIFIFIAGVFFLVLLSVIAAA  171

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
            +  +L + +    ++ +  G  F + F     +   ++ G  ++L +S  L+ G++W  F
Sbjct  172  ISEILAVFIPIIVMISMFSGFYFMIMFILTPAIRVFESKGFWKSLVRSVKLMYGNFWNTF  231

Query  266  GRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
            G   +L++    L  +   IPY+     +AF+++  P +   Y 
Sbjct  232  GLLFILMIAVSILGIVF-MIPYIAVVGFMAFNVISGPAANPVYM  274


>NBU84785.1 thioredoxin [Sphingomonadaceae bacterium]
Length=83

 Score = 50.5 bits (117),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 10/38 (26%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M    CP C          +P      RC  C  +   
Sbjct  1   MIL-TCPACETRYLIADGAIPPAGRQVRCASCKHSWFQ  37


>MBA3344587.1 zinc-ribbon domain-containing protein [Gemmatimonadales bacterium]
Length=104

 Score = 50.9 bits (118),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 11/42 (26%), Positives = 14/42 (33%), Gaps = 0/42 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           V CP+C        +K+P     ARC  C           Q 
Sbjct  3   VTCPNCATVYRVDPAKVPEAGVRARCGVCSAVFAVHRQGQQP  44


>NNK61736.1 hypothetical protein [Gemmatimonadetes bacterium]
Length=113

 Score = 51.3 bits (119),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 12/45 (27%), Positives = 14/45 (31%), Gaps = 0/45 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRT  46
            TV CP C         K+P      RC  C      D  E +  
Sbjct  3   ITVECPSCQTTFPVDPRKVPDGGVKVRCSVCSGIFFVDKPEIEEP  47


>MBE0592420.1 hypothetical protein [Gemmatimonadales bacterium]
Length=168

 Score = 52.9 bits (123),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 15/147 (10%), Positives = 43/147 (29%), Gaps = 1/147 (1%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
                   ++   V+                    +        +       L    +  +
Sbjct  22   YRHHFWTFVTIAVVCEGVPTILNAYVLLGGGWIAHPMMGLLGFVLAAFGGFLAAGAIVRA  81

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
            +        +    +++  L  +    +      L++  G++ L +PG++    +     
Sbjct  82   IAAAYMGQQLPAADALRFALGRIWPLFVAGTSAYLLIVLGAIALFVPGIILACGYSVVAQ  141

Query  238  VLADDNI-GGLQALEKSRLLVSGHWWA  263
            V+  +++ G   AL +S  L  G+   
Sbjct  142  VVILEDLPGATDALPRSWRLTKGYKGK  168


>WP_021761059.1 hypothetical protein [Desulfovibrio gigas]AGW14099.1 hypothetical 
protein DGI_2346 [Desulfovibrio gigas DSM 1382 = ATCC 19364]
Length=246

 Score = 54.0 bits (126),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 28/171 (16%), Positives = 59/171 (35%), Gaps = 2/171 (1%)

Query  154  NWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV  213
                  L      + L    +T  +        +   ++ +L             L   +
Sbjct  65   WNILIFLCYFFLLVPLQQGIITPLLAAEYLGEPITRSQAFRLAQARFWRLVWARTLRYTL  124

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG--RFVLL  271
            V  G  + ++ G+   V +FF + V+  ++   +++L +S  L     W I       L 
Sbjct  125  VLAGLPVFLVLGIWLYVRYFFVEEVVILEDSSPVRSLGRSGQLSRHAVWMILLTSTVFLT  184

Query  272  LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGP  322
              I+  +   + R P +  +  +A S+LL P       ++Y   +    G 
Sbjct  185  GSIAAFIHLESMRDPMLRMSLEVALSVLLVPAELAVNCVLYFSARCRREGF  235


>VVB74047.1 Uncharacterised protein [uncultured archaeon]
Length=291

 Score = 54.4 bits (127),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 17/196 (9%), Positives = 60/196 (31%), Gaps = 0/196 (0%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                 + +   GI   F  +    + +   +    +    +      +    + ++    
Sbjct  65   FMIVPIALKHYGIKTGFGSVSLKNVTEWVKYTVVNSVVPFFNWQDRRMLMAQIVIAVAGV  124

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             ++       +G   ++      V +   + + ++ +      L+ +  +   + +    
Sbjct  125  LLWFGGFDQQIGALNALDPINPSVQALDGINLTIVALGLLVLGLIWLLQMYNSIRYALLI  184

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
             V   +    + ++ K+  L  G++      F +  + S  + +  + I  V    N   
Sbjct  185  QVRLSEGKDIMDSVGKAWKLTKGNFVTTLLLFAVWSIFSYIVGWAASFIGNVLPFFNGIV  244

Query  297  SLLLTPFSFLYYYLIY  312
               +T      +  +Y
Sbjct  245  FSFMTVTWAYVFTGLY  260


>WP_156825759.1 hypothetical protein [Lewinella cohaerens]
Length=268

 Score = 54.4 bits (127),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 32/197 (16%), Positives = 65/197 (33%), Gaps = 1/197 (1%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
              G  LL + +  ++ A   + +  +++   W   +            +  +L  L    
Sbjct  60   WIGTILLTLIISPVLNAGFYLAAHSVMEGGDWDFKRFWGAIPQAGPLVLNNLLGILITGI  119

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
              + IY     +G     +  L +  +      +          L ++P +   V F + 
Sbjct  120  VILPIYYLFQRIGFMDWYQEVLNNPVNPPEPPQMSSTDSTVF-FLNLVPLIYLQVGFSWA  178

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
              ++       L+ALE SR LV+  W A F   +    + + +S L + +          
Sbjct  179  YPLILFWGANPLEALELSRRLVTKRWGAQFMLLLTFFSLFMLVSILLSPLAIAAPGLANV  238

Query  296  FSLLLTPFSFLYYYLIY  312
             S  L       Y  +Y
Sbjct  239  VSFGLFLILPWVYCSLY  255


>WP_131848433.1 hypothetical protein [Baia soyae]TCP69232.1 hypothetical protein 
EDD57_11129 [Baia soyae]
Length=225

 Score = 53.6 bits (125),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 27/207 (13%), Positives = 71/207 (34%), Gaps = 1/207 (0%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
            +   ++++       L+ I L+  +  +  ++  + +                  +    
Sbjct  1    MGISTFQIIKEHWRKLIMIQLILYLPLYIGLYFVVNIFLIQANLAGLGFLGPIFNMIFTL  60

Query  166  YIL-LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
                +    +   +     +  V     +   +       L  I+  L+V  G +L+I+P
Sbjct  61   IAWGMVQIPLVLLVASEYKEEQVTAGSILIRSIEKTFYVYLFAIIFSLMVILGMMLVILP  120

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            GL+  ++FFF    +      G +A+++S   +  +     G  ++  V+   +  +   
Sbjct  121  GLIAFIFFFFYPQFILLYGQKGWRAMKESARFMQKNLLKSLGYLIIFAVVIAVIEAIALF  180

Query  285  IPYVGEAANLAFSLLLTPFSFLYYYLI  311
            +      A  A  L+ T  +     L+
Sbjct  181  VTLQFSNAVWAVVLVQTLMNMTLSTLL  207


>EEY34820.1 hypothetical protein HMPREF0554_2324 [Leptotrichia goodfellowii 
F0264]
Length=211

 Score = 53.6 bits (125),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 21/111 (19%), Positives = 51/111 (46%), Gaps = 0/111 (0%)

Query  213  VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
            ++    +++II   ++C++F +   +    N+   ++L+ +  L   +   IF   ++  
Sbjct  87   MLIIICVIIIIQVFIYCLFFIYFTPLYLIRNLTFSESLKYNFHLCKSNKARIFFPMLITE  146

Query  273  VISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
            +  L ++ +  RIPY+G   N+  S+    F      +IY +++   R   
Sbjct  147  IPILAVNSIFNRIPYLGSLLNIGLSVFGNIFIAALMTIIYLNVEYMDRKKD  197


>WP_153546430.1 zinc-ribbon domain-containing protein [Epibacterium sp. SM1969]MQY42347.1 
thioredoxin [Epibacterium sp. SM1969]
Length=337

 Score = 54.8 bits (128),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 11/37 (30%), Positives = 19/37 (51%), Gaps = 0/37 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
            + CP+CGA+   P   +PA+    +C  C +T   +
Sbjct  2   RLTCPNCGAQYEVPEDVIPAEGRDVQCSNCGKTWFQE  38


>MBA1148201.1 hypothetical protein [Ectothiorhodospiraceae bacterium WFHF3C12]
Length=293

 Score = 54.4 bits (127),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 22/141 (16%), Positives = 50/141 (35%), Gaps = 0/141 (0%)

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             ++     +   I    V         +  + +  LL+ +  +     + +L +  L   
Sbjct  104  AVAVGFKRLLPMIWTAIVWYLALAGSAIPAIAAAALLMDVSPMAGALATAVLALLPLAVL  163

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
            V+  F   +   D   GL+A+  S  LV G+WW +     ++++I + +S + A +    
Sbjct  164  VYLMFAFVLTVTDRESGLKAVRHSYALVKGNWWRVLLIVTVIMIIYMIISGVFAAVGGFL  223

Query  290  EAANLAFSLLLTPFSFLYYYL  310
                   +  +     L    
Sbjct  224  VGLVAVGNAQIQMMLSLGGTY  244


>NDC14010.1 hypothetical protein [Synechococcaceae bacterium WB9_2_170]
Length=195

 Score = 53.2 bits (124),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 31/185 (17%), Positives = 57/185 (31%), Gaps = 11/185 (6%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            +      A   +  A+L      +                  + L  S    +  +   K
Sbjct  11   WRGFSRNALVLVGFAVLYTIVAAVVVAYGVTHLLWQRPLQVVVHLVGSVALLTGALLAAK  70

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
                 F  +    + V +   L +   + +  G L   IPG+   V ++F  ++L D  +
Sbjct  71   GQTFKFGQLFSETQQVFNLLGLHVFAAIAILLGLLAFGIPGIYLSVAYWFSAFLLIDQRV  130

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
              L A+  +R LV+ HW+ +             L  +   I  VG  A         P +
Sbjct  131  SFLDAMAGARALVTPHWFDV-----------AVLLLVIVAISAVGFLACGVGLFATGPLA  179

Query  305  FLYYY  309
                 
Sbjct  180  ICIGA  184


>WP_048481137.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Clostridium tetani]
Length=287

 Score = 54.4 bits (127),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 19/159 (12%), Positives = 47/159 (30%), Gaps = 26/159 (16%)

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
                  +  F    I    ++     +L++  + F + + F  +++  +N    +AL++S
Sbjct  12   FTSFVKIPDFIEDYIYNDNILIVFQYILMMIMIYFALRWIFSLHIIILENKSATKALKES  71

Query  254  RLLVSGHWWAIFGRFVLLLVISLTLSFLTARI--------------------PYVGEAAN  293
              LV  +      R++   +++  L F+   +                      +G    
Sbjct  72   GQLVKRNLKEFIKRYITFSILNGILYFIILVLWMSLVANISKNISYGSYSGKFILGGIVF  131

Query  294  L------AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPP  326
                     S +  P    +   +Y            P 
Sbjct  132  FHQMGAVLLSFIYIPIQVQFTTRMYYKFAYINENKTIPS  170


>MUL66471.1 hypothetical protein [Mycolicibacterium sp. CBMA 234]
Length=215

 Score = 53.6 bits (125),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 23/197 (12%), Positives = 67/197 (34%), Gaps = 3/197 (2%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                 +L + +   ++                ++ Q  +  + + +  +  ++  ++++ 
Sbjct  1    MAWLAILTVAMAPFIIGTVVFGPESTRDADGHVSQQPVSPTFVVSMILLYAVIFAIAFVM  60

Query  176  ---GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
                            +        R +G+F  + +L+ L+   G+L  I+PG++     
Sbjct  61   SNCLMATQLDVGDAKPVTLGTFFKPRRLGAFLGVSMLIFLMTAVGTLACIVPGIILGFLA  120

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
             +  Y + D  +  + A++ S  LV  +       +++ +       F T      G  A
Sbjct  121  QYAPYFVVDRQMDPVAAIKASFTLVRENVGTTILVYLIGMAAVFVTEFGTVLTCGFGGLA  180

Query  293  NLAFSLLLTPFSFLYYY  309
             +   + +     +  Y
Sbjct  181  LIPAMVSIMGLMHVVTY  197


>WP_180946190.1 zinc-ribbon domain-containing protein, partial [Corallococcus 
exiguus]TNV51413.1 hypothetical protein FH620_38735, partial 
[Corallococcus exiguus]
Length=45

 Score = 49.0 bits (113),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+C  C      P  K+  K    RC +C  T 
Sbjct  1   MI-VQCEQCQTRFKIPDEKVTEKGVKVRCTKCQNTF  35


>WP_193183634.1 hypothetical protein [Nisaea sp. NBU1469]
Length=250

 Score = 54.0 bits (126),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 29/164 (18%), Positives = 57/164 (35%), Gaps = 0/164 (0%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            +       +     + L  V +  P      +                 +          
Sbjct  18   FHGAFEIYFSRFWFFTLVTVASSLPGALFASVALDGDFGGGPTWAIVVTIAINTVVFAAL  77

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
             +    ++ + +C   V + R +  G    G   + ++L+ + +  G LLLIIPGL+   
Sbjct  78   SAVFVYAVVMQMCGKFVPVSRLVLRGFASAGRALITILLMNVFIAIGLLLLIIPGLIMMC  137

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
             +F    V A +  G   A+ +S  L  G+   I G  ++  +I
Sbjct  138  EWFVAVPVSAVERTGPFAAMGRSSDLTEGYRGRILGYILVFWLI  181


>PCI01364.1 hypothetical protein COB76_01515 [Alphaproteobacteria bacterium]
Length=223

 Score = 53.6 bits (125),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 9/38 (24%), Positives = 15/38 (39%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M    CP C  + + P + + A+    RC +C      
Sbjct  1   MIL-TCPQCETKFSVPDNAITAEGRKVRCAKCKHEWHQ  37


>NLT20123.1 zinc-ribbon domain-containing protein [Syntrophomonadaceae bacterium]
Length=738

 Score = 55.5 bits (130),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 63/613 (10%), Positives = 120/613 (20%), Gaps = 37/613 (6%)

Query  4    VRCPHCGAERNTPSSKLPAKK-----SSARCPECCQTLIFDPAESQRTQTTDNIATCPHC  58
            V CP CGA         P+         A C  C   L        +             
Sbjct  9    VNCPKCGASSPDNVRFCPSCGNELITQIAWCLACNAELRPGTRFCGKCGQPVEAGADTRV  68

Query  59   GLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRG  118
            G  +          +  +  R       L            L+S+  ++           
Sbjct  69   GPGKADSIQVSAGGAAIIADRPPLSWGNLLKLIWKNTWQGLLKSLPMMIIIFVISMIVHT  128

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            + L+ +        +                         +  A         +      
Sbjct  129  YMLVFVNEGFGSGTWMGRQLLATQGNVISATILWMLISGMVFQAIGRI----RAVGFNGY  184

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
               +      +   +K   R      L  + L L++ G    +    L   V   F   V
Sbjct  185  VSEVVNLPQEMTAYVKEAGRDGVVLFLAGVGLALIISGVMSDVGNIFLAAGVVGLFASPV  244

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVL----LLVISLTLSFLTARIPYVGEAANL  294
                 +    A        + +   +  +F +    L ++   L F  A I  VG     
Sbjct  245  GRVIALLLQTAWTTIMSQFAPNQTKVIAKFGIVSAYLSIVGSALGFALAAILPVGPLIGF  304

Query  295  AFSLLLTPFSFLY----------YYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLI  344
               ++L   ++              LI +      R          W      +  W   
Sbjct  305  LLLVVLAVMTYNNNKPGTGQTSALILIVAAGIVLLRAKGVLADDGGWQEAGGTLSSWAGS  364

Query  345  PGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLS  404
             G +   L     S    +           +        N     +        Y     
Sbjct  365  EGSIRAVLHGLGPSMGAAVGPALSNSLSGISPNDFATADNSDDEGDDGGSGDGGYDDNTG  424

Query  405  KQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVL  464
                       +    +   D   + D +       +     N                 
Sbjct  425  DWSDEGRPDSAADETSSDADDTGDSQDTSQEGEESSDGEGDDNQYDEDGYDQDGYDRDGY  484

Query  465  DDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELT  524
            D D  +         +   +       D  +     R  +   G   E         E  
Sbjct  485  DRDGFNRNGYDRDGYNRDGYDSSGFDRDGFNSEGLDRDGFYSNGFDKEGYDRNGFDAEGY  544

Query  525  LPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPL  584
                +           +     G        G +       G      + +  +      
Sbjct  545  DRSGLNQEGFD----REGYDKDG----FNSNGLDREGYDRDGFDQQGYDRNGYDRQG---  593

Query  585  REIGFTWQKSGDA  597
                 +W    D 
Sbjct  594  ---YDSWGYDKDG  603


>OGK62482.1 hypothetical protein A3K47_02720 [Candidatus Roizmanbacteria 
bacterium RIFOXYA2_FULL_38_14]OGK63681.1 hypothetical protein 
A3K27_02720 [Candidatus Roizmanbacteria bacterium RIFOXYA1_FULL_37_12]OGK65527.1 
hypothetical protein A3K38_02720 [Candidatus 
Roizmanbacteria bacterium RIFOXYB1_FULL_40_23]OGK68311.1 
hypothetical protein A2334_05600 [Candidatus Roizmanbacteria 
bacterium RIFOXYB2_FULL_38_10]OGK69932.1 hypothetical 
protein A3K21_02725 [Candidatus Roizmanbacteria bacterium RIFOXYC1_FULL_38_14]OGK71799.1 
hypothetical protein A2446_00825 
[Candidatus Roizmanbacteria bacterium RIFOXYC2_FULL_38_9]OGK73674.1 
hypothetical protein A3K52_02720 [Candidatus Roizmanbacteria 
bacterium RIFOXYD1_FULL_38_12]
Length=230

 Score = 53.6 bits (125),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 39/217 (18%), Positives = 75/217 (35%), Gaps = 15/217 (7%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F      L  I L     A       LL   ++           + +L   A I L    
Sbjct  13   FKIVKDNLFVIILYPFSSAVIIAALYLLGISSSLFFISQLTKGISGILLFWALITLLFGL  72

Query  174  MTGSMFIYICK------------TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
            +T  +  Y+                V + +  K  L+  G   + L++   +V  G +  
Sbjct  73   LTYILEHYLMNIYILLLESGRVNQAVPVGQIFKYSLQKTGPVIISLLVYGSLVVVGFIFF  132

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIPG++  + +    Y+   D +   +A  +S+ +  G+   +   F+++  IS  L  L
Sbjct  133  IIPGIILLLQYSQVYYLTLIDGLSIKEAFSESKRMTKGNELRLLFLFIVIGGISYLLGHL  192

Query  282  TARI---PYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
               I     +     L     L   S+  +Y + ++ 
Sbjct  193  LNVIRMPSILISLVQLYAGNYLIIVSYSTFYYLKNNC  229


>MBF0386199.1 hypothetical protein [Candidatus Omnitrophica bacterium]
Length=233

 Score = 53.6 bits (125),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 28/156 (18%), Positives = 59/156 (38%), Gaps = 4/156 (3%)

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
            V +  L    +  S +      +V ++  +       G   L  +  + +V  G LL I+
Sbjct  69   VIFATLANIMVIYSAWRIYKGEEVPVYEVLNKAFLKYGVCLLATLFSLSIVFSGLLLFIL  128

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL--  281
            PG+ F   + F    +  +      +   S  L    +  +    +++ VI L L  L  
Sbjct  129  PGIYFATVYSFVIIFIVIEGSPLFWSFSLSIGLARKAFARVLVFSLIMPVIFLALYSLAK  188

Query  282  --TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
                + P +  +  LA + +LTP   +   +++  +
Sbjct  189  EYFDKYPTLVYSVTLALTAILTPLGVIAQVILFCRM  224


>TNE64350.1 hypothetical protein EP335_07760 [Alphaproteobacteria bacterium]
Length=245

 Score = 54.0 bits (126),  Expect = 4e-05, Method: Composition-based stats.
 Identities = 32/172 (19%), Positives = 68/172 (40%), Gaps = 5/172 (3%)

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
                   ++ T      +  T + +  +++  L++     +  I+L +  G G +LL++P
Sbjct  73   VLWGAVGTFCTTGNLALLDGTPLSVGDTLRQSLKYCLPILVAEIVLTIAAGIGMILLVVP  132

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
              +   +      V+  D  G L AL++S  ++ G++W      +L +V  L L  L   
Sbjct  133  YFIVVGFTMVVMPVIVADRRGPLAALKRSGQMIRGNFWRALALALLYMVPYLVLEELLWY  192

Query  285  I-----PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
                  P+         S      +  +   +Y  L+A + G  H  I++ +
Sbjct  193  QIPDPEPFWYAIGTFGMSATTAVVATAFSASLYQHLQAAWSGDSHAEIEQVF  244


>NNK85474.1 cyclic nucleotide-binding domain-containing protein [Desulfobacterales 
bacterium]
Length=287

 Score = 54.4 bits (127),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 13/69 (19%), Positives = 25/69 (36%), Gaps = 0/69 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           ++CP+C  E     +++P K   +RC +C              ++ DN           +
Sbjct  3   IQCPNCKTEYKIDDARIPDKGVYSRCKKCQTKFFVKKETQASEKSQDNKWVECPECGLTQ  62

Query  64  IPSDRLEIQ  72
            PS   +  
Sbjct  63  APSQTCKYC  71


>HCF16891.1 hypothetical protein [Rhodospirillum rubrum]
Length=53

 Score = 49.4 bits (114),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 9/37 (24%), Positives = 15/37 (41%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  + CP C A+   P   L ++    +C +C     
Sbjct  1   MI-ITCPSCAAKFTLPDGALGSEGRKVKCAKCAHVWQ  36


>KAF0121402.1 hypothetical protein FD151_1246, partial [bacterium]
Length=55

 Score = 49.4 bits (114),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 10/46 (22%), Positives = 17/46 (37%), Gaps = 1/46 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRT  46
           M  V+C  C  +      ++P     ARC +C    I +   +   
Sbjct  3   MI-VQCERCQTKFKLNDERVPKGGGKARCSKCQHIFIIEKPLTPGM  47


>MBI4438882.1 hypothetical protein [Candidatus Woesearchaeota archaeon]
Length=336

 Score = 54.8 bits (128),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 32/199 (16%), Positives = 58/199 (29%), Gaps = 1/199 (1%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
               SW  F         + LL  +  F      L      +    +   +          
Sbjct  105  HNFSWSGFYELVTVQNAVLLLVHIGLFVVAAIYLGSLKFAFSALASTGRKVRWREGFRLS  164

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLI-LLILVVGGGSLLLIIPG  225
                L  +  S+ + +            + L  +   +     + +L V    ++ I+  
Sbjct  165  HSQVLWMVCLSVLLVLVYLSPLFLLGFFVWLVALFFPSGTYFPVTVLSVLMLFVVFIVVY  224

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
                + F F   V+  D+ G   AL  S  +  G     FG  V  L I+   S L A  
Sbjct  225  ASLVLRFLFSFQVMYVDDAGPFAALRNSFEVTRGRKLRAFGVLVAFLAITGGASSLLAAP  284

Query  286  PYVGEAANLAFSLLLTPFS  304
             Y      +  S  +  ++
Sbjct  285  MYQAFFYLIVASSPVAFYA  303


>KAF5178990.1 hypothetical protein FRX31_031418 [Thalictrum thalictroides]
Length=176

 Score = 52.5 bits (122),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 27/131 (21%), Positives = 54/131 (41%), Gaps = 3/131 (2%)

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQA  249
            F  +   L  +   T   I+++ +     LL I   +   + +     +   ++  GLQA
Sbjct  26   FFMLVSPLFLLVILTKKSIVVLTIGTVLILLAICLYMYLALVWMLSIVISVLEDCYGLQA  85

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
              K   L+ G     FG   +L+++      ++     +     +AF++LL  FS + Y 
Sbjct  86   FGKGAQLLKGRKVVGFGLTFVLMILIGVNVLVS---NIMISTIAMAFAVLLKMFSMMVYT  142

Query  310  LIYSDLKANYR  320
            + Y D K ++ 
Sbjct  143  VFYFDCKQSHG  153


>WP_113930654.1 hypothetical protein [Bacillus sp. P14.5]
Length=316

 Score = 54.4 bits (127),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 28/183 (15%), Positives = 62/183 (34%), Gaps = 10/183 (5%)

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG  215
                     + I+ GL      +   +  T  G+F         + S     I+ ++ + 
Sbjct  134  FSRFWPLVGSSIVFGLIVFGLVILPVLIITFSGVFMFGFSESFAIDSLAG-SIMTVIFIL  192

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
               L + +        ++F   V+A +       + KS  L  G  W + G F++L +I+
Sbjct  193  LIFLAVAVGIGYLLTRWYFYLGVVATERAAPG--IGKSWNLTKGQGWKLLGVFIVLFLIT  250

Query  276  LTLS-------FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
              +S               +        +L+ T F  + + +++ DLK          + 
Sbjct  251  GVVSAALEFTLGAFLGNSVLFNLLYNLITLITTIFLSVGFAVMFFDLKIRNDAEDLQDMI  310

Query  329  RQW  331
            + +
Sbjct  311  QDY  313


>WP_034232473.1 hypothetical protein [Arcanobacterium sp. S3PF19]KGF05937.1 hypothetical 
protein HMPREF1631_03750 [Arcanobacterium sp. S3PF19]
Length=350

 Score = 54.8 bits (128),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 20/178 (11%), Positives = 57/178 (32%), Gaps = 19/178 (11%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
            ++L +  +T  +F  +    + +     L +  V S      + + +     + ++    
Sbjct  168  LILRIVALTAVIFFIMLFFLILVAACAALTIGGVFSVFGQSGISVALSVLAGIAVLAAEG  227

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            +  +        L  +      A+ ++ +L       +F   +L  V+   ++ + + + 
Sbjct  228  VLYLRLMTASCALVAEETTVFGAISRAWVLTRKSIRYLFAVMLLSTVLITIVAGVASAVG  287

Query  287  YV-------------------GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
             +                        L  S +L P+  +   LIY +++      Q  
Sbjct  288  VISVASLIDGTNGLGIGGVIASSVVFLLISAILVPYMTVLNNLIYVNMRFRRENFQQQ  345


>TFH27306.1 hypothetical protein E4H00_09665, partial [Myxococcales bacterium]
Length=328

 Score = 54.4 bits (127),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 13/40 (33%), Positives = 19/40 (48%), Gaps = 0/40 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M  V CP C +  +    +LPA     RCP+C ++    P
Sbjct  1   MIKVSCPSCNSSYDVDEHRLPADGLRMRCPKCSESFQVHP  40


>HBC14709.1 thioredoxin [Erythrobacter sp.]
Length=120

 Score = 51.3 bits (119),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 10/39 (26%), Positives = 15/39 (38%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  + CP C      P S +     + RC +C  +   D
Sbjct  1   MI-IACPACSTRYVVPDSAIGIDGRTVRCAKCKHSWFQD  38


>WP_174287002.1 zinc-ribbon domain-containing protein [Sphingomonas bacterium]
Length=240

 Score = 53.6 bits (125),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 9/38 (24%), Positives = 12/38 (32%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M    CP C      P S +     + RC  C  +   
Sbjct  1   MIL-ECPECTTRYLVPDSAIGPSGRTVRCANCRHSWFQ  37


>RJP42402.1 hypothetical protein C4547_00650 [Phycisphaerales bacterium]
Length=386

 Score = 54.8 bits (128),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 38/378 (10%), Positives = 83/378 (22%), Gaps = 18/378 (5%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            MP VRC  CGA      + +      ARC  C Q                          
Sbjct  21   MPKVRC-QCGATYRVDENAV---GRKARCKACGQVFQVAAEPDAGPI------PLAGGID  70

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                 S      ++    R    S       +         + + +          R  G
Sbjct  71   LAEEASAAASRAAEQATTRSPLLSPGEARFEQRDEDRFPGMTTTVMPVADKHTRFLRAVG  130

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               ++          +   LL+                + +         L+ +  +   
Sbjct  131  WALLFPTNTSNMATFVILWLLVSLEILQGFAPCVGVIGVFIIEGWIAAFSLNVVATAANG  190

Query  181  YICKTDVGLFRSMKLGLRH-VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                  + L      G+      +  + +L +       +L+++             Y++
Sbjct  191  EEDLPPITLLEGFFEGIVIPFFKYLAVRVLALAPAVAFFVLVVLSTAQTGPGLSVRDYIV  250

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA----ANLA  295
                 GG+  L        G   A F   +   +    +  L   +   G        L 
Sbjct  251  GAVQSGGVGGLAGVFQ---GAKLATFFLILAGGLFIWPILLLVVAVGGFGALVRIDLMLI  307

Query  296  FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQ  355
              +   P   +   +++         P    + +       + F  +         +   
Sbjct  308  TIVRTLPGYIVCVAVVWGASALAAGLPAIVKLIQGPGAGGISAFYVVGAGLRAYTQIVAM  367

Query  356  NLSAEQLLSAGKDIQQRL  373
             +                
Sbjct  368  RVIGLHYHHYKHRYAWDW  385


>NNF52901.1 hypothetical protein [Acidimicrobiales bacterium]
Length=284

 Score = 54.0 bits (126),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 35/180 (19%), Positives = 69/180 (38%), Gaps = 14/180 (8%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
            I L    +   +  +    D  +   M+  +  +  +    I   L +G G +L+I PG+
Sbjct  96   ISLIGVSVAYLINAWGEGRDPSVQDVMRFTVNRIPIWAATFIAAKLAIGVGLILVI-PGV  154

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
               + F     ++A +  G   +L+++  L+      + G +V  +++ L++SF    IP
Sbjct  155  ALALAFALLSPLIAIEQQGPFASLKRAYSLMRLRTSQMIGLYVGCVIVGLSVSFSLTFIP  214

Query  287  YVGE-------------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
             +                 ++  + LL PF+     L Y DL+    G        Q  P
Sbjct  215  IIAATFVGTDVAWPLISVVSILSTSLLAPFNAAAMTLFYFDLRYRSEGFDLQRRAAQLFP  274


>HAW80520.1 hypothetical protein [Balneola sp.]
Length=202

 Score = 52.9 bits (123),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 23/167 (14%), Positives = 56/167 (34%), Gaps = 0/167 (0%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            +    + ++  ++L             +   +           +  +  ++  +      
Sbjct  36   YFGIPFYVVQAFILQEYSTNLFNSIMAIESGSDNFGGLFGWEYFLNIFISITALSALSVI  95

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
                  +     +  L + M   L  V    LL I++ + +  G+LL IIPG+   +   
Sbjct  96   TLKHFKLTSSGKETELDQIMDGILPLVMWMALLFIIIYICISIGALLFIIPGIFIGIKLS  155

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            F       +      ++ +S  +  G WW  FG  +++ ++    S 
Sbjct  156  FTPAAFILEEKDIFDSMRRSWEVSKGFWWITFGLLIVIYIMIYFSSL  202


>MBI1179550.1 DUF3426 domain-containing protein [Alphaproteobacteria bacterium]
Length=316

 Score = 54.4 bits (127),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 10/62 (16%), Positives = 15/62 (24%), Gaps = 1/62 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP+C      P  K+       RC +C       P   +                
Sbjct  1   MI-ITCPNCSTRYTLPQEKIRLGGQKVRCAKCGHVWHQMPEPEEAALVEPAPPPETDTAP  59

Query  61  QR  62
             
Sbjct  60  AW  61


>WP_198924911.1 zinc-ribbon domain-containing protein, partial [Nitrospirillum 
amazonense]
Length=39

 Score = 48.6 bits (112),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 9/38 (24%), Positives = 11/38 (29%), Gaps = 2/38 (5%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M T  CP C        + L  +    RC  C      
Sbjct  1   MIT-TCPSCATRFRVDDALLG-RGRRVRCSACGSVWHQ  36


>MAH04829.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=295

 Score = 54.0 bits (126),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 10/39 (26%), Positives = 14/39 (36%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  + CP C      P S +       RC  C +T   +
Sbjct  1   MI-IVCPKCSTRYMIPDSSIGDDGRMVRCNSCGETWHQE  38


>OQC25098.1 hypothetical protein BWX68_01720 [Verrucomicrobia bacterium ADurb.Bin063]
Length=314

 Score = 54.4 bits (127),  Expect = 5e-05, Method: Composition-based stats.
 Identities = 29/179 (16%), Positives = 60/179 (34%), Gaps = 6/179 (3%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
            G+    +           LL+          +       A    +L  +  +       +
Sbjct  19   GVLYFLLKSPLTGGVLYFLLRNIRRQPTGISDVFAGFRRAFGQLLLGYVVMIILIYLAML  78

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                + ++      L  +     +    I V   G + LI+P +   V + F   ++ D 
Sbjct  79   PGVALMVWP-----LLTLARQQAVAAGPIFVALLGFICLIVPAIYLSVSWTFLLPLIIDR  133

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
             +    A++ SR +V  HWW +FG  V+  +I+  + FL   +           ++L  
Sbjct  134  QMKFGPAMKASRRMVGKHWWQVFGLVVVCGLIN-VVGFLFCGVGIFLTLPISLGAILYA  191


>PIW37039.1 hypothetical protein COW24_02120 [Candidatus Kerfeldbacteria 
bacterium CG15_BIG_FIL_POST_REV_8_21_14_020_45_12]PJA94040.1 
hypothetical protein CO132_00195 [Candidatus Kerfeldbacteria 
bacterium CG_4_9_14_3_um_filter_45_8]
Length=274

 Score = 54.0 bits (126),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 30/165 (18%), Positives = 64/165 (39%), Gaps = 14/165 (8%)

Query  154  NWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV  213
              Q A+        LL  + +  S+ + +   +    R     LR      +  ++ ++ 
Sbjct  86   WIQHALEALEAISALLITAIVISSVALALQGQEPNFGRIFHKALRSWPMLIIATLISLMG  145

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            +  G   LIIPG++  V++ F    +  +      AL  S  L+ GH++ +  R +LL +
Sbjct  146  LLLGFAALIIPGIVLWVYWTFVGQAVVLEEKYFFSALAYSAKLIKGHFFEVLSRLLLLYL  205

Query  274  ISLTLSFLT--------------ARIPYVGEAANLAFSLLLTPFS  304
              + ++F                A I  + E   +  ++ +T + 
Sbjct  206  AIILITFAIVSSTSNLAVYPGVTALIATISELVGIFATIFITVYF  250


>WP_128145248.1 hypothetical protein [Nocardia africana]
Length=484

 Score = 54.8 bits (128),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 33/336 (10%), Positives = 75/336 (22%), Gaps = 23/336 (7%)

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
             +  +    +    +    V L   +    R       L+++   V      L+ I   L
Sbjct  3    AMVNAVCVITADRAVRGERVRLSEVVGAARRRFWPLCRLMVVFYTVFLVAPWLVEIAAFL  62

Query  228  FC--------------------VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
                                  + F     V+  +  G +++L +S  LV   +  + G 
Sbjct  63   VAGFGTGMAALPFVFIAIYVLGIVFSLAPVVMTLEGTGVVESLSRSAALVKPAFLRVVGL  122

Query  268  FVLLLVISLTLSFLTARIPYVGEAAN---LAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
             +L  V+ +    L+     +           ++ +  +  +   + +   +A       
Sbjct  123  QLLWSVLVVGALMLSGLPFGLISLLVPSDAVVTVFVPFYLTIAVVVAFPLFRAVQTLIYT  182

Query  325  PPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLN  384
                R       + +                 +S    +   +                 
Sbjct  183  DLRLRSGTYGGESDWRSGKDGIGEHPVTGDDGISTSNAIVTLRPYHVCRIVAAFALFQFF  242

Query  385  RSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSD  444
              LP  P  +  A         R      G  +G        + +    P      E   
Sbjct  243  FDLPGIPWLIIVAGCVAGEWFIRSRGLSWGPEIGATLAALRLYPSGTAQPDQATHDEPPQ  302

Query  445  FPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEH  480
                          E          +  +     E 
Sbjct  303  PDRDPANDPKPPAAEAADPPGAPEPEGVNTIRLGEP  338


>PZR11478.1 hypothetical protein DI536_17790 [Archangium gephyra]
Length=294

 Score = 54.0 bits (126),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 34/221 (15%), Positives = 63/221 (29%), Gaps = 4/221 (2%)

Query  69   LEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLG  128
             +   +                        GL      LA +   F      L  + +L 
Sbjct  17   CKRCGRFACASCMPDGSFCAECAPLANDPYGLNRRLDHLAAAQLAFKLILADLPKLLVLV  76

Query  129  IVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG  188
             V +         L P                L    +  +G   M   +        +G
Sbjct  77   FVFSVPAALMQTALVPDGDDLKSISAANRVDNLYNFIFGAIGTQAMLAVLIARSEGRVIG  136

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC-QYVLADDNIGGL  247
            + R++  G  +   F        L +   +L L++PG+   V   F     L   N+  L
Sbjct  137  IGRALSEGAMNWPRFFGARFRSGLWILLFALALLVPGVWQAVMLIFAGTAALRTRNVDPL  196

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
            +A  +   LV G +W + G  ++     + +      +  V
Sbjct  197  EASRR---LVKGRFWPVLGVGLMCGATIVAVIVPLGIVAIV  234


>MAX12684.1 hypothetical protein [Candidatus Marinimicrobia bacterium]
Length=210

 Score = 52.9 bits (123),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 30/197 (15%), Positives = 72/197 (37%), Gaps = 11/197 (6%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
            + + W+      W +       ++     +F  L        +    N Q  ++LA    
Sbjct  1    MIEYWKRNILESWDVFSRNSYLLITPCVLLFLILYYLGINLKSEIISNTQLILILAVSLI  60

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLG-------LRHVGSFTLLLILLILVVGGGSL  219
             +          +  I    +      +         +  +  F  L ILL +    G +
Sbjct  61   AVGIHIGAISITYQSIINKKIAFTDIFQKFHILHIILIPQIAFFAFLFILLNIFPISGYV  120

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
             ++I  +L+ ++ FF  Y++  +N+   +A+ ++  LVS +        +  +++S  + 
Sbjct  121  FVLIAIILYSLFLFFYDYLIVIENLSMKEAILRNYQLVSTYPQ----IVLQFMLVSFLIG  176

Query  280  FLTARIPYVGEAANLAF  296
            F    +P +G   ++ F
Sbjct  177  FCLTILPLIGAVLSMCF  193


>WP_188872400.1 hypothetical protein [Halarchaeum rubridurum]
Length=259

 Score = 53.6 bits (125),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 27/193 (14%), Positives = 64/193 (33%), Gaps = 10/193 (5%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
              ++L +++          L+                 LA+    L+    +     + I
Sbjct  27   AFFVLQLLVTLFSTNMNAALQSTVSPAEVAPVVLPVGALASGVLSLVAYVCIAYLGIVAI  86

Query  183  CKTDVGLFRSMKLGL------RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
                  +   +            +  + +  I++ ++V  G +LL+IPG+   +   F  
Sbjct  87   RSMVSDVTDHIPGEFFSRRVGPALVWWFVASIVVGILVTIGFVLLVIPGIYLSLGLAFTL  146

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI----PYVGEAA  292
              +A ++     A++ S  L +GH W +   F++  V+ L +  +   +       G   
Sbjct  147  VYVAVEDETAFTAMQSSWDLAAGHRWRLLATFLVPAVVYLVVVGILGVVLPSTTVAGWVV  206

Query  293  NLAFSLLLTPFSF  305
                  +   F  
Sbjct  207  TGLVGAVWGVFVQ  219


>MBI2792287.1 hypothetical protein [Gammaproteobacteria bacterium]
Length=247

 Score = 53.6 bits (125),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 43/193 (22%), Positives = 75/193 (39%), Gaps = 1/193 (1%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
                R+ + L   Y L   L      +   +         + +    I++         L
Sbjct  19   FQLYRQVFSLSFPYTLCATLLMFIPHTLSTIDMLNKGLLGHSDLALWIMVFCWLGGFTFL  78

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
              +   ++ Y  +       S+K  L  + S  LL  L  L+V  G++LLIIPG++F + 
Sbjct  79   CALIFRIYCYCYQIPSHFIGSIKHALFKLISVLLLSTLYCLIVLSGTMLLIIPGMIFMIT  138

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
              F   ++  DN   LQ L  S  LV GHWW +     +  +++LTL+     +  +   
Sbjct  139  LMFSFILVIIDNQNVLQTLTTSHRLVWGHWWHVVAVMAVPFLLNLTLTLCI-MLSIIFLL  197

Query  292  ANLAFSLLLTPFS  304
             N    +    F+
Sbjct  198  TNQGLKIAEITFA  210


>PYV21798.1 hypothetical protein DMG24_18580 [Acidobacteria bacterium]
Length=491

 Score = 54.8 bits (128),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 26/221 (12%), Positives = 67/221 (30%), Gaps = 12/221 (5%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI-LLGLSWMTGSM  178
             ++ I             +  L +     +    +   ++  +    + LL ++ +    
Sbjct  258  CIMVIAWGAAYAVALGATTFALSRLYLNQDTTIGSAYRSMRESVWRLVKLLAVTSLWVLS  317

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
              +     + L+  ++     +    + L +L  ++    L + +   L  + +      
Sbjct  318  PFFALGIMIVLWHFLRGLHPTLSVLAVPLAILRALLVLFLLAVPLTIWLTLLRYGVAVPS  377

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-----------Y  287
            L  +N+G  +AL++S +L   +   +    +L+ +I       T  I             
Sbjct  378  LLLENLGVRRALKRSAVLTKSYKGRLVLIGLLMTLIVAMTELATGVILVSNGQISAWKVV  437

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
                       L  P   +   L Y D +    G     + 
Sbjct  438  ARLLTGSVSGALTGPLVAISLALTYYDARVRKEGFDLQAMM  478


>WP_025079669.1 hypothetical protein [Porphyromonas macacae]
Length=176

 Score = 52.5 bits (122),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 32/140 (23%), Positives = 53/140 (38%), Gaps = 3/140 (2%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLF--RSMKLGLRHVGSFTLLLILLILVVGGGSL  219
              +   L+  +                    +      R    + +L I++ L+V  G  
Sbjct  8    VYLFACLVMQTGFCRIALHLAAGGSFSFSESKKFFWAPRLYIPYFVLSIIVALLVSVGLA  67

Query  220  LLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            LLIIPG+   + F+   YV  D+   G  + LE+   +  GH+  I G F+L +V SL  
Sbjct  68   LLIIPGIYLAIKFYLMPYVYFDESERGIFEVLERLWKISKGHFPGILGFFLLAIVASLLG  127

Query  279  SFLTARIPYVGEAANLAFSL  298
              L     +V     +  S 
Sbjct  128  FLLLGVGIFVTMPLYMILSA  147


>MBJ02401.1 hypothetical protein [Planctomycetes bacterium]
Length=264

 Score = 53.6 bits (125),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 29/233 (12%), Positives = 67/233 (29%), Gaps = 22/233 (9%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
                 + L +       +           +         + +     I L ++     + 
Sbjct  24   CFGVSFALWLPARLLRTWYNTEFGLGPPDDVLGILAFSGVGMLMPILIPLIVAAALAHVT  83

Query  180  IYICKTDVGLFRSMKLGLR--HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
                +       +    LR        L++ L+ ++ G G+L  I+PG+           
Sbjct  84   YATIQGQGARRATTFAVLRPVLFLRLMLVIFLVFVLTGIGTLACILPGVYLAWRLSTATT  143

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-----------  286
             L  + +  ++++++S  L  G +    G F +   + L  S   A +            
Sbjct  144  ALVIEGLAPMESVKRSFALTRGTFLRWLGLFFVQQCLLLPFSGPVAVLDDPMLRSEVLSQ  203

Query  287  ---------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
                      +    +  F  L + F+ L   + Y D +    G        +
Sbjct  204  VALSPPALEVLSLCLSTFFLSLASAFAGLVMAVYYLDCRVRADGFDLDMALER  256


>RKX70378.1 hypothetical protein DRP53_05190 [candidate division WOR-3 bacterium]
Length=131

 Score = 51.3 bits (119),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 11/37 (30%), Positives = 16/37 (43%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V C  C  + N   SK+P +    RC +C   + 
Sbjct  1   MI-VVCEKCQKKYNVDESKIPEQGIKVRCAQCGNIIF  36


>TSA50649.1 zinc-ribbon domain-containing protein [archaeon]
Length=297

 Score = 54.0 bits (126),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 29/304 (10%), Positives = 74/304 (24%), Gaps = 14/304 (5%)

Query  6    CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
            CP CG         +             +                 +       +     
Sbjct  4    CPRCGL--EVSDEAI-HCPRCGTEIHYSKDAAEPERRGAVEHLKYAVNLARDKPMVFSPS  60

Query  66   SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIY  125
               L I   T         +    +       S           +   F    +      
Sbjct  61   IVELLIAVVTQRILESWAVYNDFIDALMDYMYSQTGVSPVSYVSTGFEFDYTRFISWVPA  120

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
             L  +   + + S   ++ +       +             IL  +         ++  +
Sbjct  121  ALIGLSLISWVASLASIRASWRAVRWEEPMFQESFSYVGRRILRFVYASILMTAFFVLAS  180

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG  245
             V +   +        S  L+  +++L +   ++L     ++              ++ G
Sbjct  181  GVAVTPLIFSESVGFCSAFLIFFVVMLGLFAVTILAAPTFIVMIG-----------EDEG  229

Query  246  GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSF  305
             + +L K+     G +W   G  +LL ++   L+ +     Y+        ++ +     
Sbjct  230  FMPSLRKTVRFTRGVFWTYVGLGILLALMFFGLNLVPYVGYYLTFIPGAIGNIAIIDLYN  289

Query  306  LYYY  309
             Y  
Sbjct  290  QYKA  293


>OYW46715.1 hypothetical protein B7Z36_04565 [Novosphingobium sp. 12-63-9]
Length=222

 Score = 53.2 bits (124),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 19/126 (15%), Positives = 46/126 (37%), Gaps = 2/126 (2%)

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
                +   ++   +  ++ + +    +S++   R   S     IL  L    G +LL++P
Sbjct  53   IVGSIATFFVQYLVAEHVLRAEGMFDQSLRG--RRYASVFGASILTTLGGLAGMILLVLP  110

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            G      +     ++  +      A+ +S        W IF  +++ L+  L ++     
Sbjct  111  GFYVLARWSLTVPIIIAEGKTATDAIGESWRRTETSVWPIFAIYLVCLLGVLAMAAAIGG  170

Query  285  IPYVGE  290
            +  V  
Sbjct  171  LTAVAT  176


>HHR99629.1 hypothetical protein [Acidobacteria bacterium]
Length=217

 Score = 53.2 bits (124),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 23/142 (16%), Positives = 51/142 (36%), Gaps = 11/142 (8%)

Query  157  WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH----VGSFTLLLILLIL  212
            +   LA    I+   + +  +  I +   +  +   + +             L   + ++
Sbjct  76   FFFYLANYFVIVFFNTALVSAASIRLEGGNPTVRDGLHIAWSRVGVIFQWAVLAATVGMV  135

Query  213  -------VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
                       G L+  + G+ + +  FF   VLA +N+G ++AL++S  L   +W    
Sbjct  136  LRMIEDRSSLIGRLVASLVGIAWTLATFFVVPVLAFENLGPIEALKRSAELFRRNWGEEV  195

Query  266  GRFVLLLVISLTLSFLTARIPY  287
                   +I   L+     +P 
Sbjct  196  VGTFSFGLIFFLLAVPGVLLPV  217


>KKU73052.1 hypothetical protein UX98_C0012G0012 [Parcubacteria group bacterium 
GW2011_GWA2_47_26]
Length=235

 Score = 53.2 bits (124),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 32/148 (22%), Positives = 60/148 (41%), Gaps = 2/148 (1%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             ++         +  + + Y     + +  +++  L+ + SF  LLI+  +VV  G   L
Sbjct  58   FSILVAAWTELALVRAFYNYSSGATLNIAATLREALQKLPSFIFLLIIWFVVVLVGLAFL  117

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIPG++F  WF F    L  ++  GL    +S+ LV+ +++ +F R V   ++   +  L
Sbjct  118  IIPGIIFATWFMFLSSALVIEDQRGLAVFRRSKQLVANNFFGLFWRAVASALLISVI--L  175

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYY  309
               I                 F F    
Sbjct  176  ITAIQGFVAILQATGLRAFNQFLFDVIT  203


>TXD43503.1 hypothetical protein FRC96_01695, partial [Bradymonadales bacterium 
TMQ2]
Length=206

 Score = 52.9 bits (123),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 10/63 (16%), Positives = 19/63 (30%), Gaps = 1/63 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V+CP C +      + +P      +CP C    +  P      +   +  +      
Sbjct  1   MI-VQCPSCSSRYRVNDANIPPSGGKIKCPSCAHAFVVYPEAPAEPEHEADKTSIAERPN  59

Query  61  QRR  63
              
Sbjct  60  IHE  62


>MBC8791299.1 hypothetical protein [Tagaea sp. CACIAM 22H2]
Length=410

 Score = 54.4 bits (127),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            M  V CP C +    P   L A     RC  C  + 
Sbjct  90   MI-VACPECSSRFRVPEDALGAGGRYVRCGNCGHSW  124


>WP_066156678.1 hypothetical protein [Alkalihalobacillus krulwichiae]ARK32003.1 
hypothetical protein BkAM31D_20330 [Alkalihalobacillus krulwichiae]
Length=215

 Score = 52.9 bits (123),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 29/193 (15%), Positives = 65/193 (34%), Gaps = 3/193 (2%)

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
              A   + +  ++   ++++  + +    I        L   S     +   +    + L
Sbjct  15   YYAICSLLAVFVMFLISYVHIISTSLLTEITFINNRVGLFVSSLFYLMISTSLYGLTIML  74

Query  190  FRSMKLGLRHVGSFTL---LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            ++ M          T+     +L  L V  G    +IPG++  + F F  +V+  D    
Sbjct  75   YQKMNGVFISGMYITVLLTTSMLYALFVSIGFFFFLIPGVILMIVFAFYPFVVIKDGKSN  134

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFL  306
             QAL +S  +V   W+ +        ++ + L+     +P++ E      +  L      
Sbjct  135  GQALRESASIVKTVWFRLASLTFFFYLLYILLATCFVYVPFIDEMLAQVTAASLLVPFEA  194

Query  307  YYYLIYSDLKANY  319
            + Y          
Sbjct  195  FAYYFMYQKGKMK  207


>WP_119895037.1 hypothetical protein [Pseudomonas sp. K2W31S-8]AYC34384.1 hypothetical 
protein D3880_19320 [Pseudomonas sp. K2W31S-8]
Length=218

 Score = 52.9 bits (123),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 33/191 (17%), Positives = 65/191 (34%), Gaps = 7/191 (4%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
            L    +    L +         +      LL  + +  L  + +   +        V   
Sbjct  24   LCLPLVVLESLAQRLLASATGMEASPAYGLLVGLLFYPLYTAALILFLDARSRGLAVHHR  83

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
              + + L+    F +L  L  L++  G+ L I+PGL   +   F +Y+L  D +  L AL
Sbjct  84   DLLAMALQLWPRFAVLTALSSLLIMLGASLFILPGLWVLIRLAFAEYLLVLDGLPPLAAL  143

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTL---SFLT----ARIPYVGEAANLAFSLLLTPF  303
             +S  L    +W +    + ++     L   S         + +V +       L  +  
Sbjct  144  RESFRLTGEPFWRVLVCVLSVMAPLWLLDAWSLPLLASDGPLAFVLDCLRGFLQLFASVV  203

Query  304  SFLYYYLIYSD  314
             F  + L+   
Sbjct  204  VFRLFMLVSPR  214


>PTC38616.1 hypothetical protein CLJ1_0894 [Pseudomonas aeruginosa]
Length=291

 Score = 54.0 bits (126),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 34/149 (23%), Positives = 57/149 (38%), Gaps = 0/149 (0%)

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
             L    IF   L++         Q      L+A + +  L  + +   M          +
Sbjct  96   PLCLPWIFLESLVQQQIDEAVGPQQMGAWSLVAGLLFYPLYTAALILFMDARGRDERPRI  155

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQA  249
             +     LR    F LL  L  L++  G  LLI+PG+   V   F ++ L       LQA
Sbjct  156  GQLWSAALRLWPGFALLAALTSLLIVLGLSLLILPGIFVMVKLAFAEFCLVLRGRSPLQA  215

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            L +S     G ++ I    +++L+   +L
Sbjct  216  LRESFEFTRGRFFVILACSLVILLPVWSL  244


>NIM07516.1 hypothetical protein [Armatimonadetes bacterium]NIO98997.1 hypothetical 
protein [Armatimonadetes bacterium]
Length=116

 Score = 50.9 bits (118),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 11/77 (14%), Positives = 19/77 (25%), Gaps = 1/77 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M   +CP C A      +K+    +  +C +C  T        Q+ Q             
Sbjct  1   MIA-QCPTCEARYRVDDAKVGPGGTKFKCKKCRNTFTVFREVQQQEQQDPVRRAPTTRNC  59

Query  61  QRRIPSDRLEIQSKTVN  77
                    +       
Sbjct  60  PHCGKPIPFQAVKCRYC  76


>OPZ67706.1 hypothetical protein BWY81_01183 [Firmicutes bacterium ADurb.Bin467]
Length=309

 Score = 54.0 bits (126),  Expect = 6e-05, Method: Composition-based stats.
 Identities = 24/167 (14%), Positives = 59/167 (35%), Gaps = 14/167 (8%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +      L          +          +   LR  G    L IL         LL 
Sbjct  133  LVLLLFGPALRLGLYESISSLYSGGHPRASQLFSKLRFFGKALWLGILEAFFTFLWMLLF  192

Query  222  IIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLL---VISLT  277
            I+PG++    +    Y+L  +  +  + AL +S+ +++G+   +F  +   +   +++  
Sbjct  193  IVPGIIASFRYAMAFYILWKNPEMRAIDALRESKRMMNGNKGRLFCLYFSYIGWELLAAV  252

Query  278  LSFLTARIPY----------VGEAANLAFSLLLTPFSFLYYYLIYSD  314
             SF    +P+          +     +A  + ++ + ++  +  + D
Sbjct  253  PSFALILLPFAILPELSLHALAWVLTIAGGMFVSSYVYVGEFEFFKD  299


>TDI58471.1 hypothetical protein E2O92_09730 [Alphaproteobacteria bacterium]
Length=255

 Score = 53.6 bits (125),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 8/37 (22%), Positives = 12/37 (32%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  + CP+C         ++P      RC  C     
Sbjct  1   MI-ITCPNCSTRFMLEDDQMPDAGRKVRCARCAHVWH  36


>PYT06496.1 hypothetical protein DMF60_09230 [Acidobacteria bacterium]
Length=425

 Score = 54.4 bits (127),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 31/295 (11%), Positives = 74/295 (25%), Gaps = 21/295 (7%)

Query  83   RSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLL  142
              +         AS S    I+ +   +        W  + +  +  +         L  
Sbjct  47   MFWYGYTSLLTSASSSRGMPITAIWMLALGGLGYPIWMFVLLLTVSGLSRVVGDHLMLGT  106

Query  143  KPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGS  202
                         +   +      +++ L      +   I    + +   +         
Sbjct  107  PITFRGCFAAVRRRIGAITLMGLLMVVLLFAAYIVVAFVIFAIFLLVALIVGAVAAAQLP  166

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
              +  + L + V     L ++   +      F   ++  +      AL ++  L  G+W+
Sbjct  167  QWVATVTLTISVIVAVALGLLLICVVASRVVFLPQIVMIEGESAGNALGRAMRLGKGNWY  226

Query  263  AIFGRFVLLLVISLTLSFLTARIPYVGEAANL---------------------AFSLLLT  301
             +    V    +SL+L        + G   +                         LL  
Sbjct  227  RVAAIVVFTYFVSLSLLAAITLPVFFGLYMSGMLTTEFFLSPAWNILYTSFRDVTGLLSL  286

Query  302  PFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQN  356
            P   + + L+Y D +          + R+  P          +P    +      
Sbjct  287  PIWIVSFTLLYFDSRVRKEAYDVDLLAREINPGFYWQPALQPVPPGYQMPGQSGP  341


>MBD3241668.1 hypothetical protein [Chitinivibrionales bacterium]
Length=221

 Score = 52.9 bits (123),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 27/166 (16%), Positives = 59/166 (36%), Gaps = 0/166 (0%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +                 +     L          + +       +G S+   +  I 
Sbjct  15   FSLESAKRHWLKFFGIMIGGILGLALLATIGFKINENLGILLAMLGAIGFSFGLFANVIR  74

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +        ++         +F + +ILL++ +  G +LLIIPG++  + F    +++ D
Sbjct  75   LASNQGFSIKAFIPEPMVFLNFLVGMILLVVAIMIGLILLIIPGIIVALMFSLVPFLIVD  134

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
              +  +QA  +S  L  GH   IF   ++  ++   LS     + +
Sbjct  135  KKMSFIQAFSESARLTKGHKMDIFIGGLVTNLVISLLSIPVITLFF  180


>WP_191430587.1 DUF975 family protein, partial [Lachnoclostridium sp. An196]
Length=141

 Score = 51.3 bits (119),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 19/124 (15%), Positives = 42/124 (34%), Gaps = 1/124 (1%)

Query  224  PGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            PG++    +    Y++A+   +    A+ +S+ ++ G+ W +F   +  +  SL    + 
Sbjct  2    PGIVATYSYAMVPYIMAEHPELRARDAIRESKNMMKGNRWRLFCLELSFIGWSLLAVLVF  61

Query  283  ARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWM  342
              +   G         ++    F    L  S      R   +  +  Q          W 
Sbjct  62   TVLFVAGVLVGSVGIAMIGLLIFFVGLLFLSPYIEASRAAFYRELTEQRYSNPQPEAQWR  121

Query  343  LIPG  346
             +P 
Sbjct  122  EVPQ  125


>WP_102153783.1 MULTISPECIES: glycerophosphoryl diester phosphodiesterase membrane 
domain-containing protein [Erythrobacteraceae]ROT96524.1 
hypothetical protein EB810_00735 [Altererythrobacter sp. 
FM1]
Length=275

 Score = 53.6 bits (125),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 25/199 (13%), Positives = 57/199 (29%), Gaps = 20/199 (10%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                      LL          S                   +I    +   ++    M 
Sbjct  30   FFFLPYFAFALLMGNRMTEVEASMASNGDPEAAMQAMTALYGSIWWVIILVTIVQGIGML  89

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII------------  223
            G + +   +    +  ++ +G +    + +  +L+   +G   ++ I             
Sbjct  90   GLLALLTDRRRPTVGEALAIGAKLFVPYLVAQLLVGFAMGLLMIVPIAIGAAGSVAAAVI  149

Query  224  -------PGLLFCVWFFFCQYVLADDNIG-GLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
                     +   V F     V+A + +   + AL +S  L  G+   IF   +LL++  
Sbjct  150  VGLALVVLAIYVFVKFVMVAPVIAIERVSNPIAALRRSWRLTKGNSLRIFLFLMLLMLAI  209

Query  276  LTLSFLTARIPYVGEAANL  294
              +  +   I  +  A   
Sbjct  210  AVVGSVIGLIVGLILAIGG  228


>MSR31538.1 hypothetical protein [Gemmataceae bacterium]
Length=343

 Score = 54.0 bits (126),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 40/326 (12%), Positives = 77/326 (24%), Gaps = 25/326 (8%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              V C  CG +       L       +CP C          +    +   + +       
Sbjct  3    VLVTCQSCGKKLKVNDEIL---GKRVKCPGCAGIFTAVADGAASAPSIPPMPSPKATRAM  59

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
              + +   +  ++  +          +     + +    R   Q     + L     +  
Sbjct  60   ASLSAMAGKSNAEEEDKGEEREETGEERSVNKKVAAKETRLGWQATRTGFNLLLIASYLY  119

Query  122  LGIYLLGIVLAFAPIFSALLLK---PATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            L   +L I           LLK   P +            I +       +  S   G  
Sbjct  120  LSGIILQICSMLLVRLLGFLLKPGEPPSPTIIYTIITLLIICVILGLAAFIVHSVGLGFC  179

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
                 K          L            IL   +       ++I    F +      Y+
Sbjct  180  LYVPKKEGYSTKTLALLTFIFWCIGIGFYILGFPLTLVCIGFILIIIAPFLLLAAHVLYL  239

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL-----------------  281
                ++G L   +      +G  +A  G  + + +      FL                 
Sbjct  240  FFLRSVGLLMKRKDLAGSATGFMFANLGYVLSMFIFGGVFFFLVESAPRGGGRQAVIAWV  299

Query  282  --TARIPYVGEAANLAFSLLLTPFSF  305
                    +G  A   F+L L  +  
Sbjct  300  QTLGIGVLIGVVALGIFALGLLIWYM  325


>WP_173081207.1 hypothetical protein [Phytohabitans rumicis]GFJ94228.1 hypothetical 
protein Prum_078700 [Phytohabitans rumicis]
Length=251

 Score = 53.2 bits (124),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 37/227 (16%), Positives = 64/227 (28%), Gaps = 13/227 (6%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIF---SALLLKPATWLNPQNQNWQWAILLA  162
            +    W LF       + I L+   +  A       A +                   L 
Sbjct  11   MYLTRWPLFIGIAAVFIPIALVIAAVEAAVFGSASIAGIDTSGESGGVFASLAVAVAALL  70

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL--L  220
            T+  + L  +    ++        +G  R+ ++ L    +    L + +  V    L   
Sbjct  71   TLGGVGLVQAATARALAEVDAGRPIGPLRAYRMALGRFPALLGALAIAVGAVVLLGLSVA  130

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            L+   +     +      +  +    L AL +S  LV   W+      V    I+L L  
Sbjct  131  LLPVAIWLAGRWALLAQAVELEGESVLGALRRSGRLVRRRWFKTTSLVVAGAAIALVLGP  190

Query  281  LTARIPYVGE--------AANLAFSLLLTPFSFLYYYLIYSDLKANY  319
            +   I   G                LL  PF  L    +Y D     
Sbjct  191  VLGMILIFGTDAPFTLVNVVAGLVYLLAMPFVALTTAYVYHDAVVRE  237


>WP_169701313.1 zinc-ribbon domain-containing protein [Planktomarina temperata]
Length=220

 Score = 52.9 bits (123),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 16/172 (9%), Positives = 46/172 (27%), Gaps = 0/172 (0%)

Query  6    CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
            CP C AE   P   +PA+    +C  C +T           +TT +         + +  
Sbjct  5    CPKCDAEYEIPDDVIPAEGRDVQCSGCQETWFVPANTPPPERTTIDPKVSSILQQEVQRE  64

Query  66   SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIY  125
             +  + + +  +     ++   +         +           +           + I 
Sbjct  65   MEARKAEKRMAHETEPEQAPATRGLDRPPPPVTPPIEPRPQPMAAPTSKNLPPIDTVKIS  124

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
             +      A   +  ++           + +    +A    + +  +    +
Sbjct  125  SVVSATDDAAPLTKPVMPSKAPEEVAPLDSRQRGTVAAFLILAVLTAIYIFA  176


>MBI2825313.1 hypothetical protein [Planctomycetia bacterium]
Length=433

 Score = 54.4 bits (127),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 33/316 (10%), Positives = 72/316 (23%), Gaps = 17/316 (5%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              V CP CG + +   + +       RCP+C + +       ++           +    
Sbjct  79   VAVTCPTCGTKLHPRVALV---GKRVRCPDCRRPVTVPEPREEKPIKAPPRPAGAYGIGA  135

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
               P    E   + ++           P   F     G       ++    L     +  
Sbjct  136  APEPIAMPETVLQDLDRTITAPPPEPAPRLWFVTGVFGFPWYPGTISRWMILTLFLLFAN  195

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +   G  +      +      AT            I+       + G           
Sbjct  196  TIVVFGGKAIMGMVAGAGAGDAYATAFKVALSVALGMIVWTLTMAYIAGCVVTVIRDTAA  255

Query  182  ICKTDVGLFR------SMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
                             ++             +   +              +  V     
Sbjct  256  GNNDIEDWSDTEITEGFLRSIYLWFPLVAAGAVCYGIHYVTAMFAPEWAVAVSSVALLVL  315

Query  236  QYVLADDNIGGLQALE----KSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR----IPY  287
              +     +    AL      +  L+SG WW     +V + +++   + L+      +PY
Sbjct  316  FPIFFVSALEHGSALVLVSPTALRLISGFWWGWLLLYVEVGLVTGGWAGLSWLGLDRLPY  375

Query  288  VGEAANLAFSLLLTPF  303
            V           +   
Sbjct  376  VTALLGAPVLAAVIFI  391


>VVB07961.1 unnamed protein product [Arabis nemorensis]
Length=897

 Score = 54.8 bits (128),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 30/214 (14%), Positives = 63/214 (29%), Gaps = 21/214 (10%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
                   L      I + F   FS L      +        +     +T++ I L L  +
Sbjct  70   QTNHEWTLLFVYQFIYVIFLFAFSLLSTAAVVFTVASLYTGKPVSFSSTMSAIPLVLKRL  129

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLL-ILLILVVGGGSLLLIIPGLLFCVWFF  233
              +         V     +   +  + +  L   IL +  +    ++ +   +    W+ 
Sbjct  130  FITFLWVSLLMLVYNSIFLLFLVVLIIAIDLQSVILAVFSMVVIFVMFLGVHVYMTAWWH  189

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGH----WWAIFGRFVLLLVISLTLSFLT-------  282
                V   + I G+ A++KS  L+ G        +F    L   I+     +        
Sbjct  190  LASVVSVLEPIYGIAAMKKSYELLKGRTNMACSMVFMYLALCGFIAGVFGSVVVRGGDDF  249

Query  283  ---------ARIPYVGEAANLAFSLLLTPFSFLY  307
                       +  +    NL   L+ + F ++ 
Sbjct  250  GLFTKIVVGGFLVGILVIVNLVGLLVQSVFYYVC  283


>MBA2298610.1 hypothetical protein [Actinobacteria bacterium]
Length=177

 Score = 52.1 bits (121),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 28/125 (22%), Positives = 53/125 (42%), Gaps = 0/125 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
            A+V   L+G SW+  ++   + +            +  + +  L  +   L    G LLL
Sbjct  12   ASVLIGLIGYSWVYAALIATLARRTRSPLEPYGRTVDRLPALALANLTAGLATVLGLLLL  71

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I+PGLL    +     ++  ++ G  +ALE S  L+ G  W +    + +++ S  L+  
Sbjct  72   IVPGLLLAARWSTAGPLIVLEHKGPFEALETSNGLIRGRTWPVVRAGLAVVLFSAVLALP  131

Query  282  TARIP  286
               I 
Sbjct  132  GGVIA  136


>NND02970.1 DUF2510 domain-containing protein [Acidimicrobiia bacterium]
Length=325

 Score = 54.0 bits (126),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 36/266 (14%), Positives = 66/266 (25%), Gaps = 21/266 (8%)

Query  48   TTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLL  107
               +  T P               +           S      R          S+ +  
Sbjct  7    NWYDDPTDPTMERWWDGEKWTEHRRPAVPAYAGGAASHQAGELRPVGDMIGHAFSLIRAR  66

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI  167
                          + I     +LA   +            N         + +      
Sbjct  67   LGGIIGVGVIAAVAIAIAYGVFILAAFGLAFETNAGELFETNTDAIVLLVFLFVVAGVLS  126

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL-------  220
            +      T  ++            +   G R +  F   +IL  L +    L        
Sbjct  127  VASFLATTTLLWDAAVGRKRSWGAAFGNGFRRMFPFLGWIILGSLPIYALILFAFAIGGG  186

Query  221  ------------LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
                         +IP + + V  +F    +  +   G  AL  S  LV G WW IFGR 
Sbjct  187  TEGGAIGFIFVVFMIPWIYWSVVLYFVPVTVVRE--TGDNALVASFQLVKGRWWRIFGRM  244

Query  269  VLLLVISLTLSFLTARIPYVGEAANL  294
            ++  ++ + +      +  +  AA  
Sbjct  245  LVWGLVMIGVGIGLGIVFSLVAAAVG  270


>NOY23820.1 hypothetical protein [Acidobacteria bacterium]
Length=226

 Score = 52.9 bits (123),  Expect = 7e-05, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 15/34 (44%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP+C  +     S +P K    +C +C +  
Sbjct  2   EISCPNCSKKYRVDESLIPEKGRKVKCRKCGEVF  35


>HBJ76497.1 hypothetical protein [Porphyromonadaceae bacterium]
Length=202

 Score = 52.5 bits (122),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 28/163 (17%), Positives = 51/163 (31%), Gaps = 2/163 (1%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
               +        +        N              +  S     + + I +    L  S
Sbjct  23   MVGLALGYGFIFSLIYWVFVGNGVPLAYEVVSGLFSIFFSLAYTKISMDIAEGKDALLGS  82

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
                +   G+  L  ++L + +  G  LLIIPGL     F F  Y +  +    + AL+K
Sbjct  83   FGEVIPFFGNALLCGVMLCIPIMIGMFLLIIPGLYIASRFMFSTYFI-LEGEKAIPALKK  141

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
            S      +    F  +++  +I L  + L     +   A    
Sbjct  142  SWKATK-NTKGTFPLYIVFGLIILLGALLLIFGIFPAFALVSI  183


>WP_142094079.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Propioniferax innocua]TQL58273.1 glycerophosphoryl 
diester phosphodiesterase family protein [Propioniferax 
innocua]
Length=401

 Score = 54.4 bits (127),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 17/95 (18%), Positives = 35/95 (37%), Gaps = 0/95 (0%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
                + +L+       L I   +  +   F    +     GG+ A ++S  L  G +W +
Sbjct  164  AGSAVGMLLGYLLMFALEIGIFILSIKLMFLMPEVTVQGNGGITAAKRSWTLTRGRFWRL  223

Query  265  FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
             G  +L  VI   + +    +      A++  S+ 
Sbjct  224  LGYTLLFSVIYSVIYYAIMMLGAFIMMASVLTSIP  258


>WP_146540482.1 zinc-ribbon domain-containing protein [Reyranella sp. CPCC 100927]TWT13986.1 
hypothetical protein FQU96_08800 [Reyranella 
sp. CPCC 100927]
Length=202

 Score = 52.5 bits (122),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 9/82 (11%), Positives = 17/82 (21%), Gaps = 0/82 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP+C A      + +     + +C  C        +    T       T     + R
Sbjct  2   QLTCPNCSARYLVDPAAIGPTGRTVQCFRCGHKWQARLSTPVGTAEPVPAPTPVPDFIIR  61

Query  63  RIPSDRLEIQSKTVNCRRCNRS  84
                                 
Sbjct  62  PPSQPEANYLPAIPADPGMPTW  83


>ABC77958.1 hypothetical transport protein [Syntrophus aciditrophicus SB]
Length=630

 Score = 54.8 bits (128),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 49/327 (15%), Positives = 97/327 (30%), Gaps = 34/327 (10%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  +RCPHCG  +                                    +          
Sbjct  2    MTKIRCPHCGTSQEL--------------------------FPHIILCRNCYGDLRGEFE  35

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
            +    +  +E                 +   +    G+                      
Sbjct  36   KHSSQTISVEKPPAESGPAVRKYQAIRKYPLKKIGPGNSSLFDILGKTGCLSFKRFFPLF  95

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             L  + +   L      S L L                +    +   LL   +   ++ +
Sbjct  96   PLCYFSVFFFLLIGIFISKLGLPIVFPEYFPLDPSLRYLAGGGILTCLLASLYTQTALLL  155

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +    + L   +      + S+TLL++L+ +++G G  +LI PG++  V   F  ++LA
Sbjct  156  AVSNQHLDLGDVLAKAWSRLVSYTLLILLMAIIIGLGYSILIFPGVIAIVLLIFAPFILA  215

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFV-LLLVISLTLSFLTARIPYVGEAANLAFSL-  298
             +N+G  +A+ KS   V+  W  +F     + L+I  +L F       +      AF+  
Sbjct  216  AENVGVTEAISKSVSYVAHDWLRVFLCLAPVPLLIIFSLMFFAYGGTPILWVTRNAFAFV  275

Query  299  ------LLTPFSFLYYYLIYSDLKANY  319
                  +  PF  +  Y+ +       
Sbjct  276  VIVSAVISVPFMLMTLYIYHVYDDLRK  302


>TMK56128.1 hypothetical protein E6G51_10770 [Actinobacteria bacterium]
Length=237

 Score = 52.9 bits (123),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 33/158 (21%), Positives = 59/158 (37%), Gaps = 6/158 (4%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
               + L+    MT    +   +    +   +   L  + + TL+ +L +  V    + LI
Sbjct  58   AAFFALVQAVAMTVLRDLRERRPASSIGDLLATALPPLPAATLVGVLALAAVTVALVFLI  117

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            +PGL     +     V   +  G   A  +SR LV G+ W + G  +LL ++    + L 
Sbjct  118  VPGLYLMTIWAVVLPVAVVERPGVFDAFGRSRGLVRGNGWKVLGVVLLLGLLLAVSAALA  177

Query  283  ------ARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
                  A  P V        S ++ P   L   ++Y  
Sbjct  178  LLLHRHAAGPVVSILFGSLLSSVIAPIQMLVLGVLYFR  215


>OGH68413.1 hypothetical protein A3D53_03145 [Candidatus Magasanikbacteria 
bacterium RIFCSPHIGHO2_02_FULL_45_10]OGH81400.1 hypothetical 
protein A3I29_03215 [Candidatus Magasanikbacteria bacterium 
RIFCSPLOWO2_02_FULL_44_11]
Length=276

 Score = 53.2 bits (124),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 38/117 (32%), Positives = 59/117 (50%), Gaps = 14/117 (12%)

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV-  269
             LV+ GG+LLLIIP ++F VWF F    L  +N  G++AL  S+ LV+G WW IF R + 
Sbjct  140  WLVMFGGALLLIIPSIIFVVWFCFAMQELIYENKRGVKALSSSKDLVAGRWWTIFVRLIA  199

Query  270  -------LLLVISLTLSFLTARIP------YVGEAANLAFSLLLTPFSFLYYYLIYS  313
                     + + + +  L   IP       +   AN+A S++  P   +   ++Y 
Sbjct  200  PPIVFAVAFMAVYMVMGLLFGLIPSDMARLILIVLANIALSIVFVPLLAIPPIVLYF  256


>MBI2415545.1 hypothetical protein [Candidatus Kerfeldbacteria bacterium]
Length=245

 Score = 53.2 bits (124),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 22/172 (13%), Positives = 58/172 (34%), Gaps = 2/172 (1%)

Query  144  PATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSF  203
                ++         + +     ++     +  ++     +  + L   ++  +R     
Sbjct  62   LIGIISHPTYIVNVVLQVVVNVALVFIAIVIILTLQHAYHRQVITLHALVREAIRFYPRA  121

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
             +L  +  ++   G  LL++PG++  V+F      L  D +    A+  S  +V G WW+
Sbjct  122  IVLHSITGIITLIGLGLLLVPGIIVAVFFSMALPALVWDKLTIRAAMVASWRMVRGRWWS  181

Query  264  IFGRFVLLLVISLTLSFLTA--RIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
            +        V++    +L        VG          +     + + ++  
Sbjct  182  VCLYLTSTYVLTDVAGWLIITVLPNTVGFTTAGLTIAAIINVFAIIFTVVVF  233


>WP_199729054.1 zinc-ribbon domain-containing protein, partial [Corallococcus 
sp. CA053C]RKG96111.1 thioredoxin, partial [Corallococcus sp. 
CA053C]
Length=72

 Score = 49.0 bits (113),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+C  C      P  K+  K    RC +C  T 
Sbjct  1   MI-VQCEQCQTRFKIPDEKVTEKGVKVRCTKCQNTF  35


>OAS14804.1 hypothetical protein A8708_04695 [Paenibacillus oryzisoli]
Length=195

 Score = 52.1 bits (121),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 36/177 (20%), Positives = 70/177 (40%), Gaps = 14/177 (8%)

Query  157  WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG  216
              I+   +      L   +   F+ + + +      +  G   +   TL  +L +L +  
Sbjct  16   HWIVDIALFLFSGALMLSSVHYFLRLHRNERAEISDLLYGFNQLIPSTLTYLLFLLFILL  75

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
             +LLLIIPG++  + +    Y+L D+ +I   QA+++S  ++ GH W  F   +  +   
Sbjct  76   WTLLLIIPGIIAGLRYAMTFYILNDEPDIKPHQAIKRSSEMMKGHKWNFFKLQLSFIGWY  135

Query  276  LTLSFLTARI------------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
              L  +   I            PY        F+L +  ++       Y +LKA  +
Sbjct  136  -LLGIIAGFITIAALPYVQMNDPYWSGLIIALFTLPVLTYTAAATAAFYENLKAMQQ  191


>MBD3181089.1 hypothetical protein [Candidatus Poribacteria bacterium]
Length=305

 Score = 53.6 bits (125),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 26/210 (12%), Positives = 66/210 (31%), Gaps = 38/210 (18%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
                    +  +T S+   +     G+        + +      + +  +++     +  
Sbjct  89   LFFITPFLVGILTISISSMLLNKKAGIVDVYSRLSKKILPLIGTVFITGVMMAVVFFMSA  148

Query  223  IPGLLF----------------------CVWFFFCQYVLADDNIGGLQALEKSRLLVSGH  260
              GL                         VW+ F    +  +  GG+ A+++S+ LV G 
Sbjct  149  SLGLSMIFAGSQPGLLVVIAGMFMTGVLLVWYAFISQAVIFEGEGGIGAMKRSKYLVKGS  208

Query  261  WWAIFGRFVLLLVISLTLS-------------FLTARIPYVGEA---ANLAFSLLLTPFS  304
            +   F   ++ L+     +             F +  I   G A    +   S+++ P  
Sbjct  209  FARTFLLVIVSLIAITFAAELASLGVYQLFSLFGSYGITLAGGASEGVSNIISVIVEPLR  268

Query  305  FLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
             +   ++Y D +    G     + +++  +
Sbjct  269  IIIIIILYYDFRIRKEGYDPEIMAQEFKGI  298


>QOV90846.1 hypothetical protein IPV69_05665 [Phycisphaerales bacterium]
Length=918

 Score = 54.8 bits (128),  Expect = 8e-05, Method: Composition-based stats.
 Identities = 41/544 (8%), Positives = 102/544 (19%), Gaps = 54/544 (10%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD------------PAESQRTQTTD  50
             V+C +C    +       +     +C EC    +              PA         
Sbjct  4    LVQCVNCQRRYSLDE---KSAGKKVKCKECGTVFVAQAAGQSAGLTSAPPASPPPAAPRV  60

Query  51   NIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADS  110
               T               E                  PE           S+ Q  A  
Sbjct  61   APRTAAPTIQTPPPRPRMEENDDPFAAMSLLEAGTAPPPETSGPMYSGAAMSVRQAPAAP  120

Query  111  WELFCRRGW-----------------GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQ  153
               +                       +L +     +     +    +   A       +
Sbjct  121  ATPYTFAPPPRKRTNTTKSLGLDSLTPILLLCFFVGLAVVLYMGLTHVTDKAEPGTHPLE  180

Query  154  NWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV  213
                   +  +        ++  +  + +           +L          +  L ++V
Sbjct  181  IKAAKNAVWIMVITFAVSHFVIVAPLVLLAVFIASKIMKFELPGAGYMRAAGVGALPLVV  240

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            + G  +LL     L  +       +          AL+    L+ G     +G   +  V
Sbjct  241  LLGSDILLPANMALRLILLVSILAL-------AFYALKNIFELMIGEALVAYGFTCVFFV  293

Query  274  ISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
            + + +S   + I                             + A     +   +      
Sbjct  294  VGIVVSITMSGIVAAAGIVQS-------------TADAQRAINAEQEQKRDQELADLRGS  340

Query  334  LTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQR  393
                             +                   QR           N       + 
Sbjct  341  RPPGYRPSGTPSSGGGATEPPPPPVDPM--ETRAQSLQRRLEAFDARNLDNAGRESVERD  398

Query  394  LSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQK  453
             ++   ++  +           S+  +     +            ++      +   A  
Sbjct  399  FATLRAEVASATGPLRARPEWGSIDQLHKSILQKINALPTEQPDPQIFKPVVADTEFAPP  458

Query  454  GSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQ  513
             +    + + +      +     +            Q           S+         Q
Sbjct  459  ANGVAALGEEVSFGKYFIRPAADARMDLRTSNPSRYQWTAGQSRGESFSLSSVPRKNDRQ  518

Query  514  VHSI  517
            +   
Sbjct  519  LRPW  522


>MBA3259230.1 zinc-ribbon domain-containing protein [Gemmatimonadales bacterium]
Length=63

 Score = 48.6 bits (112),  Expect = 9e-05, Method: Composition-based stats.
 Identities = 10/33 (30%), Positives = 13/33 (39%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           V CP+C        +K+P     ARC  C    
Sbjct  3   VTCPNCATVYRVDPAKVPEVGVRARCAVCSAVF  35


>HCG96087.1 hypothetical protein [Halieaceae bacterium]
Length=281

 Score = 53.2 bits (124),  Expect = 9e-05, Method: Composition-based stats.
 Identities = 29/176 (16%), Positives = 67/176 (38%), Gaps = 0/176 (0%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                       +A I   + L     +     +   + +   +  IL  L    G + I 
Sbjct  85   FKWSYFAAAFVYAVISIVITLVQEAAVGSAGDDVAASFVEILITLILFPLGVGLGLLGIR  144

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                      ++     H     ++ +L+++++  G  LL++PG+   + + F  Y++ +
Sbjct  145  RAAGRETPVSTLWEPYSHALPLIVMFVLMVVLIIAGFFLLVLPGIYLSIAYSFAPYLIVE  204

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
             N+G  +ALE SR  ++ +WW  FG  ++ +V+ +          +         +
Sbjct  205  KNMGVWEALETSRKAITHYWWRYFGLMLVAMVLIIIGLIPLLIGLFWVLPIVAIAT  260


>OHA53576.1 hypothetical protein A3A30_00260 [Candidatus Terrybacteria bacterium 
RIFCSPLOWO2_01_FULL_48_14]
Length=298

 Score = 53.6 bits (125),  Expect = 9e-05, Method: Composition-based stats.
 Identities = 26/134 (19%), Positives = 58/134 (43%), Gaps = 0/134 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
            A+  +  L    +  ++ +      +    ++  G   +  + ++L +  ++  G  +  
Sbjct  53   ASFLFYFLAPIIILQTIALNQSGGRLEFREAINRGFGVLFPYGIVLFIAGIIRFGAFVPF  112

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIPG++  V+    +   A +   G  AL +S  LV G  W++FGR +LL +I + ++ L
Sbjct  113  IIPGIIVAVFLALVEPAAAIEGRRGFDALARSWALVRGSGWSVFGRLLLLWLILIAVAVL  172

Query  282  TARIPYVGEAANLA  295
                 ++       
Sbjct  173  LVLPVFLVTIGLTL  186


>NDC59313.1 DUF3426 domain-containing protein [Alphaproteobacteria bacterium]
Length=427

 Score = 54.0 bits (126),  Expect = 9e-05, Method: Composition-based stats.
 Identities = 8/41 (20%), Positives = 11/41 (27%), Gaps = 1/41 (2%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
            M    CP C        + +     + RC  C  T      
Sbjct  96   MIL-TCPSCSTRYFADDASIGPSGRTVRCASCAHTWFCQGH  135


>RMF67257.1 hypothetical protein D6742_07940 [Cyanobacteria bacterium J069]
Length=211

 Score = 52.5 bits (122),  Expect = 9e-05, Method: Composition-based stats.
 Identities = 26/148 (18%), Positives = 49/148 (33%), Gaps = 0/148 (0%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            + L    A   +   LL+   + +          +     + +   L      +   + K
Sbjct  58   WELFKKNAGGFVGFTLLMLALSAVPQILPERLRPLGSIASSVLSGPLGAGFYIVAFKLIK  117

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
                 F     G  +     L  +L  ++   G +LL IPG+   V + F    + D   
Sbjct  118  QRGTTFSDFFRGFNNFLPLFLASLLTSILTVVGFILLFIPGIYLAVAWAFTTLFIVDRRF  177

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLL  272
                A+E SR ++S  W++     V   
Sbjct  178  DFWDAMEASRKVISKRWFSWLLFIVAFY  205


>NLM97551.1 hypothetical protein [Halanaerobiaceae bacterium]
Length=360

 Score = 53.6 bits (125),  Expect = 9e-05, Method: Composition-based stats.
 Identities = 23/174 (13%), Positives = 50/174 (29%), Gaps = 5/174 (3%)

Query  145  ATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT  204
             +          +  +L     +L  L      +          L   + L       F 
Sbjct  114  FSNFWHIWLTQYFTGILMLGGVLLFMLPIFIVFIISVGVDLINSLPWMLLLYGNSSIPFY  173

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
                 L L      ++  +  + + + F F  +   D      +A+     + + + W I
Sbjct  174  SYSGSLYLYFLILCIVSFVAFIYWALKFMFTTFAAVDKKFKTAEAIFYGSQITNSYKWKI  233

Query  265  FGRFVLLLVISLTLSFLTARIPY-----VGEAANLAFSLLLTPFSFLYYYLIYS  313
            F   +L ++I   +S L+  +             L    +  P+       IY+
Sbjct  234  FLAVLLPVIIYSLVSILSGYVFGDESWPALIIRFLFGLFIYAPWLSSVLGEIYN  287


>HBF67473.1 hypothetical protein [Candidatus Magasanikbacteria bacterium]
Length=249

 Score = 52.9 bits (123),  Expect = 9e-05, Method: Composition-based stats.
 Identities = 29/170 (17%), Positives = 67/170 (39%), Gaps = 0/170 (0%)

Query  95   ASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQN  154
               S  + I Q      + +     G+  ++++  +        A  +   +        
Sbjct  1    MLPSFWQLIEQAWNLYGKHYRTYLPGVFVLFVIAFITPLFRFLYAPSVAALSEQPLNIFI  60

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
                ++       L     +   ++  I +        + +  RH     ++ ++++L  
Sbjct  61   TYAILVFVLSLISLYISFTLLRIIYASITQKQARWKDELMITARHFIPAFVVALIIMLAT  120

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
              G+L++IIPGL+  V+ FF QY +  +    +++L+K   LV G W+A+
Sbjct  121  VVGALIVIIPGLILFVFLFFSQYYVLFEGDSIVESLKKGWHLVRGRWFAV  170


>MBI5354742.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Chloroflexi bacterium]
Length=153

 Score = 51.3 bits (119),  Expect = 9e-05, Method: Composition-based stats.
 Identities = 28/126 (22%), Positives = 43/126 (34%), Gaps = 16/126 (13%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            V    LLL     L    +     V   ++ G   ++ +S  L   H+W +FG      +
Sbjct  13   VIVFLLLLFPIISLLTTRWSLATTVAVLEDTGAATSMRRSWTLTEQHFWRVFGTSFAAGL  72

Query  274  ISLTLSFLTARI----------------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKA  317
            +S+ LS L A                    +         +L TP +   Y LIY DL+ 
Sbjct  73   LSMLLSTLPALFITYLFEQVIHAPIRLSTIITMVLGQLTIILTTPLTVGVYVLIYYDLRI  132

Query  318  NYRGPQ  323
               G  
Sbjct  133  RKEGFD  138


>PYV11776.1 hypothetical protein DMG23_03280, partial [Acidobacteria bacterium]
Length=277

 Score = 53.2 bits (124),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 26/130 (20%), Positives = 43/130 (33%), Gaps = 19/130 (15%)

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR-----------------  284
            +NI   +AL++SR+L  G +  IF   +L L+IS  ++F+                    
Sbjct  1    ENITAREALKRSRVLTKGQFGRIFLAGILTLIISWVIAFVIQGPFSVAATLMVVNKVQPP  60

Query  285  --IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWM  342
              +  V   A         P   +   LIY D++    G     +          +    
Sbjct  61   AWLNLVSMIAGGLSGAFAGPLYAIALALIYYDVRVRKEGYDLQLMVEALEDTAPGVDKAP  120

Query  343  LIPGLLLVSL  352
             I     V L
Sbjct  121  RIATGAGVQL  130


>RKZ07008.1 hypothetical protein DRQ05_03820, partial [bacterium]
Length=33

 Score = 47.8 bits (110),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 10/34 (29%), Positives = 14/34 (41%), Gaps = 1/34 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M  V C  C  +      K+P K    +CP+C  
Sbjct  1   MI-VTCESCHTKYILDDDKVPEKGIRVKCPKCSF  33


>PAV66383.1 hypothetical protein WR25_19351 [Diploscapter pachys]
Length=340

 Score = 53.6 bits (125),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 10/41 (24%), Positives = 15/41 (37%), Gaps = 1/41 (2%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
            M    CP C      P S +  +  + RC  C  +   +P 
Sbjct  85   MIL-ECPECSTRYLVPDSAIGVEGRTVRCANCRHSWFQEPP  124


>OFW84268.1 hypothetical protein A2018_07130 [Alphaproteobacteria bacterium 
GWF2_58_20]HAU29247.1 hypothetical protein [Rhodospirillaceae 
bacterium]
Length=224

 Score = 52.5 bits (122),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 17/65 (26%), Positives = 23/65 (35%), Gaps = 1/65 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    CP CGA+ N   + +PA     RC +C    +  P           +   P  G 
Sbjct  1   MIL-TCPACGAKFNLDDALMPAAGRKVRCGKCAHVWLAMPVSPVEAVPEAVVVERPIPGT  59

Query  61  QRRIP  65
            R  P
Sbjct  60  LRPEP  64


>OYW58129.1 hypothetical protein B7Z31_08445, partial [Rhodobacterales bacterium 
12-65-15]
Length=210

 Score = 52.5 bits (122),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 29/170 (17%), Positives = 59/170 (35%), Gaps = 1/170 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               +  L           AL +         +       ++  +    +  + +    + 
Sbjct  39   FPTLVALAASGLINGWEVALGISEPILTGWADLIPFGITVMVQLIAYGITAALLVQLAYD  98

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
               K  V + R +   L       +L +L  +++  G L LI+PGL     F      +A
Sbjct  99   AKLKRPVEINRYVAPALAVAFPIAILGLLSGILMVLGILALIVPGLWIYAVFSVMPAAVA  158

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
             +   G   L +S  L  G+ W I G  +L+ +++  +S +   I  +  
Sbjct  159  IER-NGFSGLGRSARLTRGYRWPIVGATILIGIMNAVVSAIAMFIVSLFA  207


>MBG85303.1 hypothetical protein [Verrucomicrobiales bacterium]
Length=393

 Score = 54.0 bits (126),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 34/328 (10%), Positives = 76/328 (23%), Gaps = 18/328 (5%)

Query  15   TPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSK  74
                  PA   +  C EC     FD       +              +   S   +    
Sbjct  67   VTEVDEPASGGNITCVECNNRFSFDEVIRLGDRYVCAECKPFAVQKMQEGVSFTSKSGMT  126

Query  75   TVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFA  134
                   +    +        +         +              ++      + L  +
Sbjct  127  IDQMLDSDYDNGVMSCIRGGGAVFRAHFFPIIGVSIVVFLVMGAMQVVPFLGALLSLVLS  186

Query  135  PIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMK  194
                              +                 +     S  +       G+   + 
Sbjct  187  GPIIGGYWLYVIRRVRGEETSIGDAFAGFGPGFGNLMLGYIVSSILAPLPLLPGVVLLLI  246

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR  254
             G     +    + L+ + V   +L+  +  +   + +FF   ++ D  +    A+   R
Sbjct  247  AGFASANAGEANVALIAMGVIL-TLVGSLFFIRLAISWFFTCAIIIDKKMKFWPAMSFGR  305

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLT-----------------ARIPYVGEAANLAFS  297
             +V+ HWW+  G   L  VI   + F+                  + + ++         
Sbjct  306  SMVNKHWWSTLGIGFLFGVIIWGVIFVFVIAGGALAAVAGSGGGDSAVAFIMLIVLAPVM  365

Query  298  LLLTPFSFLYYYLIYSDLKANYRGPQHP  325
            LL  P+    Y   Y  +          
Sbjct  366  LLSIPWGICAYAYRYDVVFGPLESQDEF  393


>KWV91702.1 hypothetical protein AUC45_10870 [Erythrobacter sp. YT30]
Length=397

 Score = 54.0 bits (126),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 10/44 (23%), Positives = 17/44 (39%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M    CP C      P S +  +  + RC +C  +   +P   +
Sbjct  1   MIL-TCPACATRYVVPDSAIGGEGRTVRCAKCKHSWFQEPNLPE  43


>WP_199693393.1 zinc-ribbon domain-containing protein, partial [Sorangium cellulosum]
Length=99

 Score = 49.8 bits (115),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 13/34 (38%), Gaps = 0/34 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M  V C  C +       ++P      RCP+C  
Sbjct  1   MIKVECDGCKSPYQVDEKRVPPAGLKMRCPKCGT  34


>HCI45959.1 thioredoxin [Rhodospirillaceae bacterium]
Length=86

 Score = 49.4 bits (114),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 12/37 (32%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V CP C      P S +  +    RC  C     
Sbjct  1   MI-VACPACNTRYELPPSSISGEGRQVRCARCGNQWF  36


>PWM63029.1 hypothetical protein DBX63_02440 [Clostridia bacterium]
Length=352

 Score = 53.6 bits (125),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 27/218 (12%), Positives = 61/218 (28%), Gaps = 15/218 (7%)

Query  157  WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG  216
            +             ++++                    +     G        LI+ +  
Sbjct  133  FFTTTLCYLLAAFAIAFVLNIFVSIFTAFATLFATLGAIPSLVGGGVFHAGAGLIVPLVL  192

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
              LL+ +  L    +  F   V  ++ I    A+++S  LV   +  +FG  ++L  I L
Sbjct  193  VFLLVFVAELAGTSFLLFVYPVAVNEPIRNFAAIKRSFQLVWKRFGRVFGCLLILSGIVL  252

Query  277  TLSFLTARIPYVGEAANLAFSLLLT---------------PFSFLYYYLIYSDLKANYRG  321
              + +     ++    + A  ++L                P+      ++Y D +    G
Sbjct  253  VFALIFTACVFLAIELSGAAGIVLVCLAVLLYLVMLLFLSPYGAALVTVLYFDTRTRMEG  312

Query  322  PQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSA  359
                    Q                +   +   +N S 
Sbjct  313  TAWLGEPEQPSAQPQQPEPAAWQEPVQEPAPQSENDSP  350


>WP_047581807.1 zinc-ribbon domain-containing protein, partial [Methylobacterium 
sp. ZNC0032]
Length=99

 Score = 49.8 bits (115),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 8/35 (23%), Positives = 15/35 (43%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            + CP C ++    ++KL  +    RC  C  +  
Sbjct  2   LIVCPSCASQYELDAAKLGPEGRKVRCANCKTSWH  36


>HCR64982.1 hypothetical protein [Oceanicaulis sp.]
Length=34

 Score = 47.8 bits (110),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 10/34 (29%), Positives = 16/34 (47%), Gaps = 1/34 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M  V CP C A+    ++ L A+ +  +C  C  
Sbjct  1   MI-VTCPSCEAKYRVDAAALAARGNKVKCAACAH  33


>HBJ93531.1 hypothetical protein [Hyphomonadaceae bacterium]
Length=105

 Score = 49.8 bits (115),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 10/40 (25%), Positives = 14/40 (35%), Gaps = 1/40 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M    CP C A+       + A   + RC  C  +    P
Sbjct  1   MIL-TCPSCSAQYFADDKAIGANGRTVRCAACAHSWFAQP  39


>OYW53255.1 hypothetical protein B7Z31_12010, partial [Rhodobacterales bacterium 
12-65-15]
Length=207

 Score = 52.1 bits (121),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 11/45 (24%), Positives = 16/45 (36%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M  V CP+C A    P   +P      +C  C     F    ++ 
Sbjct  1   MRLV-CPNCAATYEVPEDAIPDTGRDVQCASCGHAWFFARPGTEF  44


>MBA2360413.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Actinobacteria bacterium]
Length=111

 Score = 50.2 bits (116),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 22/108 (20%), Positives = 47/108 (44%), Gaps = 7/108 (6%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            +  G + L++PGL+    +     ++  +     +AL +S  LV G    +   FVLL  
Sbjct  1    MLLGLVALVVPGLVLLARWGLVVPLIVLEGADWRRALARSNALVRGQTRPVMAIFVLLTG  60

Query  274  ISLTLSFLTARIPYV-------GEAANLAFSLLLTPFSFLYYYLIYSD  314
            +++ ++ +   I Y+          A LA  +++  F     +++Y  
Sbjct  61   LAIGVALIPVLIGYLVLENVLGAWLATLAIDVMMVSFYAFAPFVLYRR  108


>NLH85803.1 DUF4339 domain-containing protein [Verrucomicrobia bacterium]
Length=441

 Score = 54.0 bits (126),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 31/183 (17%), Positives = 57/183 (31%), Gaps = 7/183 (4%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            +  L I           LLK          +       A    +L  +  +       + 
Sbjct  147  LVNLIITGPLTGGVLYFLLKNIRRQPTGISDVFAGFRRAFGQLLLGYVVMIILIYLAMLP  206

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
               + ++      L  +     +    I V   G + LI+P +   V + F   ++ D  
Sbjct  207  GVALMVWP-----LLTLARQQAVAAGPIFVALLGFICLIVPAIYLSVSWTFLLPLIIDRQ  261

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVI--SLTLSFLTARIPYVGEAANLAFSLLLT  301
            +    A++ SR +V  HWW +FG   +  +I     L         +  A         +
Sbjct  262  MKFGPAMKASRRMVGKHWWQVFGLVAVCGLINVVGLLFCGVGIFLTLPIALGAILYAYES  321

Query  302  PFS  304
             FS
Sbjct  322  IFS  324


>OZB22600.1 hypothetical protein B7X51_14825, partial [Pseudomonas sp. 34-62-33]
Length=111

 Score = 50.2 bits (116),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 21/96 (22%), Positives = 46/96 (48%), Gaps = 0/96 (0%)

Query  202  SFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW  261
               +  ++++L++  G +LL+IPG+   V +     ++ +  +   QALE SR  ++ HW
Sbjct  1    PLIITAVVMMLLIYIGMILLLIPGIYLGVAYLLAIPLVVERGLSPWQALEASRKAITQHW  60

Query  262  WAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            + +FG F++L +I +  +              +   
Sbjct  61   FKVFGLFIVLGLIIIVSAIPLGIGLVWSIPLMVVAM  96


>MBC7768129.1 zinc-ribbon domain-containing protein [Phycisphaerales bacterium]
Length=75

 Score = 49.0 bits (113),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 6/37 (16%), Positives = 10/37 (27%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M    C  C        + +     + RC  C  +  
Sbjct  1   MIL-TCTSCSTRYYADDAAIGPAGRTVRCAACGFSWF  36


>KPJ73767.1 hypothetical protein AMS14_06350 [Planctomycetes bacterium DG_20]
Length=230

 Score = 52.5 bits (122),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 31/198 (16%), Positives = 60/198 (30%), Gaps = 25/198 (13%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
            ++ +    +L    +L          + +     I   L +    +     K +      
Sbjct  36   WSLVTGDFVLFVIGYLVVAAILGVSVLTVIGPLIIGGPLWFGYFRVVQTRLKGEPASIGD  95

Query  193  MKLGLRHVGSFTLLLILLILVVG--------------GGSLLLIIPGLLFCVWFFFCQYV  238
            +  G R  G   L  +LL L+V                G LL +   L+    FFF   +
Sbjct  96   VFQGFRDFGKGFLTFLLLALIVLGVVVVQMLLSLIPVLGILLTLCVSLVVGPMFFFVMPI  155

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
             A  ++    A+ +S      ++W             + LS +   I   G       +L
Sbjct  156  AALSDVSPTDAISRSVKFCFANFWK-----------MVLLSLVLGLIAMAGSLVCGIGAL  204

Query  299  LLTPFSFLYYYLIYSDLK  316
               P + +     Y++  
Sbjct  205  FTVPLAIVATVAAYNEYY  222


>ETR74400.1 anti-sigma B factor antagonist [Candidatus Magnetoglobus multicellularis 
str. Araruama]
Length=186

 Score = 51.7 bits (120),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 14/37 (38%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V+C  C  +      K+P K +  RC  C     
Sbjct  1   MI-VQCESCQKKYRLDDKKMPPKGTKVRCSRCKHIFH  36


>MBE3550789.1 hypothetical protein [Brockia lithotrophica]PTQ51735.1 hypothetical 
protein BLITH_1373 [Brockia lithotrophica]
Length=264

 Score = 52.9 bits (123),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 31/199 (16%), Positives = 63/199 (32%), Gaps = 10/199 (5%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            WG++G    G          + +              ++++ L    +  L L     + 
Sbjct  39   WGIVGAIWAGSDPEAFSDLISKIQSLEDEPPEVPAPIEFSLQLLVNVFTSLFLGGFYAAF  98

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
               +     G+   +         + +   ++ LV G G L  ++PG+   V   F  + 
Sbjct  99   LAAVRGEPFGVGI-LFSRTGDFWKYLVAHFVVSLVFGLGLLFFVLPGIYLGVRLSFYPFA  157

Query  239  LADDNIGGLQALEKSRLL-VSGHWWAIFGRFVLLLVISLTL-------SFLTARIPYVGE  290
            +AD   G   A      L     +W +FG +  L+ + L          F+   +P++  
Sbjct  158  IAD-GHGVFDAPFTVSWLATQKRFWTVFGFYAALVGLGLLFPVGIGMMIFIGDFLPFLLP  216

Query  291  AANLAFSLLLTPFSFLYYY  309
               L   L    +      
Sbjct  217  VIFLFAVLGGIAYFLYAVA  235


>NIR37268.1 hypothetical protein [Actinobacteria bacterium]NIS31736.1 hypothetical 
protein [Actinobacteria bacterium]NIT95853.1 hypothetical 
protein [Actinobacteria bacterium]NIU66833.1 hypothetical 
protein [Actinobacteria bacterium]NIV56026.1 hypothetical 
protein [Actinobacteria bacterium]
Length=218

 Score = 52.1 bits (121),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 26/179 (15%), Positives = 50/179 (28%), Gaps = 11/179 (6%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG--SL  219
                   + ++ M         +       + + G R  G           +V     ++
Sbjct  38   LANVAAFVVVNAMVTDYLGTAGRGVGAAAEAARTGWRRRGDLAGAFGRSYAIVFLLLSTV  97

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL--------  271
            +L   G+   V + F    +  +   G  AL +S  LV G WW        +        
Sbjct  98   VLAPFGIRQLVRYQFAPQAVMAEERRGGAALRRSSALVRGRWWHTAVVVATINGAIALTA  157

Query  272  -LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
                 L L   +    ++          L+ P + +   L+Y D  A     +      
Sbjct  158  LAAALLLLVAASGVPLWIFSGLVSVVYALVVPLAAIAVTLLYGDAVAEREETEANEPVP  216


>MYD96811.1 hypothetical protein [Gammaproteobacteria bacterium]
Length=303

 Score = 53.2 bits (124),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 22/129 (17%), Positives = 40/129 (31%), Gaps = 13/129 (10%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            V    L    P ++  V++ F   ++   N+    AL +S  LV GH+W       +   
Sbjct  168  VLALVLFASTPIVVLLVYWGFALPLIVTRNLDTFAALGRSWNLVRGHFWRTLLILTIATF  227

Query  274  ISLTLSFLTARIPY-------------VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
            I   ++ L +   +             +    N    ++ TP        +  DL     
Sbjct  228  IVAVITILVSFGAFFTAFVPTVSGQMAILFLINTLSGMVTTPMFVAVLLAVLHDLSLRRG  287

Query  321  GPQHPPIKR  329
            G        
Sbjct  288  GDDLHRRIE  296


>MBI5835749.1 hypothetical protein [Candidatus Eisenbacteria bacterium]
Length=302

 Score = 53.2 bits (124),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 40/308 (13%), Positives = 76/308 (25%), Gaps = 13/308 (4%)

Query  16   PSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKT  75
            P+  LP     A  P          A +  +                      +      
Sbjct  3    PAQHLPEYPQPAVDPAQRLPEHPQVAVAPVSPWPGLAPPPAWPAGYVAGEQLSIARWVGQ  62

Query  76   VNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAP  135
                   R              S    I +            G     +    + LA   
Sbjct  63   GWQTFRARPLPWILWALAMLLPSAALEIYRYSQGWVGGSVSPGETGSVMTTTLLSLALTI  122

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
                L +    +     +      +    AY          ++   +          +  
Sbjct  123  SLLPLHVGGNLYALSLQRGAHPPAMKFLQAYARGAWLVAALALMGLMA--APVFLLVLLP  180

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
            G     +  L   L+++ +   +   +   L   + + F   ++AD  +  + A+  S  
Sbjct  181  GRFLSATGALAQALVVVYMLIATPFAVGVYLYLLLGWMFAPVLVADREMNAMDAMRTSWH  240

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
            LV GH W + G  ++L+            I + G  A L  SL+        +   Y DL
Sbjct  241  LVDGHRWRLLGLILVLV-----------AIAFAGVLACLVGSLVTQGIVAGAFSAAYRDL  289

Query  316  KANYRGPQ  323
                    
Sbjct  290  AVRSGDMD  297


>WP_144902450.1 hypothetical protein [Halobellus captivus]
Length=159

 Score = 51.3 bits (119),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 24/121 (20%), Positives = 40/121 (33%), Gaps = 5/121 (4%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              G    ++PG+     F    + +  ++ G L AL +S  L  G+   +    VL   I
Sbjct  32   TIGLAFFLLPGIFLAACFLCFIFAVGVEDCGTLSALRRSWDLSRGNRLKLSAIVVLSGAI  91

Query  275  SLTLSFL-----TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
              T+  +      A  P + E      S +L  F +      Y  L+     P       
Sbjct  92   GGTIGIVGTVFDLAAAPVIAELLTNTLSTVLFVFVYGVMAASYLQLREERAQPDSDETAG  151

Query  330  Q  330
             
Sbjct  152  T  152


>NLX73041.1 hypothetical protein [Bacteroidales bacterium]
Length=324

 Score = 53.2 bits (124),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 21/171 (12%), Positives = 56/171 (33%), Gaps = 12/171 (7%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
            +  ++S    +     +   Q     +   +   ++  +      +F            S
Sbjct  105  YMVLYSDPSREGFGIADVWLQFKAKFVKQLSFYLLVFLVVAAIAMVF------------S  152

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
                   +G+      LL+ ++    +LL++  +   V       V+  +++   +  ++
Sbjct  153  FIFIGLGIGAGYGGSRLLLALMVIAVILLLLFFIYLSVPLSMANMVIYAEDVDLGKVFKR  212

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
               L+ G WW  F   +++ +I   +S L +    V        +     F
Sbjct  213  CLDLIKGSWWQSFAIILVIYLIYSIISSLFSIPVMVSSMVKGFVAASGGGF  263


>WP_196104241.1 zinc-ribbon domain-containing protein [Pontivivens sp. MT2928]QPH55042.1 
zinc-ribbon domain-containing protein [Pontivivens 
sp. MT2928]
Length=434

 Score = 53.6 bits (125),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 16/37 (43%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  + CP+C  +   P S +PA+     C  C  +  
Sbjct  1   MRLI-CPNCATQYEVPDSAIPARGRKVECGNCGNSWH  36


>WP_187533636.1 hypothetical protein [Erysipelothrix inopinata]QNN60508.1 hypothetical 
protein H9L01_09060 [Erysipelothrix inopinata]
Length=214

 Score = 52.1 bits (121),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 27/192 (14%), Positives = 60/192 (31%), Gaps = 4/192 (2%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
                I ++G+V           +   +  N         +    V +             
Sbjct  10   NYGAIVMIGLVGTLISSILNFFMAGTSSSNHPFIYLLLTLASFVVTFFFAVFIETVAIHA  69

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                               + G+   +  +   ++  G LL +IPG+   + F F  ++L
Sbjct  70   YRNDDAITMDNFKAAFEHVNWGTALKINFIASFLIAIGLLLFVIPGIYLSLAFSFIGFLL  129

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI---PYVGEAANLAF  296
             D+  G    +++S  L  G+   I    +++ +++  +S +       P +        
Sbjct  130  YDE-KGHSNLIKRSMELTDGYKLKILLGGIVISILTGAVSSIFGLFIKEPALIRTLASLV  188

Query  297  SLLLTPFSFLYY  308
            +LL  P    Y 
Sbjct  189  TLLAQPIYINYI  200


>NOQ82797.1 hypothetical protein [Myxococcales bacterium]
Length=846

 Score = 54.4 bits (127),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 12/39 (31%), Positives = 17/39 (44%), Gaps = 0/39 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  V CP C A  +    +LP      RCP+C ++    
Sbjct  11  MIKVSCPSCKAAYDVDEHRLPDDGLRMRCPKCSESFQVH  49


>TML64728.1 hypothetical protein E6G22_03820 [Actinobacteria bacterium]
Length=205

 Score = 52.1 bits (121),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 35/176 (20%), Positives = 54/176 (31%), Gaps = 6/176 (3%)

Query  168  LLGLSWMTGSMFIYICKTDV-GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
            +     +   +           +    +   R         I+  L +  G LLLI PGL
Sbjct  30   VFVQGALIEIVRNIHAGRPARPIGALYETARRRFWPLFWASIIYSLGISFGLLLLIAPGL  89

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS-----LTLSFL  281
            L    +      +  +      AL+KSR  V G  W +    V   V+      L + F 
Sbjct  90   LAAARWSLMAPFVVLEGESASDALDKSRATVGGRTWEVMWIVVATFVLLAGSSNLIIYFA  149

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAA  337
                P      +  +S L  PF      ++Y  L    R   H  + R       A
Sbjct  150  LPEGPVPAILFSFLWSSLTAPFEAHVLSVVYYRLTDPERPVIHEDVVRWRSVWEGA  205


>HBM60493.1 hypothetical protein [Citreicella sp.]
Length=312

 Score = 53.2 bits (124),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 11/40 (28%), Positives = 16/40 (40%), Gaps = 1/40 (3%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
            M  + CP CGA+   P   +P +    +C  C  T     
Sbjct  82   MRLI-CPKCGAQYEVPRDAIPQEGRDVQCSGCGHTWFQSQ  120


>VVC04255.1 Membrane domain of glycerophosphoryl diester phosphodiesterase 
[uncultured archaeon]
Length=361

 Score = 53.2 bits (124),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 39/288 (14%), Positives = 85/288 (30%), Gaps = 7/288 (2%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
             +  F   G   + +  L +VLA        +L   + L  +            + YI  
Sbjct  72   WFSAFTGLGIAGVIVGFLLMVLAALISTYYNVLPTFSALRAKGLETVQWDRGKLLGYICF  131

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             +     +    + K  + +  +  L L       +     +L V        I  +   
Sbjct  132  VIVDFFYTALSLLDKRGLAVVVAFWLCLLLAVFSPMHAAFGLLTVVLA-FAYFIVIIYNA  190

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR----I  285
            +  +    +     +   ++L +S  L     +A+FG  ++L +       + A     +
Sbjct  191  MRLYAGLPIYLSKGMSITESLGQSWELTKDKAFAVFGLSLVLGLTWAIPIMVVAMGVELV  250

Query  286  PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIP  345
              V          +  P + L   +++  L   +R   +  +      L   I+  +L  
Sbjct  251  MQVVVLMGFVGLAVSGPGAILVGVILFVLLYMAFRMLVNVLMTFVMTYLMVGIYDQLL--  308

Query  346  GLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQR  393
                 + S     A +   A  +       +P       R  P+   R
Sbjct  309  QEKSGAKSAPMPRAPRPSEAAANEAGEEMPEPAPRRAAPRGAPKAAPR  356


>ELZ44635.1 hypothetical protein C464_13085 [Halorubrum coriense DSM 10284]
Length=324

 Score = 53.2 bits (124),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 29/182 (16%), Positives = 60/182 (33%), Gaps = 6/182 (3%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLI-LVVGGGSL  219
              ++  +      +    F            ++   L     F L+  L+I + +  G++
Sbjct  141  FISLMILSATYFVILSRTFAQSQSEMSRFPATLSHQLGRTTIFALIGGLIITVSIMFGTV  200

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            LLI+PGL     F F  + +A +  G + +L++S  L  G    +    V+  +    + 
Sbjct  201  LLIVPGLFLAASFLFFIFAVAVEERGIISSLKRSWDLARGSRLKLGILVVMSAIFGGVIG  260

Query  280  FL-----TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
             +      A +P   +   +          +      Y  L+       HP      +  
Sbjct  261  TITPLLTLAGLPIAADVVTVVLIAAFFVPYYAIIASAYLQLRGQEDDQNHPAQNPVNVSQ  320

Query  335  TA  336
            T 
Sbjct  321  TP  322


>HFD16665.1 DUF3426 domain-containing protein [Rhodospirillales bacterium]
Length=271

 Score = 52.9 bits (123),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 11/70 (16%), Positives = 18/70 (26%), Gaps = 1/70 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  + CP+C          L  +    RC +C      +P   +         +    G 
Sbjct  56   MI-IACPNCQTRFKIDPEALAPRGKMVRCSQCGHRWFAEPPAEEIEAPPLPERSPVEEGG  114

Query  61   QRRIPSDRLE  70
                   R  
Sbjct  115  TGPAEGGRRP  124


>NUO63237.1 hypothetical protein [Gemmatimonadaceae bacterium]
Length=48

 Score = 47.8 bits (110),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 12/31 (39%), Positives = 14/31 (45%), Gaps = 0/31 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           V CP C +      SK+PA    ARC  C  
Sbjct  3   VACPECRSVFRVDPSKVPAGGVRARCSVCGG  33


>PYQ07758.1 hypothetical protein DMF83_08590 [Acidobacteria bacterium]
Length=240

 Score = 52.5 bits (122),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 38/184 (21%), Positives = 78/184 (42%), Gaps = 2/184 (1%)

Query  109  DSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYIL  168
                 F       L + +LG ++    I +  L+      +P+ +N   ++       + 
Sbjct  1    MLARTFRLWATNFLALSVLGALIHSPAIAAWTLIAVIGGHDPERRNIFSSVANLLGLILN  60

Query  169  LGL-SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
            L L   +T S+F  I  + V +  ++++GL    +    ++L+ LV+    + LI+PGL+
Sbjct  61   LILEGAVTYSVFGQIRGSPVPVGVALRIGLSRANAVLGAILLVGLVMLPACVCLIVPGLI  120

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI-SLTLSFLTARIP  286
               W++    V   ++ G   AL +S+ L  G  W +F     L ++     + + A + 
Sbjct  121  LATWYWVAVPVAVIESPGSRAALTRSKELTQGDRWPVFACMSYLGMMRLAGFAVIVAGVR  180

Query  287  YVGE  290
              G 
Sbjct  181  AAGG  184


>MBA2313169.1 hypothetical protein [Actinobacteria bacterium]
Length=257

 Score = 52.5 bits (122),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 29/214 (14%), Positives = 72/214 (34%), Gaps = 22/214 (10%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
              ++ + L   P+   ++L+ A           +  L A   +  L +      M   + 
Sbjct  44   FLVVALPLWVLPVAGIVVLESAGGSRFIAFMVFFVQLAALQLFGSLLVGPAAVVMTERLH  103

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA--D  241
            +  + +  +++       S     +     V    LLL++P +   V      +      
Sbjct  104  ERPMSVKGALRELRPLRSSLIATGLYS---VIAAVLLLVVPVIPATVVLGPPIFAHVIAL  160

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT----------------ARI  285
            +     +AL ++R L+  +   +F    ++L+++  L F+                 + +
Sbjct  161  ERKTLQEALPRARSLLKRNALRLFTYLFVVLLLAFILDFVLYSQASVVERALGVDSESVL  220

Query  286  PYVGEAANLAF-SLLLTPFSFLYYYLIYSDLKAN  318
               G         ++L P +     + Y D++A 
Sbjct  221  ALAGALLRGLVTGVVLWPVASCLSLVAYFDVRAR  254


>ODT78704.1 hypothetical protein ABS71_01655 [bacterium SCN 62-11]
Length=242

 Score = 52.5 bits (122),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 22/140 (16%), Positives = 43/140 (31%), Gaps = 0/140 (0%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
                   A    +   ++ +   +         GL   +  GLR       L  ++ +  
Sbjct  42   IPHTSSWALSFTLGPWITALLWGVARREKGLRAGLPACLGHGLRLWPRMLGLGFMMSICC  101

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              G + L++P     V       +   ++    +AL KS  L   H+W +   + +LL  
Sbjct  102  MAGLICLVLPLFYATVVISLAPALAYAEDRPAEEALNKSYELTRPHFWRLLVFWTVLLGG  161

Query  275  SLTLSFLTARIPYVGEAANL  294
            S         I  +      
Sbjct  162  SFGAEMSLVLILELTRLLQF  181


>OGY81431.1 hypothetical protein A3F54_02195 [Candidatus Kerfeldbacteria 
bacterium RIFCSPHIGHO2_12_FULL_48_17]
Length=245

 Score = 52.5 bits (122),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 34/172 (20%), Positives = 66/172 (38%), Gaps = 0/172 (0%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
             R       +  + + LA    F +     A      + +  +A+            +  
Sbjct  28   WRMFLAFAIVLAIPLSLANEYFFPSGNPVTANVFALVSSDNFYALAFFNTLIGAFADAVF  87

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
                   I    +   + +K      G   + L+L  +VV G S+LLIIPG+++  ++ F
Sbjct  88   IYLTASAIVGKWLSPQKLLKKAANIWGILVMTLLLKGVVVLGLSVLLIIPGIIWLNYYTF  147

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
               ++  +      AL+ S+ LV GHWW + G  + + +  + +S     I 
Sbjct  148  VTQLVIINEKKWKSALDASKQLVRGHWWEVLGVNITVWLYYIGVSMGMGWIF  199


>NTU42834.1 hypothetical protein [Nitrospirales bacterium]
Length=259

 Score = 52.5 bits (122),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 31/189 (16%), Positives = 60/189 (32%), Gaps = 25/189 (13%)

Query  154  NWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV  213
                   + ++   L       G     I      L   +K+            +L +L 
Sbjct  66   MVLDFFYIFSIIAGLFVYGLTVGMAQKAIETGSANLREGIKIAKGRFFPLLSAAVLFVLT  125

Query  214  VGGGSLLLIIPGLLFC----------------------VWFFFCQYVLADDNIGGLQALE  251
              G  LL+  P +L                            F    +  D +   Q ++
Sbjct  126  TVGLMLLVSFPSMLLASAGAGSPLLSFGLSVIAGLMALYLLMFAVVAVVVDGLSAAQGMK  185

Query  252  KSRLLV---SGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYY  308
            +S  +V    G  + IF    L++VI    S   + IP++G+A  +  S +   F  +  
Sbjct  186  RSLEIVMAHKGESFFIFSILALIVVILFFSSIALSFIPFIGQATQILVSGIAGGFVSVVL  245

Query  309  YLIYSDLKA  317
             ++Y ++  
Sbjct  246  VMVYKEVMR  254


>OIO19188.1 hypothetical protein AUJ23_02480 [Candidatus Magasanikbacteria 
bacterium CG1_02_32_51]PIY93572.1 hypothetical protein COY69_00940 
[Candidatus Magasanikbacteria bacterium CG_4_10_14_0_8_um_filter_32_14]
Length=254

 Score = 52.5 bits (122),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 34/199 (17%), Positives = 67/199 (34%), Gaps = 7/199 (4%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               +        F   F  LL+   +++              T   +    +        
Sbjct  47   FFTLIPGTFTKTFLYFFGLLLVSFFSFMMSIAFIRMVNSHYMTKTVLGFFSNLKDSFFLS  106

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                  + L     + +  V     +  L +  +    L +I  G+LF  W  F    +A
Sbjct  107  IKNLVALLLLGIPAILVYIVYFVGSMGYLSVRAIILLYLGVIFIGILFVFWLSFVLVAIA  166

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY-------VGEAAN  293
             +N  GL A + S ++V G  W++F R  +  ++   L  +   I +       +     
Sbjct  167  IENQKGLNAFKSSIVVVKGRVWSVFLRLFIPSLLFYILFLVYNEIFFKLFAQNIIYTILV  226

Query  294  LAFSLLLTPFSFLYYYLIY  312
            L F+L + PF  +   ++Y
Sbjct  227  LLFALFVIPFGAIVPTILY  245


>MAL57839.1 hypothetical protein [Brevundimonas sp.]
Length=39

 Score = 47.5 bits (109),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 11/36 (31%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M    CP C      P   L A     RC  C +  
Sbjct  1   MIL-TCPACATSYFIPDDVLGANGRKVRCKSCGEVW  35


>WP_182450389.1 hypothetical protein [Streptacidiphilus sp. P02-A3a]QMU67865.1 
hypothetical protein GXP74_06120 [Streptacidiphilus sp. P02-A3a]
Length=270

 Score = 52.5 bits (122),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 20/139 (14%), Positives = 41/139 (29%), Gaps = 18/139 (13%)

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
             T      +L +   ++   +  L   +        LA +      A+ +S  LV G WW
Sbjct  118  LTGASDSTVLSLALLTVPGGLLVLWLYISLNLAGPALALERQTLRSAISRSLRLVRGAWW  177

Query  263  AIFGRFVLLLVISLTLSFLTARIPYVG------------------EAANLAFSLLLTPFS  304
             +F   +++ ++    + L      +                       +  S L  P +
Sbjct  178  HVFALTLVVTLLVDMAAGLLLSPTVIADLAMNNSDSTSTAGLLLTAVVGVLSSTLTIPIT  237

Query  305  FLYYYLIYSDLKANYRGPQ  323
                 L+Y D +       
Sbjct  238  SALSALLYVDQRIRREALD  256


>MBA4032952.1 hypothetical protein [Planctomyces sp.]
Length=130

 Score = 50.2 bits (116),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 21/123 (17%), Positives = 48/123 (39%), Gaps = 3/123 (2%)

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            + + +    + +  G  H+ +  L+ ++  + +   S   +IPGL     F+   YVL  
Sbjct  1    MARGETASIKDLFSGSSHLLNLYLVNLIQNVGLTFASCFCLIPGLFLLPIFWAVPYVLIA  60

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL--TLSFLTARIPY-VGEAANLAFSL  298
            +   G+ A+ ++     G+   I    ++ ++I        L   I + +         L
Sbjct  61   ERPPGIDAISRAIDYTRGNRLQILLIILVSMLIHFAGVCFCLGGLITFPLISMLTAVAYL  120

Query  299  LLT  301
             +T
Sbjct  121  RMT  123


>SBW20251.1 putative membrane protein [Candidatus Frankia californiensis]
Length=139

 Score = 50.5 bits (117),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 15/67 (22%), Positives = 23/67 (34%), Gaps = 0/67 (0%)

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
              V           +      AL +S +LV   WW  FG  +L  +I   +  +   +  
Sbjct  1    MAVALVLSTPAFVLEGGTINTALRRSWILVRCAWWRTFGILLLTGIIVTVVVSVLTIVIG  60

Query  288  VGEAANL  294
            V  AA  
Sbjct  61   VAFAAGG  67


>WP_138465116.1 zinc-ribbon domain-containing protein [Poseidonocella sp. HB161398]
Length=357

 Score = 53.2 bits (124),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 8/39 (21%), Positives = 13/39 (33%), Gaps = 0/39 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
             + CP C A  + P + + A     +C  C        
Sbjct  10  IRIICPACQAAYDVPQAAIAAGGRDVQCSACGHNWFQLW  48


>RLM60084.1 hypothetical protein DVK07_19755 [Halorubrum sp. Atlit-26R]
Length=123

 Score = 49.8 bits (115),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 20/110 (18%), Positives = 35/110 (32%), Gaps = 6/110 (5%)

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            +L IP +   V F F  + +  ++ G +  L++S  L  G+   +    +L  VI     
Sbjct  1    MLFIPVIFLAVSFLFFTFSVGVEDRGIVDGLKRSWGLSRGNRLKLSVLVILAGVIGFISG  60

Query  280  FLTARI-----PYVG-EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
             +           VG    N   S+L      +           N     
Sbjct  61   IVGTLFDISNAAIVGELIVNTINSILFVLLYGIISAAYLQLQGDNPERVD  110


>NWG72642.1 zinc-ribbon domain-containing protein [Parvularculaceae bacterium]
Length=388

 Score = 53.2 bits (124),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 15/99 (15%), Positives = 24/99 (24%), Gaps = 1/99 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP C    +   ++      S RC  C ++             T   A  P  G 
Sbjct  1   MI-ITCPECATRYDVEEARFEPNGRSVRCASCGESWFVPAPSPVEDLMTARRADPPQTGE  59

Query  61  QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSG  99
           +      R   +     C           + E       
Sbjct  60  RNTAQGRRAGSEPGRAECDDEPAPRRGWRKDEGEPRPRW  98


>WP_131900554.1 hypothetical protein [Jiangella asiatica]TDD99113.1 hypothetical 
protein E1269_27270 [Jiangella asiatica]
Length=137

 Score = 50.2 bits (116),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 18/106 (17%), Positives = 32/106 (30%), Gaps = 8/106 (8%)

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            + + V   F    +  ++ G   A  +SR +V G WW       L++ +      L   I
Sbjct  6    IAYAVRNGFAAQAVMMEDRGANDAFRRSRAVVRGQWWRATAVAALVVGLGAITGPLVGVI  65

Query  286  PYVGE--------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
              +                 +   P+  L    +Y DL        
Sbjct  66   LLLVSDGSFDLINLITAVVYVATIPYVALVLGYLYLDLSIRAEEDD  111


>PKP92064.1 thioredoxin, partial [Alphaproteobacteria bacterium HGW-Alphaproteobacteria-14]PKP97738.1 
thioredoxin, partial [Alphaproteobacteria 
bacterium HGW-Alphaproteobacteria-15]
Length=27

 Score = 47.1 bits (108),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 7/28 (25%), Positives = 12/28 (43%), Gaps = 1/28 (4%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSAR  28
           M  + CP CG     P + + +   + R
Sbjct  1   MI-IACPACGTRYAVPDAAIGSDGRTVR  27


>MBI3971862.1 hypothetical protein [Chloroflexi bacterium]
Length=231

 Score = 52.1 bits (121),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 31/200 (16%), Positives = 62/200 (31%), Gaps = 2/200 (1%)

Query  105  QLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATV  164
               A            LL +    +    A +   L                  +    V
Sbjct  16   YHHAWEQLRRSWARLLLLVLVGGLLPAGVAMLLLVLGQVAEAGGAVALGPVFVVLANVYV  75

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
              +L+ L++      + + + +                 +L   +L+L      +LLI+P
Sbjct  76   WLVLVPLTYGWSYAILRVARGEPPRVADAFAVFGRAYLPSLGAFVLVLAQIAAGMLLIVP  135

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR--FVLLLVISLTLSFLT  282
            G+       F  +++ D+ +G   A+ +S    +GH   I G       L+++  L  L 
Sbjct  136  GIRAQARLAFVPFLVVDEGLGAKAAIRESWRRSAGHGRTILGVSLLGAPLLLAGLLLGLV  195

Query  283  ARIPYVGEAANLAFSLLLTP  302
              +P +        SL    
Sbjct  196  GMVPTLLWTMLALASLYAAI  215


>WP_184195778.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Polymorphobacter multimanifer]MBB6226654.1 
hypothetical protein [Polymorphobacter multimanifer]GGI69219.1 
hypothetical protein GCM10007973_02900 [Polymorphobacter 
multimanifer]
Length=236

 Score = 52.1 bits (121),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 26/153 (17%), Positives = 49/153 (32%), Gaps = 2/153 (1%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
               V    +  S    ++   + +       ++         +   L L  +  G   LL
Sbjct  59   FWLVLVPTIIASLGQLAVVHLLLRPGAPPRAALAGAFAAWPVYLAALALAAIPTGVAMLL  118

Query  221  LIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            L++PGL           +         +  + +S  L     WAIFG F++ ++    LS
Sbjct  119  LVLPGLYVASRLLLVMPLAIVSPRGSPVAMVRRSWELTRPAGWAIFGFFLVAILGIFGLS  178

Query  280  FLTARI-PYVGEAANLAFSLLLTPFSFLYYYLI  311
             +   +   VG    L     L  F+      +
Sbjct  179  LIAGGVGSAVGSVLTLFGLAALGKFAAGLVAAV  211


>CAN74577.1 hypothetical protein VITISV_009110 [Vitis vinifera]
Length=1115

 Score = 54.0 bits (126),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 29/187 (16%), Positives = 58/187 (31%), Gaps = 6/187 (3%)

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
            LL+LV   G ++ +   L+   W+     +   ++ GGL+AL  S+ L  G+    F   
Sbjct  172  LLLLVSMMGVMIALGIWLVLSAWWNMGVVISILEDKGGLEALSTSQYLSKGNRLRGFALM  231

Query  269  VLLLVISLTLSFLTARI--PYVGEAANLAFSLLLTPFS----FLYYYLIYSDLKANYRGP  322
            +L  +    LS+ T  +   + G       +  L        ++ + + Y D K   R  
Sbjct  232  LLNFIWLYGLSWSTLHVRGSFSGRIVLAFVNTGLVCVGKVIKWVVFMVYYHDCKWRCREK  291

Query  323  QHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPD  382
                  +        +    L        + +Q  +          ++  L         
Sbjct  292  VDMEEVQGIXTEDMPVALRELNDNYPEAMIVQQQQNQLLGXFPPSFLKHILXEASFSPQL  351

Query  383  LNRSLPE  389
                   
Sbjct  352  SELPPNS  358


>NLN04541.1 hypothetical protein [Clostridiaceae bacterium]
Length=312

 Score = 52.9 bits (123),  Expect = 1e-04, Method: Composition-based stats.
 Identities = 35/288 (12%), Positives = 84/288 (29%), Gaps = 11/288 (4%)

Query  49   TDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLA  108
                       L   +   R   ++        +                       ++ 
Sbjct  1    MYRTRLKSGDYLDYAVEYYRQNFKTFFGMTLFFHIPVTFLSFFLLDIGNFVDNFTEMMMT  60

Query  109  DSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYIL  168
            D++E+  ++   L     L  +     + +  L               +    A      
Sbjct  61   DNFEMGMQQLLLLYLGITLYGLYNTTIVHAVSLGTIKHTYEKLVNGVDYTAKQAISYGFK  120

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
              + ++   + +      +    S  + L  + SF    ++ +++    +L LI+  + F
Sbjct  121  RLIWFVLYLIIVSFVTGFIYNIVSFVVALTFLSSFMDFGVINVIIGAQIALALIVAVVYF  180

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA-----  283
             +  +F  + +A + I   +AL  S  +  G    IF      +V    L F+       
Sbjct  181  MIRLYFIPHAIAIERIDCFKALGLSWKITKGRISKIFWPLCFGVVFCAALPFVIRSLNNF  240

Query  284  ------RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
                   +  +  A  ++ S +L P  +++   +Y  LK         
Sbjct  241  LVFSDPLVNRILAALIMSISSVLHPILYIHTTQLYIYLKHETGMIDFM  288


>NBO18873.1 DUF4339 domain-containing protein [Proteobacteria bacterium]
Length=274

 Score = 52.5 bits (122),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 31/168 (18%), Positives = 54/168 (32%), Gaps = 0/168 (0%)

Query  129  IVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG  188
              +        LLL                  +      L                    
Sbjct  97   HSIMTVFAGGILLLSTLMASILVRIMPAVVGSMLAWVLFLTFHYLFFACCMRLYRGQRFS  156

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
                 +     +G   L  +LL L++ GG + L++PGL+  +++ F  + + D  +  ++
Sbjct  157  RKFLDEQLFPMLGRLFLAGLLLGLMMVGGFMALVLPGLIVAIFYAFVPFFILDRQMNLIE  216

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
            A+  SRLL   H  A       L +I +    L   IP    A + A 
Sbjct  217  AMTASRLLSRKHRGAYQATIAFLTIIYVGCLILIVPIPVALPAYSAAL  264


>WP_145030715.1 hypothetical protein [Caulifigura coniformis]QDT54896.1 hypothetical 
protein Pan44_29350 [Caulifigura coniformis]
Length=355

 Score = 53.2 bits (124),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 34/349 (10%), Positives = 82/349 (23%), Gaps = 29/349 (8%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              V C  CG+  + P          ARC  C   L   PA   R  +  +    P     
Sbjct  3    IRVVC-ECGSRLDAPDEL---AGQQARCGACGAVLTIAPARKAREASAVDGVRLPPKLTG  58

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
            ++      E  +              + +     S    +  S        L        
Sbjct  59   KKKKKVFTEHNAPLQEEETRTGKRKKKKKSAEEMSAGPRKRRSLAQVWVDGLTFPFRREA  118

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            L    +   L    + +                +       ++  +     ++  ++   
Sbjct  119  LITTAVLAFLYGPIMLAMSFGPAMLVTGFYAIKFFIGFFTVSILIVGYFCYFLFQTLRCT  178

Query  182  ICKT---DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII--PGLLFCVWFFFCQ  236
                    V      +     +        ++ L +   S+            +++    
Sbjct  179  AQNERELPVATAFDFEEVRLDLWLMIGGTGMVFLPLIVTSIAFWWDGRSTPDALYYPLLA  238

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAI----------FGRFVLLLVISLT---------  277
            + L    +G + ++ ++ +L + HW  +              ++   +            
Sbjct  239  FCLFLWPMGVIASVLQTSVLAANHWTTLTTILKLPFQYTATLIVAGGLVAVAIGLDWVVP  298

Query  278  -LSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
             LSF    +  +        +  L   +      +   L    R     
Sbjct  299  RLSFKFGIVALLAGIIRGFLTWWLVFLTVTACMSLMGYLYYRNRNRIGW  347


>MBR89700.1 hypothetical protein [Verrucomicrobiales bacterium]
Length=310

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 31/203 (15%), Positives = 55/203 (27%), Gaps = 17/203 (8%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             +  +  L        I              Q  +  +  LL     I +          
Sbjct  118  NIFILGPLIGGFHLFFIRQIRGEACGFGDLFQGFSRNYLHLLLLPIAIGVISLVAMLPGI  177

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV------VGGGSLLLIIPGLLFCVWFF  233
            + I     G    +    +       L  L  L+      V  G  L+++        F 
Sbjct  178  LTIVLGATGFVEGLWDAFKQANETKGLAPLFTLIKASFGVVMAGIFLMVVGNAFVMTRFS  237

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
            F  +++ D  +    A+  S   VSG +W +FG             F+   I  +G+   
Sbjct  238  FAFWLVVDKRMSFSDAMGASSNKVSGQFWKVFGLV-----------FVAQLIGGLGQIVF  286

Query  294  LAFSLLLTPFSFLYYYLIYSDLK  316
                L   P  +     +Y    
Sbjct  287  GIGVLFTLPICWCAMASMYVRNF  309


>MBN33850.1 hypothetical protein [Rhodospirillaceae bacterium]
Length=97

 Score = 49.0 bits (113),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 19/94 (20%), Positives = 34/94 (36%), Gaps = 3/94 (3%)

Query  202  SFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW  261
               LL ++L +++  G  LLI+PG+   V F         +      A+ +   L S   
Sbjct  5    PVILLSLVLNVLIFLGLSLLIVPGVNLAVMFSVAIPACIVERQRIRSAMRRCAHLTSDDR  64

Query  262  WAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
            W  FG   + ++  +    +      V   A   
Sbjct  65   WPAFGLQAIFVISLIV---VVGGFNLVFAFAGAV  95


>MAF67884.1 hypothetical protein [Micavibrio sp.]
Length=243

 Score = 52.1 bits (121),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 14/34 (41%), Gaps = 0/34 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           + CP C ++   PS K+       RC +C     
Sbjct  3   LTCPECQSQFRVPSDKIGETGRKVRCGQCQHIWH  36


>OYU42076.1 hypothetical protein CFE44_26090 [Burkholderiales bacterium PBB4]
Length=147

 Score = 50.5 bits (117),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 26/144 (18%), Positives = 51/144 (35%), Gaps = 11/144 (8%)

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            +      M     +        +  G        + +++  ++VG G LL I+PGLL   
Sbjct  1    MMVGYMKMIKIEDEGGKPEIADVFKGFDDFVPALVSILVGSIIVGIGFLLCILPGLLLVA  60

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
                  Y++A     G+ AL+++   V G+           L+ +     +   I  +G 
Sbjct  61   ILPVAAYLVALGEKDGINALKRAWDAVKGN-----------LLSAFFCMLVLGIIGQLGL  109

Query  291  AANLAFSLLLTPFSFLYYYLIYSD  314
                   +L  P +F+  Y +   
Sbjct  110  ILCFVGVILTMPITFIGSYHMAKQ  133


>WP_137105721.1 zinc-ribbon domain-containing protein [Azospirillum brasilense]QCO05054.1 
hypothetical protein D3867_23075 [Azospirillum 
brasilense]
Length=216

 Score = 51.7 bits (120),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 14/111 (13%), Positives = 27/111 (24%), Gaps = 1/111 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  + CP C        S +  +    RC +C       P +             P    
Sbjct  1    MI-ITCPACDTRYTLADSAVGPQGRKVRCAQCGHMWWQSPQDDPVFHPDAVTEFHPVPPS  59

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSW  111
              + P  + +  +K        R   +     +R     +  +        
Sbjct  60   SAKTPPSKTKAAAKPAAAPGARRRALVGWGPSWRCCWPSVPPVISAAPPWC  110


>OGW32305.1 hypothetical protein A2X59_00640 [Nitrospirae bacterium GWC2_42_7]
Length=339

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 45/343 (13%), Positives = 93/343 (27%), Gaps = 28/343 (8%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            T+ C  CG E N     L    +  RC +C     ++    Q    +++  +        
Sbjct  2    TISCQKCGKELNVSERSL-KTGARFRCLKCGNLTTYEQLGEQSGIISNSSQSIHKEVYVH  60

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                    I +     ++ +R+    PE   R +    R    +              L 
Sbjct  61   DKSVSDSRISTGPQVVQQDDRTSSPSPEEIDRINDKYTRYGYLIPQTDKIEERSFMEHLP  120

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
              +   +      +     +     L          I+   +    +    M        
Sbjct  121  EAFSFPLKKGGILMLVIGSVFFTVILFFSKYAPLIGIIGFVLVSGYISAFMMKIVSRTAD  180

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLIL---------------------VVGGGSLLL  221
             + D+  +         +      +I+  L                     V+  G+L+L
Sbjct  181  GEIDIPDWPEFSDWWDDIILPWAQMIITALASFCPLIAYIVVSYVLGGRPSVLVIGALVL  240

Query  222  IIPGLLFCVWFFFCQY--VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
                         C +  + A + +   QA+ +        +       +++ V+   L+
Sbjct  241  AGAFYYPMALLAMCLFRGMHALNPVLLFQAIGQIFS----DYIIACILMLIIFVLKWGLT  296

Query  280  FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGP  322
            F +  IP+VG   +    L L         LIY   K   +  
Sbjct  297  FFSNMIPFVGPFIDNFLMLYLVMLEMRILGLIYYANKNKLQWF  339


>RLE93661.1 hypothetical protein DRN04_06455 [Thermoprotei archaeon]
Length=550

 Score = 53.6 bits (125),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 23/196 (12%), Positives = 67/196 (34%), Gaps = 6/196 (3%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
            L +++ +    S       +      +     +       +          +   +    
Sbjct  148  LAVIVFYTLGQSTCASFIYSLAEDSLKRGYCKLEEIWSKAVKNLPRVFIAILMKNLVVYV  207

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            + +   +      +    +L ++L  ++   +++L    +   V   + +  +   +   
Sbjct  208  IPISCFLAFIFLGLKYLDMLALILSTLMFLFAIILY--IIAAEVALIYVEPSIVIGSRDA  265

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP----YVGEAANLAFSLLLTP  302
            ++A ++S LLV  +     G F+L +++ + +  ++  +        EAA L   L++ P
Sbjct  266  IEAFKESILLVRQNISKALGYFLLQVLVGIIIGSISGALSNFHILFSEAATLILLLIVRP  325

Query  303  FSFLYYYLIYSDLKAN  318
               +    IY  L   
Sbjct  326  IFAVCLAGIYMSLTGR  341


>KMT09158.1 hypothetical protein BVRB_6g132640 [Beta vulgaris subsp. vulgaris]
Length=161

 Score = 50.5 bits (117),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 23/93 (25%), Positives = 41/93 (44%), Gaps = 0/93 (0%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            LVV   ++L ++  L   + +     V   +   GL+AL+KS+ L+ G     F   VL 
Sbjct  16   LVVLIIAILYLVGMLYMFLTWQLSDVVSVMEKNCGLKALKKSKELIKGKMGIAFAMLVLN  75

Query  272  LVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
            L++ +    L  ++   G  A + F +L     
Sbjct  76   LLLGIPTLILIQKLAGFGIVARILFGILSLVLW  108


>HIF06606.1 hypothetical protein [Gemmatimonadetes bacterium]
Length=76

 Score = 48.2 bits (111),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 11/35 (31%), Positives = 16/35 (46%), Gaps = 0/35 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            TV CP CG      ++K+P     A+C  C +  
Sbjct  3   FTVSCPSCGMSFPVDAAKVPEGGVRAQCNMCPEIF  37


>MBI1339012.1 hypothetical protein [bacterium]
Length=310

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 8/37 (22%), Positives = 12/37 (32%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V CP C        + +  +    RC  C  +  
Sbjct  1   MILV-CPSCETRYFADDASVGKEGRKVRCAACGHSWF  36


>NVM56621.1 zinc-ribbon domain-containing protein [Desulfobacterales bacterium]
Length=229

 Score = 51.7 bits (120),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 9/57 (16%), Positives = 13/57 (23%), Gaps = 1/57 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPH  57
           M  + C +C    N   + +    S  RC  C                       P 
Sbjct  1   MI-ITCENCKTRFNLDENLIKESGSRVRCSRCHHIFTAYKPAPAEEPWPGAEPPHPF  56


>WP_174541478.1 hypothetical protein [Methyloligella sp. GL2]QKP77328.1 hypothetical 
protein HT051_07600 [Methyloligella sp. GL2]
Length=367

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 39/225 (17%), Positives = 77/225 (34%), Gaps = 2/225 (1%)

Query  79   RRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFS  138
                            +       I   +   WE F +R W  L   ++  V++ A    
Sbjct  1    MWKTIPGAAPHLGRRDSFLPMNFPILGTIRYGWEAFKQRSWLYLAATVIFAVVSLAISML  60

Query  139  ALLLKPATWLNPQNQNWQW--AILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLG  196
            ++++                  +       +   +       ++        +  S    
Sbjct  61   SVMIDGLLNGWDGPDGIPQESFVGTIFSLALSTLVYMGVTGFYLKAFDAPDRVALSDLWQ  120

Query  197  LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLL  256
                 ++    +L  L VG G +LLI+PG++F + F F   V+ D  +G ++ALE+SR +
Sbjct  121  PHPFWNYLGASVLAGLSVGVGLVLLIVPGIIFALMFAFGTVVVMDRGLGPIKALEESRRI  180

Query  257  VSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
              GH W + G  +LL +++L           V    +    +   
Sbjct  181  TRGHRWRLLGFGLLLGLLNLAGLLAFGLGLLVTVPVSFLAGIYAY  225


>NMD37062.1 hypothetical protein [Christensenellaceae bacterium]
Length=273

 Score = 52.5 bits (122),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 25/153 (16%), Positives = 64/153 (42%), Gaps = 9/153 (6%)

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL------LLIIP  224
            L+ +   +  Y     + + R++ + L        +  L+   V  G+       + I+P
Sbjct  101  LTMLFSQIKNYFKALALFVIRALIITLWSAVPLMAIFALINFRVLSGNAASFLSPIFILP  160

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            GL   V +F   +++ ++N G L++L++S+ LV    + +F      +++    +++   
Sbjct  161  GLFAFVRYFNSPFIMVNENKGVLESLKQSKNLVKKREFPMFSMLAYFMLLEFAATWILGI  220

Query  285  I---PYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
                P +    + A  L +  +  +    +Y +
Sbjct  221  FDASPVISLILSQAVGLAIKAYYNISVAGMYCN  253


>MBD5440315.1 DUF975 family protein [Treponema sp.]
Length=163

 Score = 50.5 bits (117),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 12/155 (8%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
            +   A  ++    +         ++    F     GL          +  IL V   SLL
Sbjct  1    MILGAVAVIFTVAIXSVHNTMFKESRXXYFSDFTNGLXQWWQAFRGGLWFILWVFAWSLL  60

Query  221  LIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
             +IPG++    +    +V+A++  IG  +A++ S+ L  G+   +F   +  +  ++  S
Sbjct  61   FLIPGIVKYYSYSMMFFVMAENPKIGVCKAMQISKELTRGYKGELFVLDLSFIGWAILAS  120

Query  280  FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
                              + L P+ F+     Y  
Sbjct  121  IP-----------CGLGYIWLAPYGFMTKTNAYQY  144


>PYT09483.1 hypothetical protein DMF49_01935 [Acidobacteria bacterium]
Length=225

 Score = 51.7 bits (120),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 11/69 (16%), Positives = 15/69 (22%), Gaps = 0/69 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             + CPHC       + KLP       C  C + L                     C   
Sbjct  7   IQITCPHCQERYMVEAGKLPPSGGRFTCRICGEKLEIRTELLAPQPAVVESRGEVCCPRC  66

Query  62  RRIPSDRLE  70
                 +  
Sbjct  67  GNRFLPQQP  75


>HIM47199.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=104

 Score = 49.0 bits (113),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 11/45 (24%), Positives = 15/45 (33%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M  V CP C        + L  +  + RC +C  T    P     
Sbjct  1   MI-VSCPSCATRYLIDPTALGGEGRTVRCAKCSHTWHEQPPADMP  44


>ARU42874.1 hypothetical protein CCB81_01385 [Armatimonadetes bacterium Uphvl-Ar2]
Length=252

 Score = 52.1 bits (121),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 32/209 (15%), Positives = 63/209 (30%), Gaps = 12/209 (6%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                LL I  L   +      +A            +        +  V            
Sbjct  45   CAPILLMIVALLGFVFLVMFPAANSGGELPMFMMVSFQIVVLFGILFVYAAAGPGHIGCA  104

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC-VWFFFC  235
             + + I + +       K GL      ++  ++    V  GS    +PGLL+  +     
Sbjct  105  RLALAIRRREPLTVEMAKAGLPRFVPGSIAGLINYAGVLIGSYCCYVPGLLWGGLTTGAF  164

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
              +  D+N+    A+++S  ++    W           ++  L  L + I  +G  A + 
Sbjct  165  MAMAIDENLSAGDAIKESLEVMKPQMW-----------MAALLYLLVSMISSIGCIALIV  213

Query  296  FSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
              +   P   +   L Y D+K        
Sbjct  214  GLVFTMPLLSIIMGLAYLDMKFGIGQSDR  242


>MXW26595.1 hypothetical protein [Dehalococcoidia bacterium]MYA52876.1 hypothetical 
protein [Dehalococcoidia bacterium]
Length=262

 Score = 52.1 bits (121),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 19/131 (15%), Positives = 40/131 (31%), Gaps = 5/131 (4%)

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
             L      ++ + P L               D  G ++A+ +S  + SGH   + G  +L
Sbjct  132  WLTAVLALVIGLPPFLYVMTRLALALQTFILDEAGPVEAIGESWSMASGHMPRLLGVMLL  191

Query  271  ----LLVISLTLSFLTARIP-YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
                +  I + +SF     P  +         + +  F  +   + Y  ++      +  
Sbjct  192  ALVAVAGIQVGISFALGPAPDALRIILMGLIGIPVAVFGAVALTMFYLRIRETSPVREAG  251

Query  326  PIKRQWLPLTA  336
            P          
Sbjct  252  PRLEPAHGSGW  262


>WP_044217345.1 hypothetical protein [Flammeovirga pacifica]OHX64709.1 hypothetical 
protein NH26_24410 [Flammeovirga pacifica]
Length=273

 Score = 52.5 bits (122),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 29/191 (15%), Positives = 69/191 (36%), Gaps = 6/191 (3%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F          +   + L    I     +  A  L         + +      I++    
Sbjct  22   FNFWKTHFKIFFKSSLRLCVPFIVIGTAIAGAGELLFPENANAVSTISIFGNLIVIIGQI  81

Query  174  MTGSMFIYICKTD------VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
             + ++   + K        + +   +K    +  SF +   L ++++  G L  I+PG+ 
Sbjct  82   YSMTLAFSLVKESMVNTNSLTVDTLIKGAKTNTLSFIVAYFLYLIILLLGLLFFIVPGIY  141

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
              V F     ++A +      A+ +  LL+  +WW+ F   ++L  I+L ++    + P 
Sbjct  142  ISVTFALIFPIIAFEQKNIGDAISRCNLLIKDNWWSSFLFLIILSFITLFVTIFVLQFPS  201

Query  288  VGEAANLAFSL  298
            +     +  ++
Sbjct  202  LLVGVFIGLNI  212


>NLX23555.1 YIP1 family protein [Phycisphaerae bacterium]
Length=523

 Score = 53.2 bits (124),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 42/479 (9%), Positives = 104/479 (22%), Gaps = 23/479 (5%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR-  63
            RCP CG        +         CP+C Q       +      +    +C         
Sbjct  17   RCPECGTPFRPSEFEFAPGSVQFCCPDCKQAYYGTGPKGHLEPPSFTCVSCGRALDMDEM  76

Query  64   ---IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                     E +++       +               + +R    +     +    +   
Sbjct  77   VLLPTEGIEEDETRPETVPWLDTKLGRFRAWWNTVGMALVRPADLMRLLPLDSSVGQAIW  136

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               +     ++    +F    L  A  L           ++     + L L  +   ++ 
Sbjct  137  FGTLTHSLAIIGSFAVFMIFPLIVAMTLAGGGGPGGIEGVMLCTPLMALILVPILVLVWS  196

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +    + +                      ++     L + +  +   +       ++ 
Sbjct  197  CLVHGVLCITGKDVKPFSRTIQAVFYSAGANIITAIPCLGMYVGWIWNLIS-----QIIM  251

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
                 G+ A      +V            LLL I + ++ L   +  V   A+ A +L+ 
Sbjct  252  LKEGHGISARRAIAAVV----------LPLLLAIGVPVAILVPAVRGVQTMASGARALMQ  301

Query  301  TPFSFLYYYLIYSDLKANYRGPQHPPIKR----QWLPLTAAIFGWMLIPGLLLVSLSRQN  356
               S        +      +               L ++  +          +       
Sbjct  302  RQMSTSSVTAALTGYCQTRQSAWPSHAIELVTADLLRVSDLVDASTATMLEQVPIGDTTL  361

Query  357  LSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLS  416
               E L +A +D   +          +   L +        D                  
Sbjct  362  AQLEYLSAAEQDAIAKDVVDAMPPNVIAHRLGDFVFTYHGIDLANADPNLWLAILAPDPE  421

Query  417  LGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQ  475
                                 ++L ++  P           +       +    L  + 
Sbjct  422  RQKPASDPGSLDDSFPIFAGQIRLPVASAPANPPPVTKPIPLSNRTAALESQNALRAQH  480


>WP_183346504.1 zinc-ribbon domain-containing protein [Geomonas paludis]
Length=525

 Score = 53.2 bits (124),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 30/303 (10%), Positives = 68/303 (22%), Gaps = 17/303 (6%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              + CP C   R     K+P    +  CP+C    +FD   S      +     P     
Sbjct  5    IQITCPSCALSRLVSLEKMPQTAVTVTCPKCNTPFLFDAHVSAPLPAIETPRQQPEVTPC  64

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                S               + +         R +       S        +        
Sbjct  65   LLRESGGEPPGEIKAAKGTTDSTGTEPDLELERLATELQVISSGESKTGSLVLLIITVMF  124

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                 +        +    +L      +           L        G +         
Sbjct  125  FFSAQIVSTTWHEVLILVGVLFFHELGHMAAMKIFKYTDLKMFFIPFFGAA---------  175

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG----SLLLIIPGLLFCVWFFFCQY  237
            +   +          +  +G    +++ ++L +         L     ++  +  F    
Sbjct  176  VSGKNSNPTAVKSCIVSLMGPLPGVILSVVLYILFFLTKNYYLFKTAQIMMMLNVFNLLP  235

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            ++  D    +  L     +   ++  +F        + L  S     + + G        
Sbjct  236  IMPLDGGRFVDVL----FVNRRYFRFVFAFLGGAAFLLLAKSAGDFVLGFFGVITIYVAL  291

Query  298  LLL  300
              L
Sbjct  292  CNL  294


>RXH69783.1 hypothetical protein DVH24_007039 [Malus domestica]
Length=645

 Score = 53.6 bits (125),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 29/302 (10%), Positives = 72/302 (24%), Gaps = 12/302 (4%)

Query  67   DRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYL  126
             +L    +                           ++   +      F   G  L+ +  
Sbjct  351  HQLPYDFQPSLIVHAINLRTSLSYHSNPMISDISWALDVAIQILQVYFFTLGNFLIVLTT  410

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
            +             ++     L       +W   + T   I L  +  + ++  +     
Sbjct  411  IHAASTIYASNERSMIGLRNLLRSSVMKKRWHERMLTFFSISLLCNLSSAAVVYWHYARP  470

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            +          R       ++  L   +           ++  +          ++ +G 
Sbjct  471  LYPIGCPTRFFRLFRKIHWIVYGLAFAIWLDYSARWNLSVVVSIL---------EEKVGV  521

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA---RIPYVGEAANLAFSLLLTPF  303
             +A+  S  L  G+    F   +L  V    L  L A      +     +  F  L    
Sbjct  522  FEAISASSELTKGNRLRGFFLMLLYSVAKFNLPALVAREFTTIFAFYFLDTIFMFLGNVI  581

Query  304  SFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLL  363
             +    + Y D K  ++             L         +  L  +  +    ++ +  
Sbjct  582  IWEVLTVYYYDCKNRHQKAAAVTSFDTGNTLLFLSTLVNFLIVLTTIDAASTIYTSNERT  641

Query  364  SA  365
              
Sbjct  642  MF  643


>MSY59729.1 hypothetical protein [Actinobacteria bacterium]
Length=275

 Score = 52.5 bits (122),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 28/252 (11%), Positives = 65/252 (26%), Gaps = 24/252 (10%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
             + +     + L + L    +   +     +  N   +       +       L L    
Sbjct  20   WKTFRAFLRHWLALALPVFLVVDLIGAIALSESNDTARGIWAVAGVLATIVGSLLLQGAL  79

Query  176  GSMFIYICKTDVGLFRS--MKLGLRHVGSFTLLLILLILVVG------------------  215
                  +    V +            + +    L+L+++                     
Sbjct  80   TIAAEDVTDGRVDVAADTAFAQARGRLPALFGTLMLVVVFFVAAILGVLLVRALLGGVIA  139

Query  216  -GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
               + +++  G++  V       V+  +   G  A+E+S  +  GHW+ I     L  +I
Sbjct  140  GLLTAVVVALGIIVAVRLAVLAPVVVLEKRSGRAAIERSWEITQGHWFLILRVAFLSTII  199

Query  275  SLTLSFLTARI---PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
                + L   I      G   + A   +    +     L              P      
Sbjct  200  VGIPARLLGGIVSVAIPGFVGDYAGMAIPDALAAPLPALALVLTYFRIIRTDEPAPVEPV  259

Query  332  LPLTAAIFGWML  343
             P+ + +     
Sbjct  260  TPVLSPLEPSEP  271


>NND43154.1 hypothetical protein [Silicimonas sp.]
Length=257

 Score = 52.1 bits (121),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 17/37 (46%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  + CP+C A+    +S +P +    +C  C  T  
Sbjct  1   MRLI-CPNCSAQYEIDASLIPDEGRDVQCSNCGHTWF  36


>MBI4951725.1 zinc-ribbon domain-containing protein [Myxococcales bacterium]
Length=200

 Score = 51.3 bits (119),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 14/39 (36%), Positives = 19/39 (49%), Gaps = 0/39 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  V+C  CGA  N   S++PA     RCP C  + +  
Sbjct  1   MVKVQCEGCGAPYNVAESRIPATGLKMRCPSCGASFMVQ  39


>KKU18356.1 hypothetical protein UX28_C0001G0213 [Candidatus Pacebacteria 
bacterium GW2011_GWA1_46_10]KKU84153.1 hypothetical protein 
UY13_C0002G0065 [Candidatus Pacebacteria bacterium GW2011_GWB1_47_8]HCR80888.1 
hypothetical protein [Candidatus Pacebacteria 
bacterium]
Length=288

 Score = 52.5 bits (122),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 24/101 (24%), Positives = 53/101 (52%), Gaps = 0/101 (0%)

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
               K  +G+  +++ G +      +  +++  +V GG   L++PG++F + F F  + + 
Sbjct  114  AAGKKKIGIGEALRRGWQQFIPVLVAGLVMAFMVFGGLFPLVVPGIVFAILFNFALFEIV  173

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
              N   +QAL  S  +V  H+W + GR V+L+ +++ +  +
Sbjct  174  LHNASPMQALRNSAGMVKQHFWPVVGRLVVLMALAVGIEVI  214


>HHV09035.1 DUF975 family protein [Clostridiales bacterium]
Length=221

 Score = 51.7 bits (120),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 29/232 (13%), Positives = 63/232 (27%), Gaps = 19/232 (8%)

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLV  257
              G   LL +L+ +     +LLLIIPG++    +    Y+L ++  +   +A+  S+ ++
Sbjct  1    MFGRAFLLRLLMTIFTFLWTLLLIIPGIIAVYSYAMAPYILEENPGMTATEAITCSKEMM  60

Query  258  SGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD---  314
             G+ W +F   +  +   +   F                 L L+P+  +     + D   
Sbjct  61   RGNKWRLFCLQISFIGWVILCIF-----------TCGIGFLWLSPYMAMAEVAFFYDVSG  109

Query  315  ----LKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQ  370
                    +         +                     +       +      G    
Sbjct  110  KFSAPNGQFGQNGQYNQGQYNQYQYNQHEYNQDQYNQDQYNQQYNQYQSNGYQQNGYQQN  169

Query  371  QRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTL  422
                   QQ         + P        +    +     ++      P  L
Sbjct  170  GYQQNGYQQNGYHPNEYQQNPSGFMQNANEQPEQENTGNPTDPVDPNNPTNL  221


>KAF9613703.1 hypothetical protein IFM89_010145 [Coptis chinensis]
Length=433

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 36/233 (15%), Positives = 78/233 (33%), Gaps = 11/233 (5%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             + +L  ++    +    L      +     ++    L  T     +   W    +    
Sbjct  88   ILLILVEMVFLLLMTVVSLFSMVATIYVSAMSYLGKSLTLTDLCFGIKKIWTRPVITWLY  147

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                +  F  + L L  V       ++++ V     L +I   L   + +     +   +
Sbjct  148  ISLCIAGFTLLILPLCFVLILGRGSMIVVAVGILVFLPVIFLNLYLSLVWMLSLVMSVLE  207

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP----------YVGEAA  292
            +  GLQAL K+  LV G          L++++      L + I            +    
Sbjct  208  DCYGLQALGKAEKLVKGRKVVGLVLTFLVMILY-IPGILFSMINKNQQTVTTQILIAYGV  266

Query  293  NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIP  345
             + F++L+   S + Y + Y +LK ++          ++ P  A  F  ++ P
Sbjct  267  TVNFTILVKILSMMAYTVFYFELKESHGEQVVVEEDMEYRPAWATDFMRVVRP  319


>MBI2594007.1 hypothetical protein [Candidatus Daviesbacteria bacterium]
Length=237

 Score = 51.7 bits (120),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 41/199 (21%), Positives = 75/199 (38%), Gaps = 7/199 (4%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             ++LL   L    +   ++           +     I       IL     +  +    +
Sbjct  36   VVFLLTYFLKAPFLNILMIPVVMLMQITLLKTHTTPIEKFDYEQILALKDPLLKNKIWRL  95

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL---FCVWFFFCQYVL  239
              T +  F  + +     G     + L    +    L +I  G+L   F V+++F  Y++
Sbjct  96   VWTYIVYFFLLFIVSLPAGIAGAFIALKYENIALLYLAIIPLGILLTPFIVYWYFFAYIV  155

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI----PYVGEAANLA  295
             D N+ G+ AL+KS+ ++  HWW  F  FV+  +IS  +S +   I      +G      
Sbjct  156  LDQNLKGIAALKKSKEMIKDHWWKTFAIFVIAAIISGVISSVITSIGGQDALIGGLLMAV  215

Query  296  FSLLLTPFSFLYYYLIYSD  314
            FS L + +        Y D
Sbjct  216  FSGLSSVYFGYVAVAYYFD  234


>PWT71969.1 hypothetical protein C5B60_10110 [Chloroflexi bacterium]
Length=464

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 32/188 (17%), Positives = 56/188 (30%), Gaps = 2/188 (1%)

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
               T+    L  S     +        +    +  + L   GS     + +  +     +
Sbjct  277  WRPTLGASFLAGSAYYLVLLPAAVSYVIVYMTASGINLCQAGSAASTALTVGCLASIFGI  336

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL-  278
              +I  LL  V   F  Y+ A  N+   QAL +S  +  GH+W  FG  ++    +  + 
Sbjct  337  FGVIAALLLVVRLGFAPYIAATSNVTIRQALARSWEITRGHFWTAFGVILVTGTAAYLVL  396

Query  279  -SFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAA  337
                          A     +   P   L Y ++  DL     G      +   +  T  
Sbjct  397  QFAAAFPPALAIFVATPIAYIFTVPLVSLTYIVLLYDLWLRRDGYAALTQEPGAVVGTRP  456

Query  338  IFGWMLIP  345
                  I 
Sbjct  457  PTSVPPIG  464


>RZM07946.1 hypothetical protein EOP67_72485, partial [Sphingomonas sp.]
Length=123

 Score = 49.4 bits (114),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 8/40 (20%), Positives = 13/40 (33%), Gaps = 1/40 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M    C  C      P + +  +  + RC  C  +    P
Sbjct  1   MIL-ECTECETRYVIPDTAVGPEGRTVRCANCKHSWFQQP  39


>OGP62797.1 hypothetical protein A2170_16600, partial [Deltaproteobacteria 
bacterium RBG_13_53_10]
Length=280

 Score = 52.1 bits (121),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 13/65 (20%), Positives = 20/65 (31%), Gaps = 1/65 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + C  C  + N   SK+P   S  RC  C            + Q  ++  +      
Sbjct  1   MI-ITCTSCMTKFNLDDSKIPKTGSKVRCSRCKHVFHVAFPPESQEQVIESFESFVKYHE  59

Query  61  QRRIP  65
               P
Sbjct  60  DLMEP  64


>NTU99436.1 hypothetical protein [Candidatus Falkowbacteria bacterium]
Length=250

 Score = 51.7 bits (120),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 40/219 (18%), Positives = 97/219 (44%), Gaps = 11/219 (5%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
             ++  R +  + + ++  +L+ A                + +    A         +L  
Sbjct  31   RIYASRTYLFVVMAIVPFLLSSALDHIFNYANGLPGGMSRVRLMLIAAAFLLFVIAVLVY  90

Query  172  SWM-TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
             +     M     K +      +K        +  L +++ L++   + LL+IPG++F V
Sbjct  91   VYSQIIMMLSLGKKKNAPFSEILKDAHPLASKYLYLFVMVTLLLIMWTFLLVIPGIVFGV  150

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG-  289
            ++ F  Y+L ++ I G++A+ +S+ LV+G+W  + G+F+LL+++++ ++ +      +  
Sbjct  151  YYLFANYILVNEKIEGIEAIRRSKELVTGYWLQVAGKFILLILLNIFIAIIFGIPLMLMH  210

Query  290  ---------EAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
                     + A    + L +P   ++ YL+YS+L+   
Sbjct  211  QGTTSYALVQLALGCLTSLFSPIIIIFGYLLYSELRTIK  249


>OGK09556.1 hypothetical protein A2767_06010 [Candidatus Roizmanbacteria 
bacterium RIFCSPHIGHO2_01_FULL_35_10]OGK42959.1 hypothetical 
protein A3A74_05830 [Candidatus Roizmanbacteria bacterium RIFCSPLOWO2_01_FULL_35_13]
Length=423

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 61/311 (20%), Positives = 116/311 (37%), Gaps = 2/311 (1%)

Query  87   LQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPAT  146
            ++  +E       + S +        L+ R  +    +    ++L F+ I          
Sbjct  1    MKKPQELLRESWQIFSRNWKTYIKILLYSRLFYFPALLITGLLILLFSLILRPDRQSDLF  60

Query  147  WLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL  206
             L          +L      +    + +T  +F Y  ++  GL    K  LR+V    L+
Sbjct  61   DLPKIITLGLPFLLFLIFVILAETCTIITQLIFFYSPESYTGLTDLYKKSLRYVWPMLLM  120

Query  207  LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG  266
              L  L++ GG +L IIPG++F VWF F ++ L  +N    ++L+ S  LV   +W IF 
Sbjct  121  SGLGGLMIMGGIMLFIIPGIIFLVWFSFSRFFLLFENKKVYESLDASGRLVKARFWEIFM  180

Query  267  RFVLLLVISLTLSFLTARI--PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
            R + + ++   L++L   I    +     +   L+L P   +Y  L+Y + K        
Sbjct  181  RILTVYLLFYALTYLPLLIRSGTIKLILAIIILLVLNPLVNIYLILLYKNAKELETVQPA  240

Query  325  PPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLN  384
               K   L         +L+    +       +++   L  G + +     +   T  L+
Sbjct  241  FEEKHSRLLYWVIPLFALLLVSGAIGGYMFLKINSPSDLPHGNETRIISENKIIPTKYLS  300

Query  385  RSLPEEPQRLS  395
             +  E      
Sbjct  301  PTPIENNYWDP  311


>HHS69818.1 zinc ribbon domain-containing protein [Candidatus Bathyarchaeota 
archaeon]
Length=288

 Score = 52.1 bits (121),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 41/233 (18%), Positives = 83/233 (36%), Gaps = 5/233 (2%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            Y     L   P+      +               + +      ++   +  G     +  
Sbjct  48   YWAVSTLPPFPMMDFTASEALFDWLLSLSKVLLVVGVVGWILTVIVQGYTVGYSSTLLTG  107

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
             D+     ++  +R         ++  L++G G LLLI+PG++F V F     V+  ++ 
Sbjct  108  EDLKSGEIIRRIVRKTPRLLAASLIANLLIGIGLLLLIVPGIIFAVMFSLIVPVIVLEDG  167

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY----VGEAANLAFSLLL  300
              L++L +SR LVS  W        +L+    T   L + I      +G  A++A +  +
Sbjct  168  KILESLSRSRRLVSRRWGKTLALIAVLVAAISTSEILGSLIAIAFEPIGYLASIAITATV  227

Query  301  TPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLS  353
             P   +    +Y  +KA     +   I+        A +       L  +++ 
Sbjct  228  EPIYPVAVACLYYSMKA-KEASEREEIREATPRPIPARYCIECGEPLSPIAVY  279


>OYV09983.1 hypothetical protein CG446_1323, partial [Methanosaeta sp. ASO1]
Length=214

 Score = 51.3 bits (119),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 25/142 (18%), Positives = 47/142 (33%), Gaps = 0/142 (0%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
            LG+++    +    +     +L           ++  V  I L        +        
Sbjct  73   LGMLIFAGYLGLMTIFADGGFLALSFMASILLFVILMVVLIFLAEGASIEMIRQASMGRA  132

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
              L  + K   R++    L  +L  +++  G     IPGL+    F+F   ++  D   G
Sbjct  133  ADLSAAWKSTRRNLEPLVLTSLLAGIMIALGYAFFFIPGLIMSFAFYFITQLIVIDGKSG  192

Query  247  LQALEKSRLLVSGHWWAIFGRF  268
            L AL+ S   V  +        
Sbjct  193  LDALKASYRFVEANLSDSLIVV  214


>WP_182098911.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Enhydrobacter aerosaccus]
Length=220

 Score = 51.3 bits (119),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 34/220 (15%), Positives = 71/220 (32%), Gaps = 12/220 (5%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
              LG+    A +   L+    +++        +    A +         +  S F  +  
Sbjct  1    MRLGLEYFTAFMVCGLVFTTPSFMLEMRGVGGFPKFAADILGNAAVQICILCSTFEALAG  60

Query  185  TDVGLFRSMKL-GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD-D  242
                +  ++     R++    +  ++  L+V  G  L  IPGL     +      L   +
Sbjct  61   RMPNVSGTLWQIQRRNLAKLLIFSMVQALLVILGLALFAIPGLYLMTLWAVALPALVLIE  120

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLV----ISLTLSFLTARIPY------VGEAA  292
            ++  L A  +S  L  G  W +FG  V  ++         S L   +P       +    
Sbjct  121  DMPFLDAFRQSASLTKGRRWRVFGVVVACILAAVVAFGLASLLVRFVPIAIERAELRTMV  180

Query  293  NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWL  332
                + ++  F +    ++Y  L+    G     I   + 
Sbjct  181  LWPVTAIVAAFLYPVPAVLYVLLRQEKEGLTVEEIVEPFY  220


>NLW78468.1 DUF975 family protein [Ruminococcaceae bacterium]
Length=362

 Score = 52.5 bits (122),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 32/306 (10%), Positives = 71/306 (23%), Gaps = 12/306 (4%)

Query  101  RSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAIL  160
                   A +          L  ++      A     +   ++    L P      +   
Sbjct  68   FFYFMSYAAATVTAFAFVVMLYSLFRFIFGAAINLGVNQFTMRLVLRLEPHRVGTLFFRF  127

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
                  +L             +                     T   I+L+++    ++ 
Sbjct  128  HIFGRALLTRFLTSLFIFLWGLAFYLPATILIGISSGAFYIYGTGYYIVLLILGILLAIG  187

Query  221  LIIPGLLFCVWFFFCQYV-LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
              I  +     +    Y+     ++G L+A+ +S+ ++ G+   +FG     +   +   
Sbjct  188  GSIFLIAVQYRYAMAPYLLSQYPDMGALEAIRQSKAIMKGNKGRLFGLHFSFIGWYILCL  247

Query  280  FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIF  339
            F                 L L+P+        Y ++          P      P      
Sbjct  248  FTFG-----------VGYLWLSPYVKTAEAAFYLEITGQLPAWVKFPATVPGYPQAPPPG  296

Query  340  GWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADY  399
                 P     +  +  +       A    Q      P  T      LP       + + 
Sbjct  297  PPPAGPAQAPYTYGQPPVPPTAPPQAASPPQGAPMESPPSTMQQAPPLPPSSIYSPAPED  356

Query  400  KLLLSK  405
                  
Sbjct  357  PEAPRP  362


>CAA9225334.1 hypothetical protein AVDCRST_MAG93-630, partial [uncultured Chloroflexia 
bacterium]
Length=234

 Score = 51.7 bits (120),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 29/200 (15%), Positives = 64/200 (32%), Gaps = 13/200 (7%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD-  186
                A   +  +             +    ++  A   Y+ +        +     +   
Sbjct  29   VFPAAVLAMLGSAPYYFIEGQIHVREQILTSLTGAFAFYLYIVYVAYAEEVTAQAERGAE  88

Query  187  ----VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                  +   ++   R   S     +  I++     +LL +PGL     +     V++ +
Sbjct  89   RITTRSVLHMLRQATRIAPSAMAAAVAAIIIPRATIILLAVPGLWVLTRWSLFAPVISRE  148

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI-------SLTLSFLTARIPYVG-EAANL  294
            ++G + AL++S  LV GH+  +F    L  V+            FL +     G      
Sbjct  149  HLGPVAALKRSNELVRGHFELVFLTAALAAVLEEVAVHMGGVAGFLISGSDTWGHWLGGS  208

Query  295  AFSLLLTPFSFLYYYLIYSD  314
              ++L  P +     + Y+ 
Sbjct  209  ITTVLTIPLAAYATSVTYTH  228


>HHD11436.1 DUF3426 domain-containing protein [Deltaproteobacteria bacterium]
Length=502

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 16/36 (44%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  ++C +CG +     SK+  K +  RC  C    
Sbjct  1   MI-IQCENCGTKYQLDDSKIGDKGAKVRCSRCKHVF  35


>MBF0519763.1 hypothetical protein [Nitrospirae bacterium]
Length=247

 Score = 51.7 bits (120),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 23/122 (19%), Positives = 48/122 (39%), Gaps = 0/122 (0%)

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
             +      +   +         I    +    ++   L+   S      L  +++  G +
Sbjct  35   NILNFYMQMFVHAMTLAMARELIENGTLSYRSAIAEVLKKFSSLLFSGTLFGILLVAGLM  94

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            LLI+P +L   +F F   ++  +N     A+ +S  LV  +  A F  +V L   ++T+S
Sbjct  95   LLIVPAILINFFFMFTYVLIIKENKHSFTAIRESIKLVRRNLNAAFSVYVTLFTTAITVS  154

Query  280  FL  281
             +
Sbjct  155  IV  156


>TEY47489.1 hypothetical protein Saspl_039123 [Salvia splendens]
Length=401

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 38/263 (14%), Positives = 76/263 (29%), Gaps = 25/263 (10%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
             +            +  + LL +                T    +   +  A ++  +A 
Sbjct  19   FSWKKLFTQITIATIFPLSLLSLFHDLLFSNILHNSFNNTPNKWRFCLFNAAYIVLFLAL  78

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV------------  214
             L+  + +  ++      ++     +M++ LR     T  L+L +L++            
Sbjct  79   SLVSTTAVVEAVARVYTSSEPRFAEAMRVVLRVWKRVTATLLLCLLLIVTYNAASFALIS  138

Query  215  ------GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
                       L     +   V +     V   ++ GG  A+ K   LV G      G F
Sbjct  139  PWIPYSLIFFSLYAAGFVYISVVWHLASVVSVLEDRGGADAVIKGLGLVKGRVVISGGVF  198

Query  269  VLLLVISLTLSFLTARIPYVG-------EAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
             LL +  + +  +  R   VG           L     +T    +   ++Y + K  +  
Sbjct  199  FLLNLCFVWVEVVFERCVVVGIGRRIGCGIVCLLVLCGITLLGLVTQTVVYFECKLYHGE  258

Query  322  PQHPPIKRQWLPLTAAIFGWMLI  344
                PI               LI
Sbjct  259  IIIIPIANSKDARVDLGDYAPLI  281


>GEV76394.1 hypothetical protein CTI12_AA110850 [Tanacetum cinerariifolium]
Length=154

 Score = 50.2 bits (116),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 18/105 (17%), Positives = 38/105 (36%), Gaps = 1/105 (1%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
              ++   +V      +L++  + F V +     V+  ++  GL AL +S  LV G  +  
Sbjct  3    GYVVDYNVVYVILGCVLVVIMMFFYVNWSLAYVVVVVESKWGLSALIRSWYLVKGMKFVS  62

Query  265  FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
            F   +   V    +  + +    +G       S+ +  F      
Sbjct  63   FVIMLYFGVFGGLIVLMCSAGN-IGVLVITLGSIFIMMFWLRITA  106


>WP_193388356.1 zinc-ribbon domain-containing protein, partial [Anaeromyxobacter 
sp. PSR-1]
Length=120

 Score = 49.4 bits (114),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 11/40 (28%), Positives = 16/40 (40%), Gaps = 0/40 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
            V CPHC A      +++       RCP C Q+      +
Sbjct  2   RVTCPHCNAAYKIDDARVTPAGVKVRCPRCHQSFPVRRPD  41


>HBE82154.1 hypothetical protein [Blastocatellia bacterium]
Length=586

 Score = 53.2 bits (124),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 24/185 (13%), Positives = 61/185 (33%), Gaps = 1/185 (1%)

Query  102  SISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILL  161
             I       + L        + ++   ++       +  +      +           L 
Sbjct  258  MIYTEHISKFLLLSLVFHVPMILFTAVLITLSFLKVNESIGTTTANITMGITGSVMYFLT  317

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLI-LVVGGGSLL  220
               AY+++G      +  + +    + L  ++    +   +F    IL   L +  G++ 
Sbjct  318  GFCAYLIIGTITWIVTQSMAVPLRPIKLRPALSEARKKWRTFAGTGILSTVLTLVIGAVT  377

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
              I   +  V +     V+  +N+ G QAL++S+ LV            ++ +I    + 
Sbjct  378  CGIGFFVTTVLWTLVGPVVMMENLRGRQALKRSKDLVKRSLATSVAAVAIMFLIPAVSAG  437

Query  281  LTARI  285
              + +
Sbjct  438  TISFV  442


>MAM76783.1 hypothetical protein [Tistrella sp.]
Length=281

 Score = 52.1 bits (121),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 11/56 (20%), Positives = 12/56 (21%), Gaps = 1/56 (2%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP  56
            M    CP C          L       RC +C      DP               P
Sbjct  63   MIL-TCPACSTRYTLDPQSLGPDGRKVRCTQCGHVWHQDPPADMPRPLALPRDENP  117


>HGW15319.1 hypothetical protein [Geobacteraceae bacterium]
Length=300

 Score = 52.1 bits (121),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 12/79 (15%), Positives = 19/79 (24%), Gaps = 0/79 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP C A  N   SK+P +     CP C          +                   
Sbjct  2   KIECPACSASGNIDESKVPEEGRRVVCPRCSTHFDVRKERTGAEIIQQRQRMVCPKCGCE  61

Query  63  RIPSDRLEIQSKTVNCRRC  81
           +   +   I    +     
Sbjct  62  QPLLETCAICGIVIKHYIQ  80


>WP_075726472.1 hypothetical protein [Corynebacterium aquilae]APT84909.1 hypothetical 
protein CAQU_07350 [Corynebacterium aquilae DSM 44791]
Length=331

 Score = 52.5 bits (122),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 23/180 (13%), Positives = 60/180 (33%), Gaps = 3/180 (2%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L+G+ +  IV       S  +              +  +  A   ++     ++ G   +
Sbjct  133  LVGLIISAIVTPMVFNVSLKMTSGVRAEIKDLIPARGWLNSALAVFLYSLPVFLIGIGAL  192

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV--WFFFCQYV  238
             +              L   G       L+ + +G  + ++++  ++F +  +F     +
Sbjct  193  NLLGKQ-AFREEFMSHLSGYGELDNPWPLIQIFLGLFAFIVLMALVIFLLSPFFALWPVL  251

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
               ++I  + AL +S  L   ++  + G      +++           +VG  A    S+
Sbjct  252  PLLEDISLIDALGRSIKLGKQNYGGLLGLTFFTQLLAQFSGQAFQVTAFVGSIAGSVASM  311


>WP_142407150.1 hypothetical protein [Mycobacterium sp. EPG1]
Length=473

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 36/229 (16%), Positives = 66/229 (29%), Gaps = 6/229 (3%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
             +F     G L    +G+ + F  I +A  L     L   +       +      +L  L
Sbjct  244  SMFESSDPGGLITGFIGMWIVFMLILAAFALPADALLLGLSVIAADKAVRGHRVRLLEVL  303

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
            +   G +   +  T         +            +L   +     + + I      + 
Sbjct  304  AAAKGRIRAVLGLTLCFYGIIFAVDAIAFAIVMAFFLLFPPLGVIALVAVFIASFAVGIL  363

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI------  285
            F     VL  +  G L +L+++  L       +FG   L  V    L F+   +      
Sbjct  364  FSLAPIVLIVEKRGALDSLKRAAQLSKPAAGRLFGIHSLWAVCVSPLLFIPGAVVSLILG  423

Query  286  PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
            P  G          L  +  +   LIY+DL+      +           
Sbjct  424  PIGGGLFFAIAFGALIAYFRVLQMLIYTDLRIRQENFETELHAEWSQRP  472


>MBE2197726.1 DUF4013 domain-containing protein [Anaerolinea sp.]
Length=419

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 31/311 (10%), Positives = 68/311 (22%), Gaps = 13/311 (4%)

Query  5    RCPHCGAERNTPSSKLPAKKSSAR-CPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
             CPHCGA     +        +         T           Q        P+      
Sbjct  102  NCPHCGAALRPGARFCGKCGQALTAVSNPGATWKMPDIAPPPYQLPPAYEPQPYTPPAYE  161

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
              +   +         +               +         L       F    W +  
Sbjct  162  PQAYAPQPYEPQAYTPQPYTPPVPPQYATPYKAAGSNSGKLNLSEALGFPFQAANWVMTF  221

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            I    + L        L             +    +                      + 
Sbjct  222  IIGSVLWLLPLIGMILLNGYAVEVARRVIHDHPDTLPD---------WDDWGTKFRDGLA  272

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
             + +GL  S    L     F LL      +        ++  +L   +F         + 
Sbjct  273  VSVLGLIWSFIPFLIFSLPFILLTYTGRGLPVVFMGAGVLLQILATYFFMPAVMGRYAET  332

Query  244  IGGLQAL--EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA-FSLLL  300
               +  L  +     V G + +  G ++L  ++ +    +   + ++     +    +  
Sbjct  333  GNFMAGLQVQAIVAQVMGRFGSYLGNWLLAGLMLIIGLTILGVLSFITNVTIVFCIGICG  392

Query  301  TPFSFLYYYLI  311
             P  F   + +
Sbjct  393  IPLIFAASFHM  403


>TVQ98381.1 hypothetical protein EA398_13415 [Deltaproteobacteria bacterium]
Length=510

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 12/37 (32%), Positives = 17/37 (46%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V+CP C +       K+PAK  + +CP C     
Sbjct  36  MI-VQCPACSSRYRIADEKIPAKGGNLKCPSCGHAFF  71


>RLC26940.1 hypothetical protein DRH56_03745 [Deltaproteobacteria bacterium]
Length=342

 Score = 52.5 bits (122),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 33/284 (12%), Positives = 73/284 (26%), Gaps = 19/284 (7%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              +RCP C   ++ P +++P     A CP C Q   F   ++      +N A       +
Sbjct  61   VEIRCPFCRFSKSVPRARIPEGVRRATCPRCGQRFPFSLEDTGEMPDPENRAAPGASAGE  120

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                       S   N  R      +                +  L          G   
Sbjct  121  ESPEEKTRRSGSPWENRSRLGLWQGIYQTFRSVLFSPAAFFSTLELHGGIREPLAFGILS  180

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA-YILLGLSWMTGSMFI  180
              +  +  +     IFSA+L      L  +       +++            +++  +  
Sbjct  181  GSLGTMSGLFWQFLIFSAVLFTFGGPLLGRIGVHLVFLMVLVSIPVFAALSLFISSGILH  240

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
               +         +   R V       I  ++   GG +  +   ++  +          
Sbjct  241  PALRLVGSGKNGFEATFRVVSYSHAAGIWSVIPFVGGGMAGVWQLVVQVIG---------  291

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
                     L ++         A +    +L++       +   
Sbjct  292  ---------LREAHETTYPRVIAAYLIPAVLVLCCFVAGLIFLF  326


>MBI4393811.1 protein kinase [Euryarchaeota archaeon]
Length=565

 Score = 52.9 bits (123),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 27/176 (15%), Positives = 59/176 (34%), Gaps = 2/176 (1%)

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
               +   + K        + L L      T +  L           L +  +   V +  
Sbjct  112  LAFVGPRLSKAREACGTLVALNLFVDFIPTFVSGLSGGAAFIAGFALFLVTVYLGVRWML  171

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG--EAA  292
               V+  ++  G +A+ +S+ L+ G WW++F    L+ + S+  + L +    +     +
Sbjct  172  ADAVILSEDASGNRAIPRSQQLMRGSWWSVFLAMGLVSIPSIVFAALLSSPNGLLPQLIS  231

Query  293  NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLL  348
                ++ + P + +    +YS L A          +       AA  G   +    
Sbjct  232  AFVPAMAIYPAAGVLTVSVYSALVAEAPQTHERRHESVRSDPPAAAPGAWPVRPAP  287


>WP_137150880.1 zinc-ribbon domain-containing protein [Devosia sp. FKR38]
Length=402

 Score = 52.5 bits (122),  Expect = 2e-04, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 13/38 (34%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  + CPHC  +       + A     +C  C +    
Sbjct  1   MI-ISCPHCQTKYQVTYEAIGATGRKVQCAHCQEAWQQ  37


>TAL36625.1 hypothetical protein EPN93_07315 [Spirochaetes bacterium]
Length=536

 Score = 52.9 bits (123),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 34/360 (9%), Positives = 79/360 (22%), Gaps = 6/360 (2%)

Query  2    PTVR-CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
              +  CPHC +  +        ++   RC  C    + D    Q           P   +
Sbjct  36   IKITLCPHCNSSYSISFRASKDERYRLRCRRCENFFMVDFPAIQDPMEAGPRPVVPAAHV  95

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
            +          + +       +       +     +     + +                
Sbjct  96   RTAPAVAPARAEYRNGPPVLPDTIEARPRQAIPATAAVRAAAPAAPAFSERRAAEPAIQA  155

Query  121  LLGIY---LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
               I       + +                     +   +          +   +     
Sbjct  156  PPVIMRERPSPLPVREVRPRVVTPEPGLAQNKVVFKGLTFRDFSLKELAAVALSALTRVK  215

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
            +        V +   +      + + T       +V         +   L  ++      
Sbjct  216  LVTAASGIFVSVMLLLAANRLGLSTLTAGSSAPEIVRLALYFAPAMLLFLIYIFTASLIA  275

Query  238  VLADDN--IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
                D+   G      +     +G    +     LLL++  T+  L   IP VG  A   
Sbjct  276  RATLDHVFTGADGGAPRMLSFAAGSLPQVCVNSALLLLLGNTVILLIGNIPLVGPVAYSL  335

Query  296  FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQ  355
              +     S +   ++            +         L    F      GL+  +L+  
Sbjct  336  LFIPAYLISIMLIIVLVVGAWFYPPIIAYREPGIFKNTLHLYHFIRRHNLGLIPAALTLF  395


>TNE54099.1 hypothetical protein EP341_05890, partial [Sphingomonadales bacterium]
Length=272

 Score = 51.7 bits (120),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 12/65 (18%), Positives = 19/65 (29%), Gaps = 1/65 (2%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  + CP C      P + +     + RC +C  +   D  E +            H   
Sbjct  52   MI-IACPACRTRYVVPDTAIGIDGRTVRCAKCKHSWFQDGPEVEAKAPPPPPPIEEHEVE  110

Query  61   QRRIP  65
                P
Sbjct  111  APPPP  115


>KJV06413.1 hypothetical protein VZ94_11375 [Methylocucumis oryzae]
Length=364

 Score = 52.5 bits (122),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 33/207 (16%), Positives = 63/207 (30%), Gaps = 12/207 (6%)

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS------  279
            ++  +       V+  +N  G  AL+ S  LV GHWW     F+   ++ + L       
Sbjct  1    MIIGLSMSLYLNVIVVENKSGFAALKTSHSLVWGHWWRTLTVFMAPGLVFIILYAAILSA  60

Query  280  -FLTARIP-----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
              L   +      ++        S L+ P+     Y+ Y DL       +        + 
Sbjct  61   VLLFQYLGLNELDWLVSIIANLLSALIAPYFLGLGYVQYHDLNVAQNRCRFSSALGGLMS  120

Query  334  LTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQR  393
            +       +L   L   S++ Q  S   L+         L ++       + +       
Sbjct  121  INCWQPLVLLSVLLAAQSVATQQQSHASLMELTVTRSNPLNSEGLAELLASVTTINSDAD  180

Query  394  LSSADYKLLLSKQRKTTSEGGLSLGPV  420
                  K LL+  +    E   +    
Sbjct  181  KPWDALKALLAWLKSLNPEHYEAHYQW  207


>MBC8559302.1 hypothetical protein [Clostridiaceae bacterium NSJ-33]PWL42686.1 
hypothetical protein DBY45_08510 [Clostridiales bacterium]
Length=383

 Score = 52.5 bits (122),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 39/341 (11%), Positives = 92/341 (27%), Gaps = 13/341 (4%)

Query  89   PEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWL  148
                       L   +  +             L      G +L      + + L     L
Sbjct  45   RYAFQYNEVVRLWCYNANIDWGQSAEHYGINLLGISLYFGNLLLGWLQSAVVALPLLLIL  104

Query  149  NPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLI  208
               + +++         +     S +  S+F          F  +   L    + + L I
Sbjct  105  KRPDASFKDQYRHFIKVFFRFFSSNVLISVFFQAYFLLETFFTGLPFRLIGSSTASTLAI  164

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
            L+ ++    SL L    L +  ++    +   +D +    A ++   L++G +W      
Sbjct  165  LIRVLFFFVSLALFPFWLFYSTFWSLSIF---EDGLTVWSAFKRCWSLLTGQFWRSIWTI  221

Query  269  V-LLLVISLTLSFLTARIPY-------VGEAANLAFSLLLTPFSFLYY--YLIYSDLKAN  318
            V ++  +SL ++ L   + +       +         +  TP     Y    +  D +  
Sbjct  222  VKVVFFMSLPVAILIRALAFWEEQTYALTNFFIWVILVFSTPLVTAAYRHIYLALDGREI  281

Query  319  YRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQ  378
            +       +  ++  L+  ++      GLL                     +        
Sbjct  282  FLHKSRKSLSDRFQDLSQQVYDQKKAMGLLSDEEEHILPDPITGEPLWASGEYEDNAVEW  341

Query  379  QTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGP  419
            +    +     +    S        S +   T +      P
Sbjct  342  EDAAASPPDSPKDDAPSGEIRHQDDSDELADTDKSNAFNEP  382


>HGF33636.1 hypothetical protein [Desulfobacca acetoxidans]
Length=91

 Score = 48.2 bits (111),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 16/36 (44%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V CP CG + +   S++P   +  RC  C    
Sbjct  1   MI-VVCPECGTKFSLDQSRIPGATAKVRCSRCRHVF  35


>PIZ30407.1 hypothetical protein COY40_04730 [Alphaproteobacteria bacterium 
CG_4_10_14_0_8_um_filter_53_9]
Length=511

 Score = 52.9 bits (123),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 9/44 (20%), Positives = 20/44 (45%), Gaps = 0/44 (0%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
            M T  CP C A+    + ++  +  + +C +C      +P + +
Sbjct  230  MITATCPECSAQYKLTAQQVGPQGRTLKCAKCGHKWFMEPVKEE  273


>NJO84589.1 tetratricopeptide repeat protein [Blastochloris sp.]
Length=423

 Score = 52.5 bits (122),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 17/107 (16%), Positives = 33/107 (31%), Gaps = 14/107 (13%)

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY---  287
             F F    +  +  G   +L +S  L  G +   +G  +L  +++  L  L   +     
Sbjct  101  RFVFAAQAVVLEGSGAFASLTRSWRLTRGMFGQTYGVVILGALLTGFLYGLPYALGVVSL  160

Query  288  -----------VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                       +    +     L+ P       L Y + +A   G  
Sbjct  161  GFNLAAPFVQPLAVLTSQLVQALVLPLQLAIMTLAYYNARARNEGYD  207


>QHI70525.1 hypothetical protein GT409_14115 [Kiritimatiellaeota bacterium 
S-5007]
Length=261

 Score = 51.7 bits (120),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 34/205 (17%), Positives = 62/205 (30%), Gaps = 15/205 (7%)

Query  92   EFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQ  151
              +A  +    +  L  D W ++C     L  I LL   L  A     +    A +   +
Sbjct  1    MMQALCNQKHRLRDLFLDGWGIYCHCFVPLAVILLLCNGLDQALKVLLIKAFDAPFKLAE  60

Query  152  NQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTL------  205
                  ++       ++   S     +    C   +  F      ++ + S         
Sbjct  61   FLGSFASLAGMLSVLVITIESVSGHHVKWTECFRKIRRFFLKAAWVKMIVSIIQSLVGLP  120

Query  206  ---------LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLL  256
                      L + + +     + L++P L   V   F  Y +      G+ AL  S  L
Sbjct  121  AILLVLNKDALDVPVKLFTILEIALMLPFLWISVSLLFSVYAVILREKSGIDALFHSWRL  180

Query  257  VSGHWWAIFGRFVLLLVISLTLSFL  281
            V   WW +F    LL +     +  
Sbjct  181  VRHRWWYVFMTVFLLNLPIFIFTMA  205


>HIE06348.1 hypothetical protein [Candidatus Stahlbacteria bacterium]
Length=176

 Score = 50.5 bits (117),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 10/34 (29%), Positives = 15/34 (44%), Gaps = 1/34 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M  V C  C  + N   +K+P +    RC +C  
Sbjct  1   MI-VVCDKCSKKYNVDDAKIPDEGIKVRCAQCGN  33


>WP_107584647.1 hypothetical protein [Alkalicoccus saliphilus]PTL39046.1 hypothetical 
protein C6Y45_07645 [Alkalicoccus saliphilus]
Length=289

 Score = 51.7 bits (120),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 29/215 (13%), Positives = 63/215 (29%), Gaps = 27/215 (13%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            LL    +   L           K            ++ + L  +    + L+ +     +
Sbjct  94   LLVYPFVYASLILFISGWRDGNKLEVKEARTQAMQKYGVSLGALFLFTVILTAVFLVFLV  153

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
            ++    +      ++         + L L  + +  G ++L                   
Sbjct  154  FLLIMPLYTGWMAEVVFAVFYFGFMGLFLSKISLFLGHIVLADTR---------------  198

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-------PYVGEAAN  293
               +G  ++      L  G    +FG F+LL +I   ++FL   I         VG    
Sbjct  199  -QGLGFGES----WELTRGKTGYVFGFFILLFLIGAGVTFLFQFIIGALLGNSVVGGLVV  253

Query  294  LAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
                L +T F    Y ++Y ++       ++    
Sbjct  254  NFIGLFVTLFYVAGYTVLYREVTKPEPEHEYEEQP  288


>WP_014373691.1 hypothetical protein [Saprospira grandis]AFC23448.1 hypothetical 
protein SGRA_0709 [Saprospira grandis str. Lewin]
Length=269

 Score = 51.7 bits (120),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 29/191 (15%), Positives = 61/191 (32%), Gaps = 11/191 (6%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            I  L +   F   +  +  K      P+  ++  A         +  L+ +   +     
Sbjct  66   ILDLFVNAFFVAGYYYMGDKIYRKEQPKFSDFFLAGPKLLKLVGVSLLTSVVFLLPFLPL  125

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
               V + +++   L    S    +    L       L  +  LL          ++   N
Sbjct  126  LIFVLINQNLSTELDMAISGNFFMPSFGLFGFVLLFLGALASLLSYAALSLGLPIITFSN  185

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
            +G + A++ S  L+ G+    F             +F++  + +      L   L+  P 
Sbjct  186  LGVIGAMKASVKLMKGNLLNWFFY-----------AFISLLLLFFATIVLLFGLLVAIPV  234

Query  304  SFLYYYLIYSD  314
              L  Y+I +D
Sbjct  235  VMLAQYVILAD  245


>HGF76395.1 hypothetical protein [Firmicutes bacterium]
Length=349

 Score = 52.1 bits (121),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 46/254 (18%), Positives = 86/254 (34%), Gaps = 12/254 (5%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
             R         LL   L       +              +  ++IL      I L L  +
Sbjct  17   WRIYRSNFKKSLLHFSLPILFFLISTTSLCLLDETNFRYSPLFSILSFIFYLISLFLFPL  76

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
            +    ++     +   +S+ + L+    F LL  +LI ++ G  +L  IP +L   WF F
Sbjct  77   SFLSLLHSLSQKLPFKQSILITLKKFPRFLLLAFILISILLGSFILGFIPFILAFSWFTF  136

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY-------  287
              + L  +  G L +L  S+ LV G  W        + +  L +      + +       
Sbjct  137  SLFPL-IEGYGILDSLLLSKYLVRGRLWRTLLNLSAIALPILAIGMGIPLLLFLKIDKIY  195

Query  288  ----VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWML  343
                +  +      L L+PF  +Y  L+Y  LK +   P+ P   +  +          +
Sbjct  196  LSVHLAISTFAMLQLFLSPFLIIYLSLLYESLKQSKELPEKPLTIKWEILFAIPSLLGFV  255

Query  344  IPGLLLVSLSRQNL  357
            +   L++       
Sbjct  256  LMSFLIIFSLLNIF  269


>MXU63985.1 hypothetical protein [Rhodobacteraceae bacterium KN286]
Length=375

 Score = 52.5 bits (122),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 9/46 (20%), Positives = 17/46 (37%), Gaps = 0/46 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQT  48
            + CP C A+     + +P +    +C  C  T   D   ++    
Sbjct  2   RIICPSCSAQYEVDDNAIPEEGREVQCGTCSTTWFQDKRSAEAPMP  47


>NOX39819.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=328

 Score = 52.1 bits (121),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 11/38 (29%), Positives = 15/38 (39%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  V CP+C A+     S +P      +C  C  T   
Sbjct  1   MRLV-CPNCAAQYEVDDSAIPENGRDVQCANCGNTWFQ  37


>QDP19568.1 hypothetical protein FMM02_06090 [Sphingomonas sp. AE3]
Length=361

 Score = 52.1 bits (121),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 10/64 (16%), Positives = 18/64 (28%), Gaps = 1/64 (2%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M    CP C  +       +P      RC  C  +   DP  +   ++     +     +
Sbjct  78   MIL-TCPACATKYVVKDDAVPPGGRQVRCASCKHSWHVDPEPAADEESQAVRPSQAAAPI  136

Query  61   QRRI  64
                
Sbjct  137  PSEP  140


>TXD38845.1 hypothetical protein FRC98_00125 [Bradymonadales bacterium TMQ4]
Length=554

 Score = 52.9 bits (123),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 10/48 (21%), Positives = 17/48 (35%), Gaps = 1/48 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQT  48
           M  V+CP C +      + +P      +CP C    +  P      + 
Sbjct  1   MI-VQCPSCSSRYRVNDANIPPSGGKIKCPSCAHAFVVYPEAPAEPEH  47


>KQC13745.1 hypothetical protein APR63_07605 [Desulfuromonas sp. SDB]
Length=292

 Score = 51.7 bits (120),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + CP C         K+PA  +   C EC    
Sbjct  1   MI-IECPRCKTRYKVDEQKVPAGGAPIECIECGNIF  35


>KAF7842967.1 putative transmembrane protein [Senna tora]
Length=628

 Score = 52.9 bits (123),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 22/181 (12%), Positives = 51/181 (28%), Gaps = 0/181 (0%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
              W LL +     ++                +     +   +      A   +       
Sbjct  380  HEWTLLLVIQFCYLIFLFAFSLLSTAAVVFTVASLYTSKPVSFSSTISAIPRVFKRLFIT  439

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             +++ +          + L L  V   T    LL   V    +L +   +     +    
Sbjct  440  FLWVSLLMIAYNFIFVLSLVLLIVAIDTHNSFLLFFAVLVIGVLFLGVHVYITALWHLAS  499

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
             V   + + G  A++KS  L+ G          + L+    +  + + +   G  +   F
Sbjct  500  VVSVLEPLYGFAAMKKSYELLKGRTKFAAIIVFVYLICCGVIGGVFSAVVVHGGGSYGVF  559

Query  297  S  297
            +
Sbjct  560  T  560


>TLY28689.1 hypothetical protein E6K62_09855, partial [Nitrospirae bacterium]
Length=257

 Score = 51.7 bits (120),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 30/166 (18%), Positives = 59/166 (36%), Gaps = 0/166 (0%)

Query  145  ATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT  204
             T+ +         +       I + +      +     K    +F       +      
Sbjct  51   LTFASQVVPLLAPIVGQVLSVVIQVVMFAGLAIVTWKQGKDGSTVFADFFPDWKTTAQLV  110

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
               ++ + +V  G   L++PG+   V + F   ++ D  +G  QALE SR +V+ HWW +
Sbjct  111  WCTVVGLFLVVIGLAFLVLPGIYLLVAYTFSYLLIVDRRLGVWQALEGSRRVVNKHWWGV  170

Query  265  FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
            FG  + +L++      +   +  V     L+          L   L
Sbjct  171  FGLTLAMLLLIGIGGMIGVGVLGVPIGYGLSGWFPEVSLDQLPLAL  216


>WP_144864987.1 hybrid sensor histidine kinase/response regulator [Hyella patelloides]VEP14517.1 
Two-component sensor histidine kinase [Hyella 
patelloides LEGE 07179]
Length=951

 Score = 52.9 bits (123),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 45/414 (11%), Positives = 109/414 (26%), Gaps = 16/414 (4%)

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
            L      G+  +     +LI++ L       +  +    + +  +   Y L        +
Sbjct  30   LSFPFGFGVDFLFGSITVLIVVRLYGIWWGTVASLIAGSYTIALWQHPYALIIFTC---E  86

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT-ARIPYVGEAANLAFSLLLTPFSFL-  306
             L  +  L  G+   +    +  L+I + L +L    I  VG    L   +         
Sbjct  87   TLFVAWRLRRGNQNLLLIDMIFWLLIGMPLVWLFYGGILQVGAITTLIIVVKQAVNGIFD  146

Query  307  ---YYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLL  363
                  L+       +           +  +   +    ++   L++            +
Sbjct  147  ALVASLLLTYKPLYRWGNFSLRNANFSFEQILLNLLVAFVLIPALMLMFV------SNRV  200

Query  364  SAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLF  423
            +   +    + T      +++  L    Q    A  +L  +         G +   + L 
Sbjct  201  AMKHEQNTLIATLDTSAQNMSAYLLRWHQSGLEALSRLAETSSETQIVISGQTQQSIELA  260

Query  424  ADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAF  483
                    Q   +   LE+    + S     S  ++  ++       ++      E    
Sbjct  261  IGSLPLFRQIYIINADLEVIAAASSSNEFDRSDYLDFSQLDIPRNPQIFIIPDLTEETDA  320

Query  484  HWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTL  543
                        +        +      + +  +L     T+PL        +  I  T 
Sbjct  321  SSKTKILQTLPIILDNRWLGNIIAELDIDFIGQLLQTETYTVPLKSTLFDENQLIIASTH  380

Query  544  QIGGKQLILQRLGSNAVTL-RFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGD  596
                 Q +L R  +  +      G+  ++ +        +PL           +
Sbjct  381  NELDAQQVLNRSQTGEINYVESDGEDREIYHWLPI-VEGKPLIGRWKESFYGRN  433


>PAW78819.1 hypothetical protein B9S32_05405 [Verrucomicrobia bacterium Tous-C9LFEB]
Length=330

 Score = 52.1 bits (121),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 32/282 (11%), Positives = 72/282 (26%), Gaps = 64/282 (23%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQ-NWQWAILLATV  164
                    F    +    I      L+    F    ++ +  +       W    +  ++
Sbjct  28   CFQLYRRHFIPLFYFSALIQAFPFSLSLILAFFGRNVQISDLMTHPENLTWFLCRIFLSL  87

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL---  221
              + LG + +T  +        V +  S+    R   +      L   ++    L     
Sbjct  88   VLLTLGEAALTDYVARLYLGKSVSVRSSLATMSRRSPAVLWSTALKYFLIFLAFLPCVIP  147

Query  222  --------------------------IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
                                       +P L+  V +      +  + + GL AL +S  
Sbjct  148  IVIGQSWRGPLSVWMIAGLSLAGFILFMPWLILAVRYLVMMQAVMLEKVSGLAALRRSSE  207

Query  256  LVSGHWWAIFG--------RFVLLLVISLTLSFLTARIPYVGE-----------------  290
            ++  +                +L++ ++  L  + + +P +                   
Sbjct  208  IIRYNLGKNIMQWGETRISLILLVIGVANILVIIASHLPQLVATAGEMLRGNLNPDSITL  267

Query  291  ---------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                       N   S LL P   +   L Y D++       
Sbjct  268  SPVIMTATDLLNFLGSALLAPLYVIGGTLFYYDVRVRKEAYD  309


>TMQ34936.1 hypothetical protein E6K70_04975 [Planctomycetes bacterium]
Length=256

 Score = 51.3 bits (119),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 28/233 (12%), Positives = 51/233 (22%), Gaps = 15/233 (6%)

Query  1    MPTVRCPHCGAERNTPSSK------LPAKKSSARCPECCQTLIFDPAESQRTQTTDNIAT  54
            M  V+CP C  +     SK       PA     R P         P   +   +      
Sbjct  1    MIRVQCPKCDKKLALDDSKAGGVGACPACGQRFRVPGTRAQTPDTPNADKVRASGPARHA  60

Query  55   CPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSW---  111
                  QR         Q  +                           +   +   +   
Sbjct  61   NKATAKQRPAGKQTAPQQPPSRPKEPWEEEDSSPYAVREEPESDDRPKVEYGIDKDYEKK  120

Query  112  ------ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
                  E   +   G +G+  L I +         ++    W++        +     + 
Sbjct  121  VERKRQEEQQKETRGFIGLIFLMIFIWILMGVMPFIMFELVWVSLSMGLLLTSAGGIMMT  180

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
                        +F+        +    +            LIL   +V GG 
Sbjct  181  TAAFHEGKGLLMLFVPFYGFYFMIMYWKEARNGFCIWLIGCLILGTALVTGGF  233


>OQX01670.1 hypothetical protein BWK80_59455 [Desulfobacteraceae bacterium 
IS3]
Length=263

 Score = 51.7 bits (120),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 39/282 (14%), Positives = 81/282 (29%), Gaps = 23/282 (8%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             ++CP+CG  +    SK+P K     C +C                           +  
Sbjct  2    KIQCPNCGISKEANDSKIPDKGCYVGCRKCGHRF----------FVKKENNFIEKKEILS  51

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                    ++ ++V   +      +    ++       R  + L+  +   F +    L 
Sbjct  52   SEEDVSCPVEQESVQTEKDFPKAEIAGVLKYPKPSPNKRICAILIDFAIVTFLQSLLHLF  111

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
            GI     +      F  +        +P        +       I L  S+         
Sbjct  112  GINTKNFIFQSLFFFLYVFRDSYKGQSPGKVLIGLKVTDLEGNPISLSASF---------  162

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
             K ++ LF    +    +    +    ++ ++     L++I   +  + +    Y     
Sbjct  163  -KRNIILFWEYIVYAPMLLILLIEPNFILKIIAIHIGLIVIGVFISIIEYIKITYSYDGR  221

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
             IG   A  K   L   H      R+  L +I + +S L   
Sbjct  222  RIGDKLAKTKVIDL---HPDRGGWRYFFLSLIIIFVSVLLQM  260


>WP_134724441.1 zinc-ribbon domain-containing protein [Paracoccus luteus]
Length=239

 Score = 51.3 bits (119),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 10/44 (23%), Positives = 15/44 (34%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  V CP CGA+    ++ +P +     C  C            
Sbjct  1   MRLV-CPRCGAQYEIDAAAIPPQGRDVECSSCDHVWRATRPFDP  43


>HBE00133.1 hypothetical protein [Gemmatimonadetes bacterium]
Length=152

 Score = 49.8 bits (115),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 10/35 (29%), Positives = 15/35 (43%), Gaps = 0/35 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
             ++CP C +       K+P    +ARC  C Q  
Sbjct  3   IQIQCPSCPSSFPVDPDKIPEAGVNARCSSCGQVF  37


>HAC57592.1 hypothetical protein [Rhodobiaceae bacterium]
Length=378

 Score = 52.1 bits (121),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 13/38 (34%), Gaps = 1/38 (3%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
            M  ++CP C A      +         RC +C  +   
Sbjct  83   MI-IQCPSCSARYPVDGASFAPSGRKVRCAKCGHSWHQ  119


>MBB24536.1 hypothetical protein [Geminicoccus sp.]
Length=205

 Score = 50.9 bits (118),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 26/176 (15%), Positives = 51/176 (29%), Gaps = 32/176 (18%)

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG--  225
               L+  T +    +    V + ++M        +  L  +L+ L +    ++L+     
Sbjct  4    GWLLAIPTQAGLNLVRTGGVSIGQAMGKSFSLWFALGLAFLLMQLGIMVLEIILVWVPSE  63

Query  226  ---------------LLFCVWFFFCQYV-LADDNIGGLQALEKSRLLVSGHWWAI-----  264
                           ++  +       + +  D  G + A  +S  L  GH  +I     
Sbjct  64   FNSMLGVIGNIVNTIVVIAISLALAPLLLVILDGSGAMDAFGRSLRLTRGHRLSILLFGL  123

Query  265  ------FGRFVLLLVISLTLSFLTARI---PYVGEAANLAFSLLLTPFSFLYYYLI  311
                  F   +   VI     FL A I      G    +  +L    F  +     
Sbjct  124  LAALAVFVVMLAAGVIGAFAYFLLALILSDALAGTIVGIFGALFALCFYSVVSIYF  179


>RMG21182.1 hypothetical protein D6729_01370, partial [Deltaproteobacteria 
bacterium]
Length=85

 Score = 47.8 bits (110),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 11/61 (18%), Positives = 22/61 (36%), Gaps = 0/61 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           VRCP C  +     +++  +  + RC EC             T+  ++    P    +  
Sbjct  3   VRCPSCATQYEFDDARISEEGVTVRCTECGYVFKVKRRAVAVTEPLEDARPGPDGTTRPW  62

Query  64  I  64
           +
Sbjct  63  M  63


>NUQ91842.1 DUF3426 domain-containing protein [Gemmatimonadaceae bacterium]
Length=83

 Score = 47.8 bits (110),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 12/31 (39%), Positives = 14/31 (45%), Gaps = 0/31 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           V CP C +      SK+PA    ARC  C  
Sbjct  3   VACPECRSVFRVDPSKVPAGGVRARCSVCGG  33


>KAB2879190.1 hypothetical protein F9K33_10200 [bacterium]
Length=532

 Score = 52.5 bits (122),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 27/213 (13%), Positives = 65/213 (31%), Gaps = 12/213 (6%)

Query  424  ADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAF  483
            ++    D           LS    L  A   + R  + + + D+  DL           +
Sbjct  315  SNLSDPDYNLFGNDQSYRLSLLGELPGALLAAERAVVTRAIADNGDDLLPTDEWSRETTY  374

Query  484  HWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQ--LTRNDIGK  541
              V  + +             L      + +  + G +  +     + +   +     G 
Sbjct  375  LQVSQSASH-----ILWDVTLLVPDQTVKGIKELSGFVYASSASGAKWVDTGIKELKEGS  429

Query  542  TLQIGGKQL-ILQRLGS-NAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFS  599
                    +  + +  S   ++L+F    T + ++   ++  + L    ++    GD  +
Sbjct  430  RGTSFDVTIETITQNESSQEISLKFAVSYTMIKDIQILDATGKVLNTQKYSSSDFGDGCT  489

Query  600  LRQMFDG---NIESITVLVAGDSMTQSYPFELT  629
            +    DG      S+ + +  D      PF +T
Sbjct  490  IGLYIDGQFPPKGSVKIEMYEDVQKLEIPFSIT  522


>MBI4544038.1 zinc-ribbon domain-containing protein [Gemmatimonadetes bacterium]
Length=145

 Score = 49.4 bits (114),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 12/38 (32%), Positives = 14/38 (37%), Gaps = 0/38 (0%)

Query  6   CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
           CPHCGA       ++P     ARC  C         E 
Sbjct  5   CPHCGALFRVDPERVPPTGVRARCSRCGGVFPVRRRER  42


>NJN47961.1 hypothetical protein [Candidatus Competibacteraceae bacterium]
Length=545

 Score = 52.5 bits (122),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 18/139 (13%), Positives = 47/139 (34%), Gaps = 11/139 (8%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +  +L   +    +    +   D+           ++        + + +    +++ 
Sbjct  138  MLLLIVLPLAAIGALAYLGLLTGQDINW---------YLTERPPAFWIAVTIGAVLAVVG  188

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
                 L  V +     V   + + G +AL +SR LVS + W +    +  ++++  +S L
Sbjct  189  FALIALVLVRWALAVPVTLHEGLSGFKALRRSRALVSNNGWRVARLVLGWIILTTIVSAL  248

Query  282  TARI--PYVGEAANLAFSL  298
               +   + G    L    
Sbjct  249  LVALVDGFAGLLLKLFPGT  267


>EGT3616849.1 DUF975 family protein [Clostridium perfringens]
Length=307

 Score = 51.7 bits (120),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 23/167 (14%), Positives = 54/167 (32%), Gaps = 6/167 (4%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
            W   I  + V+ +      +   + + +   +V       + L ++    +L   L + +
Sbjct  93   WVNFIKNSLVSILFTLPIIILSIIIMAVSLMNVYRTFYSSIFLNYINYDLILNRFLGVFI  152

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
                L+ I   ++    F     ++ +  IG  +A+ ++  ++ GH W +F   + LL  
Sbjct  153  IVILLICIYSIIIRLFLFPVKYIIVDEPEIGIWEAVGRAFKIMKGHKWELFILELSLLGW  212

Query  275  SLTLSFLTARIPYVGE------AANLAFSLLLTPFSFLYYYLIYSDL  315
             +           +             F L L  F      +     
Sbjct  213  KILAILPLTIGVVLVTLMDWNIILIAPFGLGLLWFYCYANTVYRVYY  259


>MBI5241206.1 hypothetical protein [Elusimicrobia bacterium]
Length=271

 Score = 51.7 bits (120),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 44/203 (22%), Positives = 75/203 (37%), Gaps = 8/203 (4%)

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
                    +   +     GL   ++   + +  F L+ +L +    GG+ LL++PGL   
Sbjct  75   VWGQAALLLAAALPGPAPGLEACLETSWKRLPGFLLVCVLYLAACLGGTCLLLLPGLAAS  134

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
            +W  F   +   ++IG ++AL KS  LV G  W + GR  L              +P + 
Sbjct  135  LWLVFGPLIYLTEDIGPVEALLKSWHLVRGRSWPVAGRLCLFG--------AAGTLPGLV  186

Query  290  EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLL  349
             AA +    L  P   +    +  +L+ +  G    P +RQ   L  A    +L   LL 
Sbjct  187  PAAGVVLQTLAWPMVLMSLAALLDELRRSRGGEPFVPARRQKAFLFIAAGASLLPLSLLP  246

Query  350  VSLSRQNLSAEQLLSAGKDIQQR  372
               +R    A    +        
Sbjct  247  WLAARFMSYAAGHSAEFAQRLMP  269


>RJL00941.1 thioredoxin, partial [Paracoccus siganidrum]
Length=50

 Score = 46.7 bits (107),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 11/40 (28%), Positives = 16/40 (40%), Gaps = 0/40 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
             + C  CGA+    +S LP      +C  C  T +  P 
Sbjct  4   IKLTCETCGAQYKLATSALPPAGREVQCTACGHTWLARPH  43


>WP_148240793.1 hypothetical protein [Nocardioides sp. S-1144]QCW51723.2 hypothetical 
protein FE634_17250 [Nocardioides sp. S-1144]
Length=176

 Score = 50.2 bits (116),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 17/131 (13%), Positives = 41/131 (31%), Gaps = 20/131 (15%)

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL--  276
              L +    +   ++     L  + +G   ++ +   L    +W   G  +L ++++   
Sbjct  40   FFLALMTWFWIRCYYLPVPALMLERLGVFASIGRGYRLTRHQFWRTLGIALLTVLVAGTA  99

Query  277  --TLSFLTARIPYVGEAANL----------------AFSLLLTPFSFLYYYLIYSDLKAN  318
               LSF  + +  +G  A                    +  + PFS     + Y D +  
Sbjct  100  AQVLSFPFSLVAQLGSLAAGEYGALVFVLGTAIAQVLSTAFVAPFSAAVTSVQYLDQRMR  159

Query  319  YRGPQHPPIKR  329
                    ++ 
Sbjct  160  KEAFDVELMRE  170


>RLG70713.1 hypothetical protein DRO04_01305 [Candidatus Diapherotrites archaeon]
Length=262

 Score = 51.3 bits (119),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 22/176 (13%), Positives = 50/176 (28%), Gaps = 9/176 (5%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM---------TGSM  178
               +A       L           +     AI       +    +               
Sbjct  48   MQNIAIGGDLPMLYFYMQQSTFWISFFILSAIATFLAMVVYFAYANYAKGLTAKEALSMA  107

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
               +      +     + +   G  T+ + +   +    SL+L I      + F     +
Sbjct  108  LKKLRHAAALVVVLFSIQIMLFGINTIFMNITPTIALLISLILWIVIFYAFIRFSLIIPI  167

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
            LA +     + L+K+  L  G+   I G F+  L++ + +  +   + +      L
Sbjct  168  LALERRRVKKGLQKAWELSKGNTLKIVGIFLAFLLVMVCVYAVGDFLEWFYAITGL  223


>WP_090520267.1 zinc-ribbon domain-containing protein [Paracoccus isoporae]SDD30895.1 
MJ0042 family finger-like domain-containing protein 
[Paracoccus isoporae]
Length=367

 Score = 52.1 bits (121),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 13/82 (16%), Positives = 21/82 (26%), Gaps = 1/82 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP C A+   P   +P +     C  C      D A     ++ +  A       
Sbjct  1   MRLI-CPECNAQYELPEDAIPPEGREVECAACGTIWHQDQANKAPARSAEPPAGSRAPHS  59

Query  61  QRRIPSDRLEIQSKTVNCRRCN  82
                      Q    +     
Sbjct  60  VPNADQPAHAEQPPAPDMETRP  81


>WP_194189952.1 DUF975 family protein [Clostridium sp. PT]
Length=291

 Score = 51.7 bits (120),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 19/155 (12%), Positives = 46/155 (30%), Gaps = 12/155 (8%)

Query  158  AILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG  217
                      +   S +   +        +  +      L + G     L  L+  +   
Sbjct  86   YFRYVGYTIFVGLFSLIVSLILCIPILFKLLDYIRNIDLLLYYGIGLGDLGGLVGEIIIL  145

Query  218  SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
             ++ +  G++F ++F FC   L  +      +  KS  ++ G+   +F   +  +   L 
Sbjct  146  VIIAVAAGIIFNLFF-FCTKFLIVEGNSVKDSFGKSIEMMKGYKGKLFLTQLSFIGWFLL  204

Query  278  LSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
            +                 +S+   P+        Y
Sbjct  205  V-----------IITCGIYSVWFIPYYNASMAQFY  228


>PYV00698.1 hypothetical protein DMG26_14940 [Acidobacteria bacterium]
Length=572

 Score = 52.5 bits (122),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 35/273 (13%), Positives = 75/273 (27%), Gaps = 42/273 (15%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLA-TVAYILL  169
            +  +       +GI  +  V A   +   + +         +      ++         +
Sbjct  19   FNYYRSHFRVFVGIMAIPHVFALGSMVLLIQVLREAPKYQSHLLAGLFVMTIPYGLAYPV  78

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL---  226
             L   T ++       D  +  + +     V     L ++ +L+V      L I  L   
Sbjct  79   ALGATTFALSRLYLGQDATVRSAYRSIQGSVWRLVKLFVVTLLLVLSPFFALGIVPLHFL  138

Query  227  ----------------LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
                               + +      L  +N+G  +AL++S +L  G+   +F    L
Sbjct  139  PELYLTLVVLAVPLATWLILRYGVAVPALLLENLGARRALKRSAVLTKGYKGRLFAIGFL  198

Query  271  LLVISL----TLSFLTAR---IPYVGE---------------AANLAFSLLLTPFSFLYY  308
            + +I+      L         +P+                   A      L  P   +  
Sbjct  199  MALIAWSAAKVLEVPFGGGLELPFWAAALKGQISAWLVVARLLAASVSWALTAPLLPIGL  258

Query  309  YLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
             L Y D +    G     +        + +  W
Sbjct  259  ALAYCDARVRKEGFDLQLMMAHMGDAGSVVRPW  291


>WP_144846936.1 DUF975 family protein [Lactobacillus gasseri]TVV01469.1 DUF975 
family protein [Lactobacillus gasseri]
Length=265

 Score = 51.3 bits (119),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 23/171 (13%), Positives = 58/171 (34%), Gaps = 7/171 (4%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
              L  +  +    S      +     Q   W    ++  +   ++       ++      
Sbjct  35   LFLEYITNYVWTGSLNTNNLSLISWWQGLGWSIFTIITALIATMIAWGVQYATLAFRDTG  94

Query  185  TDVGLFRSMKLGL--RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD-  241
                +F++M       +     L  +L  L      LLLI+PG++    +    Y++ D 
Sbjct  95   KKPNVFKAMFSSFTNGYFFKTFLTSLLTTLFTFFWGLLLIVPGIIKSFSYAMTPYIMKDM  154

Query  242  ----DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
                  +   +A+ +SR ++ G+   +F  ++   +    +      I ++
Sbjct  155  IDSKHEMTATEAISESRKIMKGNKTTLFIIWLTFNIWYFIIGLAGIAIAFL  205


>KAA0218568.1 hypothetical protein EDM80_00105 [bacterium]RIK65599.1 hypothetical 
protein DCC64_00325 [Planctomycetes bacterium]
Length=342

 Score = 52.1 bits (121),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 27/275 (10%), Positives = 74/275 (27%), Gaps = 12/275 (4%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
            L   +              + +             +    +    +     +   +    
Sbjct  66   LIESVLVIFQLVLAWALAGSGVYFLVSRLYLGSETSIRQALSAVAAHFAALLSSGVIVLG  125

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            V       + +  + S          V    S+ ++    L    F      +  +++  
Sbjct  126  VLGLLVGAVLVPLLISIEGGGKEGAGVAFLMSMAVVPALPLLYGRFGLVFCCVMLEDLRP  185

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISL-------TLSFLTARIP----YVGEAANLA  295
             +AL +S  L + +   + G   ++L++           +   +       + G   +LA
Sbjct  186  DEALMRSWRLSANYGGRLVGIAFVMLLLFAMLRIPGEVCASALSGYGRSYEFAGSMLSLA  245

Query  296  FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQ  355
            +  L+ P   +   + Y DL+    G     +   +  +       +   G    ++   
Sbjct  246  WQALVAPVLVIPPAIFYFDLRCRKEGFDLAVLAASF-GVDPNFLAQLQAQGRNSYNIPGY  304

Query  356  NLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEE  390
                    +      Q L ++ +    +   LP  
Sbjct  305  VPRGWDPQTGQLPTIQGLNSRTRPRGAVRVPLPRP  339


>WP_108103268.1 zinc-ribbon domain-containing protein [Geobacter sp. DSM 2909]PTV87458.1 
putative Zn finger-like uncharacterized protein 
[Geobacter sp. DSM 2909]
Length=302

 Score = 51.7 bits (120),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 31/306 (10%), Positives = 79/306 (26%), Gaps = 26/306 (8%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             + CP+CG       S +P   +S RC  C ++        +               L  
Sbjct  2    KITCPNCGHTAEVDVSVIPTGSASVRCNACGESFSLPKGSGEPVGNLRRKRLKCPICLTE  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRA----------SGSGLRSISQLLADSWE  112
            +   DR +             +     +R                      + +      
Sbjct  62   QNQGDRCQECGLIFESFIIEHASPAARDRISSMSSATAPLKVSYRYNKSLHNSIQQHIAA  121

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI-----  167
                       ++++  +++   +   L                   + +T +       
Sbjct  122  HRLVNDKYYYLLFIIYFIISLMILCFDLGNIELLIQIVNPIVSLIPTISSTASITPNPGA  181

Query  168  --LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
              L+        +  YI K  V  +  +           +L I  ++ V     L+    
Sbjct  182  SKLILAFSWVMIIPFYISKIIVIKWDEVNWSGIKNIFSNILRINNMIFVLLFVALVTSIL  241

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            L+  ++  F           G     +    +  ++      + + +++  +  F+  +I
Sbjct  242  LITAIYLPF---------TKGYTYHRRLLYSLFCNFTLTASLWGMCIILIFSNLFILIKI  292

Query  286  PYVGEA  291
             + G  
Sbjct  293  MFYGVI  298


>MBI4184865.1 zinc-ribbon domain-containing protein [Proteobacteria bacterium]
Length=468

 Score = 52.5 bits (122),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 9/37 (24%), Positives = 12/37 (32%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M    CP C    N   + L  +    RC  C  +  
Sbjct  1   MIL-DCPACATRYNVDPAALGPQGREVRCFNCGHSWH  36


>WP_169849444.1 zinc-ribbon domain-containing protein, partial [Corallococcus 
exiguus]NNB98609.1 hypothetical protein [Corallococcus exiguus]
Length=135

 Score = 49.4 bits (114),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 9/35 (26%), Positives = 13/35 (37%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            V CP C    N    ++P   +  +C  C  T  
Sbjct  2   KVSCPSCQTNYNIDDRRIPPGGAKLKCARCQTTFP  36


>PLX91758.1 hypothetical protein C0621_10775, partial [Desulfuromonas sp.]
Length=82

 Score = 47.8 bits (110),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 7/36 (19%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  ++C  C         K+    +  RC +C    
Sbjct  1   MV-IQCSECQTRFKLADDKIKPGGTKVRCSKCRHVF  35


>HBL12542.1 hypothetical protein [Cyanobacteria bacterium UBA11162]
Length=474

 Score = 52.5 bits (122),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 28/245 (11%), Positives = 65/245 (27%), Gaps = 40/245 (16%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNW-----QWAILLATVAYIL  168
            F    W  + IY          I +                              +  ++
Sbjct  219  FQAFFWSFIPIYGWAKHCQIQAIIARHSFSELINQPETVTTIRTELNLRLWDFWVLQILI  278

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
              + +      + +      +   +        +  ++L++L LV   G +        +
Sbjct  279  GFMVYAVYIGLLIVLNLVNLILEMLSSIFSDNIALIIILVILQLVAMIGFITTY--LWFY  336

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL-------  281
              +F     +  +D +     + +S  L  G  W I    ++  +I++ L+ +       
Sbjct  337  AHFFIAELPLALEDKMSSTGCISRSWTLTKGFVWRIESILLVAALITIPLTLIAALPLAI  396

Query  282  --------------------------TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
                                         I ++        + LL PF      +IY DL
Sbjct  397  IIPIFTQLINPLVEPSSETIVSALITIMFIVFIVVILLSLVNTLLMPFWQAIKAVIYYDL  456

Query  316  KANYR  320
            ++   
Sbjct  457  RSRRE  461


>WP_181180903.1 zinc-ribbon domain-containing protein, partial [Paracoccus sp. 
FO-3]
Length=109

 Score = 48.6 bits (112),  Expect = 3e-04, Method: Composition-based stats.
 Identities = 9/37 (24%), Positives = 12/37 (32%), Gaps = 0/37 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
            + CP C A+     S +PA      C  C       
Sbjct  2   RLTCPRCAAQYEIAESAIPASGREVECSACGHVWRQP  38


>OMO95110.1 hypothetical protein COLO4_16060 [Corchorus olitorius]
Length=428

 Score = 52.1 bits (121),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 28/245 (11%), Positives = 72/245 (29%), Gaps = 11/245 (4%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
             +F      +     +  +       +++    +        +      +  +    +  
Sbjct  184  NVFVGIQRDIKIFAGVEWIFLLLSAIASIFFTVSITHASALIHGGKTTSMKNLVLRTIRS  243

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
                     YI    +G      + L  +       ++         +L ++  +   V 
Sbjct  244  LKRPFITSFYITLFSLGYIFLSLITLLPLVLILGNQVISSYSGILLWILAMVFYIYLSVV  303

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT-ARIPY---  287
            +     V   +   G++AL K+  +V G     F   ++L ++S+ L       I +   
Sbjct  304  WNLSIVVSIMEEKSGIEALGKAAEIVKGMKLQGFILSLVLTILSVILFQGFRWIINFNVK  363

Query  288  ------VGEAANLAFSLLLT-PFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFG  340
                  +     +  S+ +   F    Y ++Y   K N+        + ++  +  A   
Sbjct  364  RSEAVRILMVLLVLNSIWMVRMFGHTAYTVLYYKCKKNHGEEVELEAEMEYTKIPTAPLL  423

Query  341  WMLIP  345
               IP
Sbjct  424  NESIP  428


>MBD3311948.1 hypothetical protein [archaeon]
Length=297

 Score = 51.7 bits (120),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 38/309 (12%), Positives = 94/309 (30%), Gaps = 25/309 (8%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             V+CP CG       +    + +   C +C Q L       Q T+T   +         +
Sbjct  1    MVKCPTCGG-----EATYVQQYNRYYCYKCKQYLPQQVKAPQSTKTGQALKQASKETKMQ  55

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                    ++  T    +    F L          + L                   G+ 
Sbjct  56   NNIGVINSLKKGTKLLFKNPAFFVLAAIPLTAQVLATLLL---------------QAGIG  100

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             +           + +  +    + L            +  +  + +    +     + I
Sbjct  101  QLMQTVTNTLMTAVTTMNINTIISILMGPLIITIITYAVIALVLMPIVHIGLLFGSKMAI  160

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
             + +V L   +K   + +G + L +ILL+L+            +   +   F   VL  +
Sbjct  161  TQGEVSLGPVLKQSFKKLGRYYLGIILLLLISLIPIAG-----IFITLIMSFLLQVLVIE  215

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            + G  ++++ +  L   +   +    ++ L + +    L   +  +G   ++  S++   
Sbjct  216  DKGIGESVKHAWKLGVKNIGKLLVFGIIGLALLIASGLLLNLLSGLGNIGSIITSVITIL  275

Query  303  FSFLYYYLI  311
              +  + L 
Sbjct  276  ILYPLFSLF  284


>NQZ57222.1 hypothetical protein [Lentisphaeraceae bacterium]
Length=366

 Score = 52.1 bits (121),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 19/164 (12%), Positives = 52/164 (32%), Gaps = 3/164 (2%)

Query  465  DDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELT  524
            D   +    +     + + +    N T   + ++G  S  L    + +      G +   
Sbjct  34   DLQEQSPDLKADWSLYTSKNGKVENITYSTNAYNGKLSWSL---RKYKDHAPFTGSVNFK  90

Query  525  LPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPL  584
                +E+  + +   GK++ +  K     ++  +   +     R  + N  A N+    L
Sbjct  91   TKTGVETFNVKKPVTGKSVTLKTKPGRALKVSFDKNHVSISSPRMAIDNFSAYNAKGAIL  150

Query  585  REIGFTWQKSGDAFSLRQMFDGNIESITVLVAGDSMTQSYPFEL  628
            +             +   ++ G + ++        +     F+L
Sbjct  151  KHETQKNSFKNGISTTTYVYWGKVNNVKFQTFSKEVKHQLLFDL  194


>WP_172344592.1 hypothetical protein [Prevotella sp. PCHR]NPE25109.1 hypothetical 
protein [Prevotella sp. PCHR]
Length=327

 Score = 51.7 bits (120),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 18/107 (17%), Positives = 39/107 (36%), Gaps = 9/107 (8%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            +    +L  I  ++  +           ++I  +QA+ K+  L    W  IF   +++ +
Sbjct  163  IAAVLVLCFIALIVCLIPLSLSLPAYIVEDITVMQAMSKAFRLGFKTWGGIFAVILVMYI  222

Query  274  ISLTLSFLTARIPYVGEAANLAF---------SLLLTPFSFLYYYLI  311
            +   +S  +A   Y+   A   F             +P   +  YL+
Sbjct  223  VVSVISGASAMPWYIMMIARSVFLVSDTADSAGFAASPVYTIIMYLL  269


>MBT40490.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=260

 Score = 51.3 bits (119),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 8/37 (22%), Positives = 10/37 (27%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V C  C         ++P      RC  C     
Sbjct  1   MI-VTCEGCSTRFQLDDERVPHGGIRVRCSCCKHAFF  36


>MBI3010989.1 hypothetical protein [Candidatus Omnitrophica bacterium]
Length=244

 Score = 50.9 bits (118),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 31/194 (16%), Positives = 59/194 (30%), Gaps = 0/194 (0%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
             +G     L+   L        L      +L  +  +   ++    V       + +  S
Sbjct  24   WFGRSWPVLVLCALFLIAASIWLYGGQIGYLAKRVTSQHASVSEFWVEGTRAFWALLGAS  83

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
            +   +      L   +   +    S      LL+L+            +   V   F   
Sbjct  84   LLGVLLFGGGVLGIVLIAAVGSAMSSATPQWLLVLLGVVFYAAAFGGWIWLGVRLVFWFV  143

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
             +  D +G L   + S     G WW  FG  VL ++I+L        +  +G  A    +
Sbjct  144  AIVGDRVGPLAGFQASFRATRGRWWRAFGLVVLFVLIALGCWLPFGLLEAIGNLAGGQAA  203

Query  298  LLLTPFSFLYYYLI  311
            L+    S +   + 
Sbjct  204  LVAGLVSQVGSSVA  217


>KAF5935985.1 hypothetical protein HYC85_027114 [Camellia sinensis]
Length=247

 Score = 50.9 bits (118),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 23/194 (12%), Positives = 56/194 (29%), Gaps = 0/194 (0%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
                  + +L   L    +               +          +  +           
Sbjct  1    MDQAPHLSVLLPHLPLRFLSPLHRRCSLHRRLSLHLQTLSPSPPPSPPFPAFSGGLFITY  60

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
            +++ +      +     L L  +   T   ILL   V   S+L ++  +     +     
Sbjct  61   LWVSLIMIIYYIVFVGFLVLLVIAVDTHNPILLFFSVVVISILFLVVHVYITALWHLASV  120

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            V   + + G  A++KS  L+ G    +       L I   ++ +   +   GE   +   
Sbjct  121  VSVLEPVFGFAAMKKSYELLKGRTRMVVVLVFGYLAICGVINGVFGAVVVHGEGYGIFVR  180

Query  298  LLLTPFSFLYYYLI  311
            +++  F      ++
Sbjct  181  IMVGGFLVGVMVIV  194


>VUT24393.1 hypothetical protein MOIL_00483 [Candidatus Methanolliviera sp. 
GoM_oil]
Length=239

 Score = 50.9 bits (118),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 19/102 (19%), Positives = 40/102 (39%), Gaps = 1/102 (1%)

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
              +  +   +VV    +   I G++F VWF++    +   +   + A+  S+    G +W
Sbjct  126  LVVASLNSWIVVAVAGIGFTIVGVIFAVWFWYVIPAVMLSDASAVGAISASKSFTKGKFW  185

Query  263  AIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
              F   + +  +   LS L   IP +G        + ++   
Sbjct  186  GTFLFILTMAALV-ALSALFTWIPSIGRVIQFLLLIPISALW  226


>MBD3788613.1 zinc-ribbon domain-containing protein [Sphingomonadales bacterium]
Length=151

 Score = 49.4 bits (114),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 12/77 (16%), Positives = 20/77 (26%), Gaps = 0/77 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP+C A+     S +P      +C  C Q        +         AT    G   
Sbjct  2   RLVCPNCAAQYEVDESVIPEMGRDVQCSNCGQMWFQPGKAAMAEAEAPAAATPEPDGWDI  61

Query  63  RIPSDRLEIQSKTVNCR  79
               +    +       
Sbjct  62  AEGLEHSPAEPAPEPAH  78


>NCW14115.1 hypothetical protein [Rhodobacteraceae bacterium]
Length=238

 Score = 50.9 bits (118),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 11/44 (25%), Positives = 15/44 (34%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  V CP C A    P   +P      +C  C  T      + +
Sbjct  3   MRLV-CPKCDARYEVPEENIPTTGRDVQCSACDTTWFQKHPQME  45


>MBI5810497.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=69

 Score = 47.1 bits (108),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 11/61 (18%), Positives = 19/61 (31%), Gaps = 1/61 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  ++C  C  +     S++       RC +C +  I  P    +T         P    
Sbjct  1   MI-IQCGKCSTKFRLDDSRITGGGVKVRCTKCQEVFIVTPPPPPKTPREVCSMANPPNPS  59

Query  61  Q  61
            
Sbjct  60  P  60


>NRA02540.1 zinc-ribbon domain-containing protein [Myxococcales bacterium]
Length=439

 Score = 52.1 bits (121),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 9/44 (20%), Positives = 16/44 (36%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  + C  C       + ++PA     RC +C      +P +  
Sbjct  1   MV-ITCEQCQTRFRVDAQQIPAAGVRVRCSQCQHRFFVEPQDLP  43


>RJL06555.1 hypothetical protein D3P06_03555, partial [Paracoccus aestuarii]
Length=139

 Score = 49.0 bits (113),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 12/36 (33%), Gaps = 0/36 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
             + CP C A    P+  +P       C +C     
Sbjct  4   IELICPGCAARYALPADAIPPAGREVECSDCGHVWQ  39


>OUX70770.1 hypothetical protein CBD00_02145 [Rhodospirillaceae bacterium 
TMED140]
Length=273

 Score = 51.3 bits (119),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 24/185 (13%), Positives = 54/185 (29%), Gaps = 28/185 (15%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            I LA +      L+  T +    +    V + ++M        +  L  +L+ L +    
Sbjct  63   IWLAALFVSATLLAIPTQAGLNLVRTGSVSIGQAMGKSFGRWFALGLAFLLMQLGIAVLE  122

Query  219  LLLIIPG-----------------LLFCVWFFFCQYV-LADDNIGGLQALEKSRLLVSGH  260
            ++L+                    ++  +       + +  D  G + A  +S  L  GH
Sbjct  123  IILVWVPSELNFMLGVIGSIVNTIVVIAISLALAPLLLVILDGSGAMDAFGRSLRLTQGH  182

Query  261  WWAIFGRFVLLLVISLTLSFLTARIPYVGEAA----------NLAFSLLLTPFSFLYYYL  310
              +I    +L  +    +      I  +                   +    F+  +Y +
Sbjct  183  RLSILLFGLLAALAVFVVMLAAGIIGAIAFFLLDLFLPRAAAGTIVGIFGALFALCFYSV  242

Query  311  IYSDL  315
            I    
Sbjct  243  ISIYF  247


>MBC1391767.1 DUF975 family protein [Listeria welshimeri]
Length=170

 Score = 49.8 bits (115),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 25/98 (26%), Positives = 46/98 (47%), Gaps = 1/98 (1%)

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEK  252
              G +  G   L  +L+ +     SLLLI+PG++    +    ++L D+ NI  L A+ +
Sbjct  1    FSGFKQFGRTFLAYLLISIFTFLWSLLLIVPGIIKTYSYSQTFFILRDNPNISALDAITE  60

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            SR +++GH   +FG  +  L+  L    +      +  
Sbjct  61   SRHMMNGHKGRLFGLSLTFLLWYLIPLAVAIAGTVIIA  98


>WP_158010726.1 hypothetical protein [Tardibacter chloracetimidivorans]
Length=264

 Score = 51.3 bits (119),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 15/151 (10%), Positives = 39/151 (26%), Gaps = 7/151 (5%)

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
             L  +       + +    + + + + +      +       +  +   G ++     L 
Sbjct  96   ALAPAIGVQFAILSVVCAFLIVGQLIAMLFAGGAAQAGDTAAMTRLAVIGMVIAGPAILY  155

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT-----  282
                       L  + +  L+AL ++  L SG    I    ++   + L L         
Sbjct  156  ALARLAVVFPALLLERLTPLEALRRAFRLTSGSAVPILALILVATFLYLFLQLALGTAVG  215

Query  283  ARIPYVGEAAN--LAFSLLLTPFSFLYYYLI  311
                 +G          L     +     + 
Sbjct  216  GVFMLLGRLIGVESLGLLFTLVLTAALGTVA  246


>HAS78877.1 hypothetical protein [Ruminococcus sp.]
Length=460

 Score = 52.1 bits (121),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 21/207 (10%), Positives = 52/207 (25%), Gaps = 16/207 (8%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL--  271
            V    ++L    L   + + F       ++     + +KS  LV G+++      +L   
Sbjct  33   VLLIYIILASVALFLFIRWIFSMNYYVYEHKSFRISRKKSACLVKGNFFKTLIPVILWEI  92

Query  272  --LVISLTLSFLTARIPYVGEAANLA------------FSLLLTPFSFLYYYLIYSDLKA  317
              L+I +  +F+ + I Y+     +               L    F  +     +   + 
Sbjct  93   SALLIFIAAAFIFSCIIYLLSVKGIISKGSAVISYISRIVLSGFSFLAVPVIFSFVCTQC  152

Query  318  NYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQP  377
                          +    A+  +  +  +        +    +  +  K          
Sbjct  153  RCASCGISGDFFPKILQKPALSIFYAVIFIAAALNIFYSYETRKNGTGIKMEMFNSTEIS  212

Query  378  QQTPDLNRSLPEEPQRLSSADYKLLLS  404
                D   +          A       
Sbjct  213  AHRGDCRHAPENTIPAFELAVENKADW  239


>MAP90714.1 hypothetical protein [Candidatus Poribacteria bacterium]
Length=301

 Score = 51.7 bits (120),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 25/171 (15%), Positives = 57/171 (33%), Gaps = 12/171 (7%)

Query  145  ATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT  204
              +      ++    L + V+  L+        +  Y  K        +    ++     
Sbjct  122  LEFPYSSLNSFLSGFLHSFVSAPLILGYNGIYILTAYRKKRYQLDLIFVGFSFQYYFLVL  181

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWA  263
                +       G + LIIPG++         +++ DD +I   +AL+KS  ++ G  W 
Sbjct  182  SAYWITTAFTVVGFVFLIIPGVIISFSLAMVNFIIVDDPDILPFEALKKSYQMMKGFKWK  241

Query  264  IFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
                 +  +  S+           +G        L ++P+  +   + Y +
Sbjct  242  FCCLGLRFIGWSI-----------LGILTIGIGFLWISPYYSMSAAIFYEN  281


>NOK61900.1 hypothetical protein [Chloroflexi bacterium AL-N1]NOK67137.1 
hypothetical protein [Chloroflexi bacterium AL-N10]NOK74570.1 
hypothetical protein [Chloroflexi bacterium AL-N5]NOK81739.1 
hypothetical protein [Chloroflexi bacterium AL-W]NOK89209.1 
hypothetical protein [Chloroflexi bacterium AL-N15]
Length=331

 Score = 51.7 bits (120),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 30/238 (13%), Positives = 69/238 (29%), Gaps = 26/238 (11%)

Query  105  QLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATV  164
              L   + +        +   L  +++    +     L      N  +    +  +   +
Sbjct  76   YWLFGDFTVSLNNTIWSISALLFVLIITLPLLIRDFALLHLGVSNRASHPHAYVSVGLIL  135

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
            A +LL ++     +      +           L       +  +L++L V          
Sbjct  136  AVLLLFVAGGLCILPGLYLPSPDNFVALFSSPLLFWSIIDMRSLLMVLPVSLMGF-----  190

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL---  281
              +          V   + +  L +L +S  LV   +W +    V+L V+   + +L   
Sbjct  191  --IVYTMLSVAPCVAILEQVKPLTSLRRSWQLVRSSFWRVATMVVMLGVLMQCVGWLPRF  248

Query  282  ------------TARIPYVGEAANLA----FSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                         + I   G    L       ++  P     + ++Y DL+  + G  
Sbjct  249  FVIVILSTLNPGISEIFIWGIIFALVLVQFGMMIYLPIQVGAFTILYYDLRVRHEGYD  306


>HEB78997.1 thioredoxin [Rhodospirillales bacterium]
Length=32

 Score = 45.9 bits (105),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 9/32 (28%), Positives = 13/32 (41%), Gaps = 1/32 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPEC  32
           M  + CP C    +   S L     + RC +C
Sbjct  1   MI-ITCPACSTNYSIDPSALGGAGKTVRCSKC  31


>MBE6554379.1 DUF975 family protein [Ruminococcaceae bacterium]
Length=320

 Score = 51.7 bits (120),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 27/207 (13%), Positives = 61/207 (29%), Gaps = 12/207 (6%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F       +G  +L     F              +           +   + Y ++ +  
Sbjct  105  FGFCLRLFVGSPVLLGYQRFTLDIVDGKPVTLPSIFRFFSTCYGKSVWLRLIYEVIFMLL  164

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL-LILLILVVGGGSLLLIIPGLLFCVWF  232
                  + +          +++    +    +  ++LL+  +   +++ +I  +     +
Sbjct  165  SLPLAAVSVIGVWETRHAILRVAEGQMAPADITAIMLLVSGIFLLAIVTVILQIWLQYRY  224

Query  233  FFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
             FC  +LA+   +G L A   S  L+ G  W+ F      +   L            G  
Sbjct  225  AFCFMILAEYPEMGVLDAFRNSASLMRGKKWSYFCLQFSFVGWVLLA----------GMC  274

Query  292  ANLAFSLLLTPFSFLYYYLIYSDLKAN  318
                  L L P+      + Y D+   
Sbjct  275  TCGIGMLFLAPYMDAASAVFYDDITNR  301


>HIA92278.1 hypothetical protein [Candidatus Saccharibacteria bacterium]HIO87767.1 
hypothetical protein [Candidatus Saccharibacteria 
bacterium]
Length=236

 Score = 50.9 bits (118),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 31/153 (20%), Positives = 66/153 (43%), Gaps = 2/153 (1%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
              +   +  V    + +   +    +     +  +     LG +H  +F  + ++++L V
Sbjct  75   AFFIATIVLVIVASILVEAGSIKGLLKNGSKEFSVKEGFDLGKKHFMAFLGVSLVVVLYV  134

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              G +LLI+PG+LF +W+     ++ D NI    A+++S+ LV  +   +   +V+ ++I
Sbjct  135  LFGLILLIVPGVLFAIWYSLAPIIVLDKNISTSDAMKQSKDLVRSNMGVLAKNYVVFVLI  194

Query  275  SLTLSFLT--ARIPYVGEAANLAFSLLLTPFSF  305
            S+ L  L   A    +       F   L    +
Sbjct  195  SVVLGSLFDDAFSSAIYNLFASGFGAALYVVIY  227


>MBH2006988.1 hypothetical protein [Candidatus Saccharibacteria bacterium]
Length=315

 Score = 51.7 bits (120),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 27/160 (17%), Positives = 53/160 (33%), Gaps = 0/160 (0%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
                      +    +          ILL T+    +       S      +T+     +
Sbjct  104  APFDNPNFAYEQIVSIAVIVGGIFLVILLLTLVISAIINGMRDVSAAAIAKQTETSFGTA  163

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
            +         +  L I++ + V   SLLL++PG++  + +          ++    AL+ 
Sbjct  164  VSTLAHRFPGYVWLQIMISVKVFLWSLLLVVPGIIMAIRYSLAGTAYFARDMSASDALKH  223

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
            S  +  G W+  FG   L  +++L L  L       G   
Sbjct  224  STTITKGAWFTTFGSLGLFNLVTLGLIPLLIEPGVRGILF  263


>MBA3761007.1 zinc-ribbon domain-containing protein [Gemmatimonadales bacterium]
Length=166

 Score = 49.8 bits (115),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 10/33 (30%), Positives = 13/33 (39%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           V CP+C        +K+P     ARC  C    
Sbjct  3   VTCPNCATIYRVDPAKVPESGVRARCAVCSAVF  35


>WP_028099953.1 hypothetical protein [Dongia sp. URHE0060]
Length=280

 Score = 51.3 bits (119),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 22/160 (14%), Positives = 49/160 (31%), Gaps = 6/160 (4%)

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
             +      +  +  +      V +   +      V +  L+  L   +     L  +   
Sbjct  110  LLQHLSRALLSAPSVIAAGLFVSIVYGISF-FVLVLAAGLVGTLHWALGAIVGLPGLAVL  168

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV-----ISLTLSF  280
            +   V ++     +  +  G      +S  L  G+ W +F   +L+         + L  
Sbjct  169  VALMVRWWVLLPAIVIEQTGPFACFTRSSRLTEGNRWQVFAVLLLVYAPEGLVKVVLLLL  228

Query  281  LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                        N+  S L   F+ +   +IY+ L+A   
Sbjct  229  TPLLGTPFVAVLNILISGLFIVFNAVATVMIYAHLRAIKE  268


>TDJ12461.1 hypothetical protein E2O66_07375 [Deltaproteobacteria bacterium]
Length=562

 Score = 52.1 bits (121),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 9/37 (24%), Positives = 13/37 (35%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  + C  C         ++PAK +  RC  C     
Sbjct  1   MI-ITCEQCETRFQLDDERVPAKGAKVRCSRCRHAFF  36


>SHW22474.1 proline and glycine rich transmembrane protein [Mycobacteroides 
abscessus subsp. abscessus]SIL20873.1 proline and glycine 
rich membrane protein [Mycobacteroides abscessus subsp. abscessus]
Length=112

 Score = 48.2 bits (111),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 20/109 (18%), Positives = 44/109 (40%), Gaps = 8/109 (7%)

Query  210  LILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV  269
            + + V  GSLL+I   +       +  +   D  +G + AL+ S  LV  +     G+ +
Sbjct  1    MGVGVLIGSLLIIGGVIFGFFA-QYAVFFAIDRGLGPVDALKASFQLVKDN----LGQAL  55

Query  270  LLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
            ++ +I+L ++F    + +          ++  P +     LI+      
Sbjct  56   VVFLITLGVAFGGFALTF---ITCGLGGIIAYPAAGALTGLIHVYTYRR  101


>PKL36072.1 hypothetical protein CVV44_17785 [Spirochaetae bacterium HGW-Spirochaetae-1]
Length=461

 Score = 52.1 bits (121),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 36/308 (12%), Positives = 82/308 (27%), Gaps = 17/308 (6%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRT----QTTDNIATCPHCG  59
            + CP C +  +     +P ++    C +C  +        Q        T    T  +  
Sbjct  26   ILCPGCESFYSLTYQLVPYQRYMMVCKKCSASFTVTFPVMQGQVIKTDKTIIHDTTGNSM  85

Query  60   LQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGW  119
            + R   S R  +    V  RR +                    +   L            
Sbjct  86   MFRMENSVRDAVNRVPVYGRRHSGERRETAHEAAPCGVDENDGVFGSLLSVIGKTFSPAK  145

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
              L I ++ I+L                    +      ++   V Y++   +    ++ 
Sbjct  146  IALAIPVVIIILLALTYLRGRAPVFVRRGAVDSFLNIPVMVSILVLYVITATAIARVTVE  205

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
              +    +G+ ++    LR  G  +++  LL  ++ G   L  +  ++   +F    + L
Sbjct  206  DMLHGRAMGIRKAGMFALRKAGLVSMVNFLLCGLLFGAFTLFGMIPVVGPFFFALSFFPL  265

Query  240  A-------------DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
                                  A  +         +  F     + ++ L    +     
Sbjct  266  YVITVIIIMMMITGFWFYPPFLACSRENEENKEKTFLHFLLRHFMALLYLVPIMIIVCAI  325

Query  287  YVGEAANL  294
                  +L
Sbjct  326  IFSAVFSL  333


>KPJ48918.1 hypothetical protein AMJ41_03980 [candidate division Zixibacteria 
bacterium DG_27]
Length=227

 Score = 50.5 bits (117),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 22/187 (12%), Positives = 51/187 (27%), Gaps = 24/187 (13%)

Query  140  LLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH  199
            +      ++      +   + +  V  +   L      + + + +        +  G R 
Sbjct  26   IGGFVVFFIIEFVAGFIPVVNIIFVLLVTPALVGGLMILSLNLARNSSPEVGDVFKGFRK  85

Query  200  VGSFTLLLILLILVVGGGSLLLII------------------------PGLLFCVWFFFC  235
             GSF     L ++V     + L I                          L+  +  +  
Sbjct  86   YGSFLGAYWLFVVVYLVCLIPLFIGLGVDTARGSEPAALTIILGFVSLVVLIIAMLRWCM  145

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
             Y L  +     +A  KS  +  G+   IF   ++  +I +    +     +V     + 
Sbjct  146  VYYLVAEGSSVTEAFTKSSKITEGYRGTIFLLGIVNFLIIIAGLLVLGLGYFVAAPIAMI  205

Query  296  FSLLLTP  302
                   
Sbjct  206  AFASAYI  212


>WP_120068225.1 hypothetical protein [Halococcus sp. IIIV-5B]RJT08134.1 hypothetical 
protein D3261_01035 [Halococcus sp. IIIV-5B]
Length=143

 Score = 49.0 bits (113),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 26/106 (25%), Positives = 41/106 (39%), Gaps = 4/106 (4%)

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
            +  +  V    +  ++  +  G   +LLIIPGL            +  +  G L  L++S
Sbjct  5    RSRILLVVFVVVAYLVTYVAAGIAGILLIIPGLYVAYRLSLTTAAIMIEAQGPLAGLKRS  64

Query  254  RLLVSGHWWAIFGRFVLL----LVISLTLSFLTARIPYVGEAANLA  295
              L  G+ W IFG  +      LVI +    +   IP    AA   
Sbjct  65   WALAGGNVWTIFGVNLAFLAVGLVIFVVALLVGGGIPSGANAATGL  110


>NQY39059.1 zinc-ribbon domain-containing protein [Henriciella sp.]
Length=217

 Score = 50.5 bits (117),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 6/37 (16%), Positives = 12/37 (32%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M    CP C  +     + +     + +C  C  +  
Sbjct  1   MIL-TCPECETQYFADDATIGESGRTVKCATCGHSWF  36


>GAF40281.1 integral membrane protein [Agrilactobacillus composti DSM 18527 
= JCM 14202]
Length=227

 Score = 50.5 bits (117),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 26/203 (13%), Positives = 61/203 (30%), Gaps = 18/203 (9%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                    +            +      A ++          I    +    +  +++  
Sbjct  10   FNTWFGRGFGGYGYRHGYYGGNHFRGGGAAFMGNPFGFLTNTIPGIVLTIATVSATFILI  69

Query  177  --SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
                   +          +     +     L+ IL  +     +LLLI+PG++  + +  
Sbjct  70   DALRQKQVSGQPFRAAFKIFTRGEYFIGTVLINILTTIYTALWTLLLIVPGIVKSLAYSQ  129

Query  235  CQYVL-----ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
              YV       D  I   +A+ +SR+L+ GH W +F  ++  +   L +           
Sbjct  130  SAYVYRDAVDNDQQITYSEAITRSRVLMKGHKWELFVIYLSFIGWYLIVG----------  179

Query  290  EAANLAFSLLLTPFSFLYYYLIY  312
                   ++ + P+  +     Y
Sbjct  180  -LTAGLAAIWVQPYLHMTLANFY  201


>NNC43223.1 hypothetical protein [Acidimicrobiia bacterium]
Length=144

 Score = 49.0 bits (113),  Expect = 4e-04, Method: Composition-based stats.
 Identities = 16/138 (12%), Positives = 39/138 (28%), Gaps = 13/138 (9%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
            ++  L+      G    ++PGL+   W      ++ +++ G    + ++  L  G    I
Sbjct  2    VIGYLVFGASLMGLAAGVLPGLVILAWAGIAVPIVIEEHGGLFDTIARTWRLTRGVRVTI  61

Query  265  FGRFVLLLVISLTLSFLTAR-------------IPYVGEAANLAFSLLLTPFSFLYYYLI  311
               ++   +I   L+                     V        +    P   +   ++
Sbjct  62   LAFYLWFALILGVLTAAVWITAFGLEALGVGLDPSVVLWPLAFVTAAAAIPIFPVGLAVM  121

Query  312  YSDLKANYRGPQHPPIKR  329
            Y D +             
Sbjct  122  YVDARVRNEAFDLQQRLE  139


>MBC8277666.1 zinc-ribbon domain-containing protein [FCB group bacterium]
Length=68

 Score = 46.7 bits (107),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 11/34 (32%), Gaps = 1/34 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M  + C  C    N   S +    S  RC  C  
Sbjct  1   MI-ITCEKCNTSFNLDESLVKPAGSKVRCSVCKH  33


>MBI2415304.1 hypothetical protein [Candidatus Kerfeldbacteria bacterium]
Length=261

 Score = 50.9 bits (118),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 23/167 (14%), Positives = 53/167 (32%), Gaps = 12/167 (7%)

Query  143  KPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGS  202
              +     Q     +   +  +   ++       + F+ +    + +    +  +     
Sbjct  101  HVSVKNIFQTTLSYFWSYVMVLLITVVLFGIAIIASFVLVAGIMILIGLVDRSAIDVWYP  160

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
            +    I  + +      +             F  Y L +   G   A+  S  +V G W 
Sbjct  161  YLATYIPSLAIGVVAFGV------------MFAPYHLVEQRAGAWAAIRTSIQVVRGQWV  208

Query  263  AIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
             +  R ++LL+    ++F+   IP VG A  +  S ++        Y
Sbjct  209  GLLIRELILLLSLSVITFIIQFIPVVGTALGILISTIILTTYNYILY  255


>OLA64591.1 hypothetical protein BHW56_06520 [Acetobacter sp. 46_36]
Length=370

 Score = 51.7 bits (120),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 8/68 (12%), Positives = 17/68 (25%), Gaps = 0/68 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP C +    P   +     + RC  C        +++   +               
Sbjct  2   LISCPKCHSIYEIPDDLIGKTGRNFRCQACANVWHAMRSDALGYEEEKESEPFIEPIPVS  61

Query  63  RIPSDRLE  70
             P+    
Sbjct  62  EPPARPWP  69


>PIE74612.1 hypothetical protein CSA18_04495, partial [Deltaproteobacteria 
bacterium]
Length=190

 Score = 50.2 bits (116),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 11/37 (30%), Positives = 12/37 (32%), Gaps = 0/37 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V C  C      PS K+       RC EC     
Sbjct  1   MRIVTCEACQTRYKIPSDKIKESGIKVRCSECKHEFH  37


>HBW20264.1 hypothetical protein [Actinobacteria bacterium]
Length=389

 Score = 51.7 bits (120),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 20/146 (14%), Positives = 47/146 (32%), Gaps = 27/146 (18%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG------  215
              +   ++    +T  +   +    + +  + ++    + +     +L  LV+       
Sbjct  244  VGLVASIVLTGMLTAVIGRGVLGRRLTIGEAWQVAAPRLPAVLGASVLTTLVIIALWVPY  303

Query  216  ---------------------GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR  254
                                  G L ++   +     F     V+  +  G +QAL++S 
Sbjct  304  VVILAILIAVKAGALAAIFGVLGGLAMVCVTVAAWAMFNMVAPVVVLEGQGPVQALKRSF  363

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSF  280
             LV   +W + G  +L  VI    + 
Sbjct  364  RLVRASFWRVLGILILTYVIVAIAAL  389


>WP_121876654.1 hypothetical protein [Umboniibacter marinipuniceus]RMA79957.1 
hypothetical protein DFR27_1309 [Umboniibacter marinipuniceus]
Length=225

 Score = 50.5 bits (117),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 22/202 (11%), Positives = 58/202 (29%), Gaps = 0/202 (0%)

Query  105  QLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATV  164
                  +  +       L +           +      +    L          +    +
Sbjct  9    FKFFKQYWRYLVLFVVPLALIEHFAFRNLIDLELIQAAQTDEALAEALAGQFPTLATLEL  68

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
             +  L    +  +    +    +     +   L       L   L  +++  G LL I+P
Sbjct  69   IFKPLIEGVVITATLSAVMSGALVTRVVLNRLLSVSPHLLLAFALRTVLIVFGLLLFIVP  128

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            G+   +       ++ D ++G L A+++S  L       +F   + +++      F   +
Sbjct  129  GVYLYLRLILMPVIVVDKSVGALSAMKQSWQLTGPAMGTVFIGLLQMVMWVFFPLFFIGQ  188

Query  285  IPYVGEAANLAFSLLLTPFSFL  306
            +   G A     ++     +  
Sbjct  189  VVPTGAAMFAWAAISAVALATF  210


>NLL50801.1 hypothetical protein [Eubacteriaceae bacterium]
Length=408

 Score = 51.7 bits (120),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 35/329 (11%), Positives = 98/329 (30%), Gaps = 11/329 (3%)

Query  82   NRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALL  141
               F          +      + + L   + ++     G L I  + + + +    S + 
Sbjct  61   PGIFQDPTMAYGMTNQQMAAFVMKFLPMLFLVYVFALAGALYISNVSVAMCYLITDSYVA  120

Query  142  LKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM-KLGLRHV  200
                 +       ++ A+ +    ++   L      +F  +     G+        L+  
Sbjct  121  DTKRPFGEMLKYAFRRALPMLGTEFLYFLLMGGIWMVFSTVVALLFGVMFIANATQLQTF  180

Query  201  GSFTLLLILLILVV-GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
               ++LL +  + V     + +++  +   + + F     A       Q+++ SR+L  G
Sbjct  181  DDASVLLGMWQIWVVLVLWIAMLVFTIYLMIRYSFSILGRAKYKFSAYQSMKYSRILTRG  240

Query  260  HWWAIFG--------RFVLLLVISLTLSFLTARIPYVG-EAANLAFSLLLTPFSFLYYYL  310
                IFG          + +++I+L    +       G     +  S+ L+ F+  +  +
Sbjct  241  KVAKIFGNLLLIALLTGLPIVLITLGTGHVLDASAITGLALLTMLLSIFLSLFTSTFQSV  300

Query  311  IYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQ  370
            ++ +  A         + +    L  + F             S      +    A     
Sbjct  301  LFINFDAVRGSSVISMLDKPVHNLKDSSFYDDAGVQAPFAPNSGGGAVTQAAPVATTAAF  360

Query  371  QRLGTQPQQTPDLNRSLPEEPQRLSSADY  399
                 + +     +  + ++     +   
Sbjct  361  AGEQEKTEHVVAPDEQMKKDNMVQEAELT  389


>WP_152142453.1 zinc-ribbon domain-containing protein [Amylibacter sp. SFDW26]KAB7614784.1 
hypothetical protein F9L33_09195 [Amylibacter 
sp. SFDW26]
Length=467

 Score = 52.1 bits (121),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 9/43 (21%), Positives = 16/43 (37%), Gaps = 1/43 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
           M  + CP+C A+       +P +    +C  C      D  + 
Sbjct  1   MRLI-CPNCVAQYEVDQDVIPPEGRDVQCANCGHNWFQDSIQM  42


>MBJ7725272.1 putative Zn finger-like uncharacterized protein [Caulobacter 
sp. OAE837]
Length=128

 Score = 48.6 bits (112),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M    CP C A       ++  +    +C  C Q  
Sbjct  1   MIL-TCPRCAARYVVGEDQVGPQGRKVKCTTCGQIW  35


>NEE07014.1 hypothetical protein [Streptomyces sp. SID7499]
Length=168

 Score = 49.4 bits (114),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 25/149 (17%), Positives = 39/149 (26%), Gaps = 28/149 (19%)

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
              L       +   G     +  L   V F      L  +    + AL +S  LV G WW
Sbjct  1    LLLGSEAGAALAFLGFAAACVAMLWLNVSFALAAPALMLERQSIVAALRRSTKLVRGAWW  60

Query  263  AIFGR-----------FVLLLVISLTLSFLT-----------------ARIPYVGEAANL  294
              FG              L+ +    ++ +                       V     +
Sbjct  61   RTFGVLALTWLLTFLLTFLVSIPFGIIAVIVDGTDVSEFLNGTAPSFGWSFLIVTGIGEV  120

Query  295  AFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
              S L+ PF      L+Y D +       
Sbjct  121  IVSTLIYPFVAGVMALLYVDQRIRREALD  149


>PME47739.1 hypothetical protein BCV34_17405, partial [Vibrio lentus]PME56774.1 
hypothetical protein BCV30_18515, partial [Vibrio lentus]PME93378.1 
hypothetical protein BCV27_21530, partial [Vibrio 
lentus]PMH94185.1 hypothetical protein BCU56_20415, partial 
[Vibrio lentus]PMI11428.1 hypothetical protein BCU53_22480, 
partial [Vibrio lentus]
Length=168

 Score = 49.4 bits (114),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 26/155 (17%), Positives = 49/155 (32%), Gaps = 1/155 (1%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQW-AILLATVAYILLGLSWMTGS  177
                       +      F   +     WL       +        +       +++   
Sbjct  8    CLHFTCSNFKSIFKIFGGFIITMSCLGVWLEHSFYVSENLWAYAVYLCVYSFIYTYLIAI  67

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
               ++  +  G      +  R      ++ I+  L+V  G++ LIIPGL     + F ++
Sbjct  68   FINFMASSTNGFDIERSVSWRVWSRLMIVYIIYSLIVLVGTIALIIPGLYLAARYSFVEF  127

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
                +N   L ALEKS     G    +    +LL 
Sbjct  128  EAVLNNKSPLVALEKSWRDTKGITMKLIKISLLLG  162


>PIN70052.1 hypothetical protein COV93_03080 [Candidatus Woesearchaeota archaeon 
CG11_big_fil_rev_8_21_14_0_20_43_8]
Length=284

 Score = 50.9 bits (118),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 23/220 (10%), Positives = 67/220 (30%), Gaps = 3/220 (1%)

Query  105  QLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATV  164
                 ++ L  +R   ++  Y +  +  F         +   +            +    
Sbjct  68   PWYKKNYHLIWKRFIPVMAGYAIMYICFFIVGLMQGDSEDLVFGPAALAFSAIGFIAILF  127

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
              +   L   + +    + K    +F       R +    +L  + +LV+  G   L++P
Sbjct  128  FVVYYFLINTSLARSYLLMKDKKFVFSWASG--RQIFRVIILTAIFMLVMLIGLPFLLLP  185

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
             L   +   + Q++L   ++  ++++++   L+  ++W +         +   ++F    
Sbjct  186  TLAASLLAQYAQFILLRYDLTVIESIKRGIALMRTNFWDMLFYVGCYGGVIFCVNFAIGL  245

Query  285  IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
            +  V    +    L    F          +          
Sbjct  246  LGVV-PLMSELGMLAAMVFVHPAAMFFLVETFNKMDDQSR  284


>MBI3021119.1 hypothetical protein [Candidatus Omnitrophica bacterium]
Length=124

 Score = 48.6 bits (112),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 19/97 (20%), Positives = 32/97 (33%), Gaps = 0/97 (0%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
                       +   V   F    +  D +G L   + S     G WW  FG  VL ++I
Sbjct  1    VVFYAAAFGGWIWLGVRLVFWFVAIVGDRVGPLAGFQASFRATRGRWWRAFGLVVLFVLI  60

Query  275  SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
            +L        +  +G  A    +L+    S +   + 
Sbjct  61   ALGFWLPFGLLEAIGNLAGGQAALVAGLVSQVGSSVA  97


>TAJ25767.1 hypothetical protein EPO67_20125, partial [Reyranella sp.]
Length=162

 Score = 49.4 bits (114),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 7/33 (21%), Positives = 13/33 (39%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           V C +CGA     ++ +     + +C  C    
Sbjct  3   VTCSNCGARYAVDATAIGPTGRTVQCVRCGHRW  35


>MSP52232.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=249

 Score = 50.9 bits (118),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 12/47 (26%), Positives = 18/47 (38%), Gaps = 1/47 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQ  47
           M  + CP+C    N   + L     + +C  C  T    PA  +  Q
Sbjct  1   MI-ITCPNCETRFNIDPAVLQPSGRAVKCMRCAHTWTERPARERPRQ  46


>MBI3494833.1 hypothetical protein [Candidatus Saccharibacteria bacterium]
Length=290

 Score = 51.3 bits (119),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 32/166 (19%), Positives = 60/166 (36%), Gaps = 14/166 (8%)

Query  139  ALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLR  198
              L     +L   +      +++A VA         T                ++   + 
Sbjct  83   PWLAAVVAFLVLFSTLIIAFMIVAGVALNTFITGLFTYVALENDAGRTARFGAALDAVVS  142

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD---NIGGLQALEKSRL  255
                  L  +L +  + G +LLLI+PG++    +    YV+ D+     G  ++ E+ + 
Sbjct  143  RFWRLFLAQLLALAKIFGWTLLLIVPGIVAAFRYALLPYVIMDESAKEKGVKKSHERVKT  202

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            LV G    +FG           ++ + A IP+VG    L  +  L 
Sbjct  203  LVQGRKREVFG-----------VATVAAIIPFVGGINQLVGNAALY  237


>WP_167175055.1 zinc-ribbon domain-containing protein [Brevundimonas terrae]NIJ25721.1 
putative Zn finger-like uncharacterized protein [Brevundimonas 
terrae]
Length=313

 Score = 51.3 bits (119),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 8/44 (18%), Positives = 12/44 (27%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M    CP C        + +       RC  C       P ++ 
Sbjct  1   MIL-TCPACATSYFVSDAAIGPLGRRVRCKACGHDWRAMPEDAP  43


>PYP42224.1 hypothetical protein DMD43_03640 [Gemmatimonadetes bacterium]
Length=173

 Score = 49.4 bits (114),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 14/87 (16%), Positives = 33/87 (38%), Gaps = 9/87 (10%)

Query  218  SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA---------IFGRF  268
             +L ++PG+   + F     V+ ++ + GL A+ +S  L   +            +F   
Sbjct  1    MMLCVVPGIYLGLLFSLTIPVMVEEGLFGLAAMRRSAELTRYNPQRDLDADPRFKVFVIV  60

Query  269  VLLLVISLTLSFLTARIPYVGEAANLA  295
            ++  ++   +  L      V +   L 
Sbjct  61   LVGTLLGWVVGMLVQLPMIVVQEVMLL  87


>HHW31036.1 hypothetical protein [Clostridiaceae bacterium]
Length=240

 Score = 50.5 bits (117),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 32/195 (16%), Positives = 70/195 (36%), Gaps = 6/195 (3%)

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
             L  ++    +  +                     + T   + L  S    +    +   
Sbjct  31   YLTPIIFTISLLISFSSAFLPVDFDITAPAGIFYNILTTVIMYLITSIYLSAYIRELKNE  90

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG  245
            +  L    KL L+ +    + LI+ IL +  G++LL++PG++  + F F   ++ D N  
Sbjct  91   EYNLGICTKLVLKRIFKILVALIIYILSIAMGAVLLVVPGIILYLMFIFNACLIIDTNEP  150

Query  246  GLQALEKSRLLVSGHWWAIFGRFVLLLVI------SLTLSFLTARIPYVGEAANLAFSLL  299
             + +   S+ L  G    IF   +L  ++         L+ +++    +        S++
Sbjct  151  LIASFTLSKRLTDGKKTYIFSLMLLFNLLLFMPVSFTFLAAMSSNSNLIFNFITSFVSVI  210

Query  300  LTPFSFLYYYLIYSD  314
            +         L+Y D
Sbjct  211  VNIMQQRMIALMYVD  225


>NTU70650.1 hypothetical protein [Coriobacteriia bacterium]
Length=589

 Score = 52.1 bits (121),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 33/315 (10%), Positives = 79/315 (25%), Gaps = 9/315 (3%)

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
            L  +    G +L+ +  +   + + F      +    G  AL +SR L +G W  I G  
Sbjct  167  LYWIGGVLGVVLVSM-WVYLALRWVFSVQAFVEQGYRGRAALRRSRELTAGRWVRIGGSI  225

Query  269  VLLLVISLTLSFLTARIPYV------GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGP  322
              L++  +    + + +  +      G    LA  L +         ++           
Sbjct  226  AALVLGLMLAGVVVSLVSRLAVMSASGSVGWLAVWLAVNSMLSAALAVLIFSALQALVMS  285

Query  323  QHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPD  382
            ++   + + +  ++A          + + +          +              Q    
Sbjct  286  RYLRARSRSIAPSSAERAGRAPWTAIGLIVLAAATVGALGVLVVSVTLGGPIAPTQIELI  345

Query  383  LNRSLPEEPQRLSSADYKLLLSKQRKT--TSEGGLSLGPVTLFADRFWADDQNPHLWLKL  440
             +R+        + A  +                 S G + +  D          + +  
Sbjct  346  AHRAGEAYAPENTVAAVEQARKDNADRLEFDVQRTSDGQLVVVHDADLLRLSGKDISVGG  405

Query  441  ELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGI  500
                        KG     + + LD         +               T      +  
Sbjct  406  STLAEVQAVDIGKGQHVPTLIEFLDAAGDTPLALEIKTHPGDKQSTEEVVTLLQARGAID  465

Query  501  RSIYLRQGTQAEQVH  515
            R++ L        + 
Sbjct  466  RTVLLSLDPAITDLA  480


>OUZ99439.1 hypothetical protein BVC80_1801g4 [Macleaya cordata]
Length=138

 Score = 48.6 bits (112),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 21/109 (19%), Positives = 48/109 (44%), Gaps = 0/109 (0%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
             ++   I+++   ++  +I  +   V +   + V   +   GL+A++KS+ L+ G  W  
Sbjct  6    AMITYYIVILIILAIPYVIGSVYISVVWHLARVVSVLEETYGLKAMKKSKTLIKGKIWIA  65

Query  265  FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
               FV+  ++   L  +   +   GE+  +   + L  F FL   ++  
Sbjct  66   SAIFVMFQILISVLLMVFNMLVVHGESIGMVGRVSLGIFCFLLLTILIH  114


>KAF9687129.1 hypothetical protein SADUNF_Sadunf02G0061600 [Salix dunnii]
Length=491

 Score = 51.7 bits (120),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 29/222 (13%), Positives = 71/222 (32%), Gaps = 18/222 (8%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
             + LF      L  ++ L    A     +++                       V ++ +
Sbjct  118  YFWLFKVACLILSVVFSLLSNAAVVYTIASIYAGREVSFKKVMSAVPKICKRLMVTFLSI  177

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             ++ +       +  + V L   + +G  ++        +L        +L  +  +   
Sbjct  178  YVALLAYIAVTILVFSLVFLAWFIFIGFSNL-------KVLYPFGIVLLVLSFMGCVYLT  230

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP---  286
            + +     V   +   G +A+ KS+ L+ G  W     +  + + S T++     I    
Sbjct  231  IIWLLASVVSVLEQDWGFKAMTKSKALIKGKMWTATIIYFNISITSATVTMAFQNIVVHG  290

Query  287  --------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                     +      +  L +  F ++ Y +IY   K+N+ 
Sbjct  291  GSMNMAGRVLLGVICSSLILGVCLFGYVTYAVIYFVCKSNHH  332


>NIT13175.1 hypothetical protein [Candidatus Dadabacteria bacterium]
Length=167

 Score = 49.4 bits (114),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 14/103 (14%), Positives = 32/103 (31%), Gaps = 1/103 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  ++C  C  +     +K+    S  RC +C      +  +S +T      +       
Sbjct  1    MI-IQCDQCDRKFRVDDAKIKPPGSRVRCSKCGNVFFVEKKDSAKTDEMTPESGSDRLHS  59

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSI  103
                  DR + Q+   + ++          +    S +  +  
Sbjct  60   DTEKSIDRQDSQAIEQDIQKTPNITISDQNQTPPDSDNKEKVW  102


>ACC98177.1 hypothetical protein Emin_0622 [Elusimicrobium minutum Pei191]
Length=347

 Score = 51.3 bits (119),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 26/178 (15%), Positives = 55/178 (31%), Gaps = 17/178 (10%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
            F       +         Q+ +  W + + T+  IL  ++++T                 
Sbjct  109  FIVSKICGVDITIKEAFKQSFSRAWPLFMLTLVIILFFVTFLT-----------------  151

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
            +   L  V +      L +         ++   + F V+      V+         A++ 
Sbjct  152  LGSLLIAVFASMGYNALAVFFGFIVFAAIVFLAVCFLVYLSPLTAVVVIKKKNMFDAVKY  211

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
            S  LV G + + FG  +L ++IS   S     I    +        +    + L   +
Sbjct  212  SYSLVKGFFSSTFGYMILAVLISFGFSMAAGFINMFIQGIGAVLIAIFNGITVLQLVI  269


>GAK33662.1 family finger-like domain protein [alpha proteobacterium Q-1]GER03623.1 
hypothetical protein JCM17846_13050 [Kordiimonadales 
bacterium JCM 17846]
Length=422

 Score = 51.7 bits (120),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 12/44 (27%), Positives = 16/44 (36%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M    CPHC    + P   L  +    RC  C       P E++
Sbjct  1   MIL-TCPHCAVRFHVPPDLLGPEGRRVRCGSCAHIWHAMPDEAE  43


>OQA95757.1 hypothetical protein BWY23_02291 [Spirochaetes bacterium ADurb.Bin218]
Length=470

 Score = 51.7 bits (120),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 35/329 (11%), Positives = 90/329 (27%), Gaps = 25/329 (8%)

Query  6    CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
            CP CGA  +   S + + K   RC +C +    +  E   +    +  +  +        
Sbjct  41   CPFCGAFYSITFSTIKSGKYRYRCRKCDRDFAIEFLEEDNSLDKGDSESFENSAKTHNHF  100

Query  66   SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIY  125
            +       K  +   CN+          RA  +         A       +     +G+ 
Sbjct  101  AKEEVKSFKDKSLNECNKVDEAFSFLGKRAIQNFSIGQLFFAASDAFSSQKMLISFIGVA  160

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
            L+  +L       +         +        A     V+   +  S +    F  +   
Sbjct  161  LIVALLKLYGFALSSFASSFIGQSYGGFIITLAPFFIIVSVYQIVASIVAWITFENVFYD  220

Query  186  DVGLFR------------SMKLGLRHVGSFTLLLILLI-----------LVVGGGSLLLI  222
                                 + +  +     +++L             L+      L +
Sbjct  221  TKTTAYSIMRFISKKAPSIFFVNIAFLIFINCVILLFCKIPLVGTAAFALIFFPVYFLSV  280

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
               ++  +  +F   ++A      +++L+   + +  H  ++     ++  +S+ +  + 
Sbjct  281  FIAIMTFIGLWFYPPIIAHREGTSVRSLKNFLIFIKKHNLSLVYMIPVIAAVSVAVLLVL  340

Query  283  ARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
              I       +   SL           + 
Sbjct  341  LLIHTFAFLLS--LSLSEIIMGNELSAVF  367


>PLX89717.1 hypothetical protein C0614_01625 [Desulfuromonas sp.]
Length=137

 Score = 48.6 bits (112),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 10/56 (18%), Positives = 19/56 (34%), Gaps = 1/56 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP  56
           M  ++C  C A       K+    +  RC +C           +  +  +  A+ P
Sbjct  1   MI-IQCDQCQARFRLADEKIKETGTKVRCSKCKHVFTVMAPAPEPPEAVEPAASEP  55


>MBE6462622.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=291

 Score = 50.9 bits (118),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 10/75 (13%), Positives = 23/75 (31%), Gaps = 0/75 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP C +  + P + +P    + RC  C       P+++   +  ++           
Sbjct  2   LISCPKCHSIYDIPDNLIPKTGQNFRCQACSNVWNAIPSDALGYEENEDNEPFIEAIEVS  61

Query  63  RIPSDRLEIQSKTVN  77
             P        +   
Sbjct  62  EPPHRNYPANKENYQ  76


>HIJ45883.1 thioredoxin [Rhodospirillaceae bacterium]
Length=75

 Score = 47.1 bits (108),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 13/38 (34%), Gaps = 2/38 (5%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  + CP+C    +  ++ L       RC  C      
Sbjct  1   MI-ISCPNCSTRYSVDAAALGK-GKKVRCSNCGHQWYQ  36


>OYT39889.1 hypothetical protein B6U86_04785 [Candidatus Altiarchaeales archaeon 
ex4484_43]
Length=302

 Score = 50.9 bits (118),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 27/231 (12%), Positives = 64/231 (28%), Gaps = 18/231 (8%)

Query  91   REFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNP  150
              + +     +    L      L  +    L  I ++  +           +     L  
Sbjct  16   MVWFSLKFMFKRWDTLYFFIPVLVVKLLSLLYMIDVVTRIGISTFDTGEPKVIDMINLIT  75

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
            Q    Q  +L     +    +  +  + +    K     +  M+  +  +  F  +  L+
Sbjct  76   QFAITQLILLFIIWIFNSYAVMCVYVAAYRQFRKKRWNAWDLMQEYIGKLPGFLAISFLV  135

Query  211  ILVVGGGSLLLIIP------------------GLLFCVWFFFCQYVLADDNIGGLQALEK  252
             L+V    L++ I                    +   +           +      A+++
Sbjct  136  DLIVSIPVLIVGIFAAVTGPLAFLLLVLVLPAMVYIGIRLSVSNISYVIEGKSITGAIDR  195

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
            S  L    +W +FG  +   ++   ++F    I  +         L   PF
Sbjct  196  SLFLTKDKFWYVFGCSLAFGILIGIITFFGRLIIKLPLMVLGGGELAYQPF  246


>HAQ0966128.1 glycerophosphodiester phosphodiesterase [Enterococcus faecium]
Length=472

 Score = 51.7 bits (120),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 38/355 (11%), Positives = 89/355 (25%), Gaps = 22/355 (6%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA----YILLGLSWMTG  176
            L  I +L  +L         LL    ++  +       +L  T+        +   +   
Sbjct  73   LSLIVVLMTILLLVYFEFTFLLMSVFFIKKKEPISLKQLLHLTILQLKKVRPITFLFFLA  132

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
               + +  + +     +   ++        +     ++    LL+ I      +   F  
Sbjct  133  YFLLILPISGLSFNSDLLSKIKIPAFIMDFIFANRWIIVSSFLLVYIFLGYIGIRLIFAL  192

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG-------  289
              +   +     A+ +S  L      AI G+F+++    L LS L   +  +        
Sbjct  193  PEMILRDRPFRAAIRESWSLTKSRLLAITGQFIVIGGTILLLSSLGYIVVILAQSMVEQF  252

Query  290  ----EAANLAF------SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIF  339
                   +  F       +LL         + Y  +         P I + ++P    + 
Sbjct  253  FPDYALISAVFAMTLLQGILLFNIVMSTVGIFYIIVDFMDDEGFLPEIPKWFIPQAPNLR  312

Query  340  GWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADY  399
               L    L +      +              +          ++     +    +    
Sbjct  313  FSALKNTGLTLFAVFFGIGVCLYNMDYLTSAVQTKPVTVSHRGVSTQNGAQNTLEALEKT  372

Query  400  KLLLSKQRKTTSEGGLSLGPVTLFADRFWAD-DQNPHLWLKLELSDFPNLSLAQK  453
                              G   +  D               L L +  NL++ + 
Sbjct  373  SRDYHPDYVEMDVQETKDGHFVVMHDANLRHLTGVNGTPQDLTLKELTNLTVTEN  427


>RZB42193.1 hypothetical protein D0Y65_052964 [Glycine soja]
Length=641

 Score = 52.1 bits (121),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 23/241 (10%), Positives = 59/241 (24%), Gaps = 0/241 (0%)

Query  40   PAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSG  99
             A++                 +  +    + +   + +         L            
Sbjct  1    MAQNNFWHIFCETRNIFQAHSRHLVTLSLIFLFPLSFSLLLSPTLSNLFNHFYTNIIPYP  60

Query  100  LRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAI  159
                S    +                 +        I  ++           N       
Sbjct  61   YNYSSTTNPNYKHHSLFFHLLYSLFTFIFSNCGVISITYSVFHFFNDQSLNLNLKSTITK  120

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
             ++T    LL  + ++  +  ++      L   +  G       T    L   +    +L
Sbjct  121  SISTSFLPLLATTIVSHVIIFFVSLLYALLLVLIICGAIFFNVTTSYSSLYYFIGFLIAL  180

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
             L+   +   V +     ++  ++  GL+AL++S  LV G          +     + + 
Sbjct  181  PLLFFLIYLQVNWTLVPVIVILESCWGLEALKRSARLVRGMKRVALSSLFVYGFFEVIVV  240

Query  280  F  280
             
Sbjct  241  L  241


>MBF0344757.1 hypothetical protein [Nitrospirae bacterium]
Length=346

 Score = 51.3 bits (119),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 27/149 (18%), Positives = 59/149 (40%), Gaps = 8/149 (5%)

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
                     +           F  +   ++      +  +LLI+++  G LLL+IPG+  
Sbjct  92   YVFIIKVIRLIFTKYDGADMKFEELLANMQLTFKVLITNLLLIIIITIGMLLLVIPGIFL  151

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP--  286
             V F F  +++ D +I  ++++++S  +  G+ W +        +I+  L  +T  +   
Sbjct  152  TVRFSFSIFLIIDKDIAIIESIKRSYRITKGNTWKLLLLLFPPSIINSLLPTITTFLSDK  211

Query  287  ------YVGEAANLAFSLLLTPFSFLYYY  309
                  Y+        +  +  F+ LY Y
Sbjct  212  IVIYSLYILAFIGNIVTFFIMLFTMLYVY  240


>WP_199753933.1 zinc-ribbon domain-containing protein, partial [Corallococcus 
sp. AB018]RUO87600.1 hypothetical protein D7Y11_39840, partial 
[Corallococcus sp. AB018]
Length=156

 Score = 49.0 bits (113),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+C  C      P  K+  K    RC +C  T 
Sbjct  1   MI-VQCEQCQTRFKIPDEKVTEKGVKVRCTKCQNTF  35


>MBE6700725.1 hypothetical protein [Ruminococcaceae bacterium]
Length=229

 Score = 50.5 bits (117),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 29/212 (14%), Positives = 57/212 (27%), Gaps = 7/212 (3%)

Query  98   SGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQW  157
                      +   ++F   G     + + G+V +       L                 
Sbjct  1    MKEICRKYRESAKVQVFSHIGQIFFAVIITGLVSSLISYHFQLTAFNKNIELISGDPEML  60

Query  158  AILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG--  215
             I L   A   +    +T  +  +   T             +   F     LL   +   
Sbjct  61   KIALIVSALSSIVTLPLTFCLTRFYLLTSRTHIFQKIPIKAYFTPFESPSYLLKCSLLTL  120

Query  216  ----GGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVL  270
                   L + +      + F    ++++D+       AL+KS  ++ G     F     
Sbjct  121  TLTVINCLGVFLLIFPVFLSFCMAPFIMSDNPETSVFSALKKSFKMMRGKKMMAFRASFP  180

Query  271  LLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             L+  L ++   A IP +      A   L   
Sbjct  181  FLIFYLVITMFFASIPVISFIMTAAGEALFYV  212


>TET29854.1 hypothetical protein E3J69_12685 [Anaerolineales bacterium]
Length=256

 Score = 50.5 bits (117),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 27/207 (13%), Positives = 64/207 (31%), Gaps = 4/207 (2%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
                   L        +GI ++G ++  A IF  L       L       +    +   +
Sbjct  11   FPQIDHWLQSVPQETWIGIAVVGGLILLALIFWVLAAIGNGGLIAGFHMAETGETVTLAS  70

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
                G S+    + I +      L   + +         L L + ++       LLI  G
Sbjct  71   AFQQGFSFFWKLLAIQLLLGLASLVVILPVVFGGALLSILTLGIGLICFIPLICLLIPLG  130

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            +   ++    Q  L  +++  + A +++  +   +     G  +++ +I    SF+   I
Sbjct  131  IALSIYTLLTQIALIVEDLDIVAAFQRAWDVFRSN----LGEVIVMGLILGVGSFVVGLI  186

Query  286  PYVGEAANLAFSLLLTPFSFLYYYLIY  312
              +     +   +           +  
Sbjct  187  LAIPFILMVLPFITGLLVDTEASTIAC  213


>OGF21504.1 hypothetical protein A2Y83_04475 [Candidatus Falkowbacteria bacterium 
RBG_13_39_14]
Length=277

 Score = 50.9 bits (118),  Expect = 5e-04, Method: Composition-based stats.
 Identities = 44/201 (22%), Positives = 80/201 (40%), Gaps = 9/201 (4%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +L +  + I L       AL+         +  + +     +   ++          + +
Sbjct  73   ILFLISIVIALLSIIQQIALIYFADQKNIARESSVKECYQTSKSYFLSYLWISAIILLIM  132

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             I    +     +    + V +   +L+ LI +    ++L+    L   V F F  ++L 
Sbjct  133  LIGALGIMFTIGLSAIFKTVFASKYILLALISLSFIAAMLIFFYLLYLSVCFMFSYFILI  192

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFG---RFVLLLVISLTLSFLTARIP------YVGEA  291
             +NI G +AL+KS+ LVS  WW          L+++    +S L   IP       +G  
Sbjct  193  SENIKGWEALKKSKALVSERWWNTVVKLIILNLIILAISLISVLIKMIPQKTISGALGTL  252

Query  292  ANLAFSLLLTPFSFLYYYLIY  312
              L FSL   PF+ +Y+Y IY
Sbjct  253  FTLIFSLAAIPFTAVYFYFIY  273


>MBC7870492.1 hypothetical protein [Chitinophagaceae bacterium]
Length=338

 Score = 51.3 bits (119),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 30/212 (14%), Positives = 59/212 (28%), Gaps = 29/212 (14%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV------------  214
             ++    +T             +  +            + L L  LV+            
Sbjct  91   GVILNGLITSIASENHLGRRQSIGEAFSAMRSRFVPLGIGLFLFGLVIGILTLICFLGGA  150

Query  215  -GGGSLLLIIPGLLFCVW-FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
                  L I       +  FF+   +L+ +++G    + ++  L    +W  FG    + 
Sbjct  151  LIVCPFLGIPVVFYLAIAGFFYIVPILSLEDVGASFGISRALALGKARFWPSFGLVAGIT  210

Query  273  VISLTLSFLTARIPYVG---------------EAANLAFSLLLTPFSFLYYYLIYSDLKA  317
            +I   ++     +  V                   NL  ++L TP   +   L+Y D + 
Sbjct  211  LIIFIITLAVGALAGVLLGVSGSVFASTSTPIILLNLVLTILATPIQPIALTLLYYDTRV  270

Query  318  NYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLL  349
               G             + A        G LL
Sbjct  271  RLEGLDIAMQSVTRPNPSPADVASPPSSGPLL  302


>KPN30121.1 hypothetical protein SY89_00843 [Halolamina pelagica]
Length=188

 Score = 49.8 bits (115),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 17/94 (18%), Positives = 38/94 (40%), Gaps = 0/94 (0%)

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
            LL+L +G   ++ I+  L+      F   V+  +N   L A  +    ++G W       
Sbjct  25   LLLLAIGVAFVVAIVSSLVNGFTTQFVVPVMIAENRNVLAAWRRFWPTLTGQWKQYLVYV  84

Query  269  VLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             +  V+S+ +  +   + +VG        +++  
Sbjct  85   FVRFVLSIAVGLVVGIVTFVGMLILAIPFVIVGV  118


>GBD36071.1 hypothetical protein HRbin36_01191 [bacterium HR36]
Length=343

 Score = 51.3 bits (119),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 47/324 (15%), Positives = 92/324 (28%), Gaps = 16/324 (5%)

Query  1    MPTVR-CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCG  59
            M  V  CPHCG     P+     K    +CP C      + +ES       + A+     
Sbjct  1    MNRVTHCPHCGQTVTVPAGLSGRK--RVQCPLCRNQFGLEISESAVRVLAIDTASLGVAT  58

Query  60   LQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGW  119
                    R   Q +       +++F         A    L     L+  S  +      
Sbjct  59   AGPTAARKRQPWQYRFDFGAIFSKTFSHYGSAFLAALVYFLLVFLLLIGVSIGVLVATFV  118

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWA--ILLATVAYILLGLSWMTGS  177
              L  ++  I+L    I       P  +                    +           
Sbjct  119  LRLLGWIGAIILFVLLIALLAAAIPLMFGIVSACMSIARGKGWDVACFFQPFRNFGGFIM  178

Query  178  MFIYICKTDVGLFR-SMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             ++ +    +     ++ +    +        + +++ G   L   I  +   +  F   
Sbjct  179  WYLGLIGLCIAFGVLAVAIIFGVIWLRQNFAAVALIIGGILLLGEGIALVGVLLRCFAIS  238

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL----------LLVISLTLSFLTARIP  286
             +   D  G   A++ +  + S + W  +   +L          ++ ++L  + L A  P
Sbjct  239  PMFLIDGYGINDAVKWNLKVTSHYKWLYWLVLLLLLFLVGLFNSIVGLALRFTLLAALGP  298

Query  287  YVGEAANLAFSLLLTPFSFLYYYL  310
             VG    L  S +L         L
Sbjct  299  LVGALVGLVASFVLNALVMPLTAL  322


>WP_006063008.1 hypothetical protein [Corynebacterium durum]EKX91392.1 hypothetical 
protein HMPREF9997_00764 [Corynebacterium durum F0235]
Length=239

 Score = 50.5 bits (117),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 26/212 (12%), Positives = 58/212 (27%), Gaps = 9/212 (4%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
             +   W  +      +L + L+  +                    +      A       
Sbjct  23   HVGQQWIPWATAMGAVLFLSLVPFIALIPLFGVLAAEDFFDVRVQEGTILGSAFAWIFFL  82

Query  166  YILLGLSWMTGSMFIYIC-------KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
             +              +           + +     L    +  FTLL   + ++VG G 
Sbjct  83   LLCASAVAALVLWLNSVRNAYRQCQGEVITIGSFFALRGLLIPFFTLLS--VAVIVGVGL  140

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            ++ +IPGL+  V   +  +  A+      +A  +       +     G ++L+ V+    
Sbjct  141  VMFVIPGLIVMVLLMYVPFAAAEPGSTIAEAFVRGWQAFVDNPGKSIGLWLLVAVVYCVA  200

Query  279  SFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
            +F    +  V     L   +           L
Sbjct  201  NFTYFALLLVFPIIQLIMMVAYLMCVRRPIAL  232


>MTI08817.1 hypothetical protein [Rhodospirillaceae bacterium RKSG073]
Length=421

 Score = 51.7 bits (120),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 7/36 (19%), Positives = 15/36 (42%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M    CP+C  +   P + +  +    +C +C  + 
Sbjct  1   MIL-ACPNCSTKFRVPDNAIGPQGRVVKCAKCAHSW  35


>HAI09426.1 hypothetical protein [Dehalococcoidia bacterium]
Length=215

 Score = 50.2 bits (116),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 23/179 (13%), Positives = 52/179 (29%), Gaps = 6/179 (3%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
               L  +  L           +L   P      +       + +   A  +L    +   
Sbjct  37   QIPLSVLSWLLFRNTDLNFDFSLTEIPPVLFWVKVAIKSLTLSILGGAAWILMQGALIHG  96

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL--LLIIPGLLFCVWFFFC  235
            +     +T + +  +     R        L+   + +    +  L I   + F   + F 
Sbjct  97   ISEQFIRTPIQIQSAYAFSFRRFFPRLAALVFSGVALILMWITVLGIPFAIRFGGLWVFI  156

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS----LTLSFLTARIPYVGE  290
                + +  G   A+ +S  LV   W  +     +  ++       +S++   IP  G 
Sbjct  157  LQTASVEGFGPRAAMARSAALVRDSWSRVAWILFVSGIVFAISGGIVSYMIGFIPIAGA  215


>OJW54001.1 hypothetical protein BGO67_08015 [Alphaproteobacteria bacterium 
41-28]
Length=217

 Score = 50.2 bits (116),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 10/44 (23%), Positives = 14/44 (32%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  + CP+C        + LP +    RC  C       P    
Sbjct  1   MIVI-CPNCSKRYMLDDNLLPQEGRQVRCIACHHVWRQVPDREP  43


>MBE9516992.1 hypothetical protein [Bacteroidetes bacterium]
Length=169

 Score = 49.4 bits (114),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 20/169 (12%), Positives = 53/169 (31%), Gaps = 0/169 (0%)

Query  91   REFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNP  150
                 +                 +       + + ++ IV         ++         
Sbjct  1    MVTIFNNDQYDFYPDAGKSYSVGWKILMAAFVELLVISIVYMILSGPVGVIQWKVDSFEW  60

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
                  +  ++ +V                 +    + +     +  R+  +  +  I++
Sbjct  61   FLVPLVFFGIVYSVFVAGPIQYGAKWVFLKAVRGERIEVRDIFVVFQRNYWNAVIAKIVV  120

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
             ++VG G ++LI+PG++F     F  Y++ D  +  + AL  S  +  G
Sbjct  121  GIIVGLGFVMLIVPGIIFACRLAFVPYLVVDREMDVMDALRVSWDMTRG  169


>MBD0865688.1 zinc-ribbon domain-containing protein [Rhodobacteraceae bacterium]
Length=198

 Score = 49.8 bits (115),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 9/56 (16%), Positives = 14/56 (25%), Gaps = 0/56 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHC  58
            + CP+C A    P   +P      +C  C                 D+       
Sbjct  2   RLTCPNCDAWYEVPGKAIPRAGRDVQCSNCGTMWFQTCPNRTNDTPDDDGPHIVPH  57


>KAF8358351.1 hypothetical protein PRIPAC_93346 [Pristionchus pacificus]
Length=3704

 Score = 52.1 bits (121),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 32/290 (11%), Positives = 65/290 (22%), Gaps = 23/290 (8%)

Query  339   FGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSAD  398
                +   G  L + S        +   G+ +                      + + +  
Sbjct  2109  QPAVGPDGQPLPTDSAGKPVYPVVGPDGQPLATDSTGAVVGPDGQPIPTDASGKPVDADG  2168

Query  399   YKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARI  458
               L      K  S          +        D +P    + +   +P +    +  A  
Sbjct  2169  NVLPTDSDGKYISPKTEGDDEEKVVLPIIVGPDGSPLPTDENKKPVYPVVGPDGQPLATD  2228

Query  459   EIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLF---SGIRSIYLRQGTQAEQVH  515
                 V+  D  +      S          +      +         S  L      + +H
Sbjct  2229  STGAVVGPDG-EPIPTDASGRPVGADGSPLPTDANGNYVNVPKDDISKELPTDETGQVIH  2287

Query  516   SILGKLELTLPLAIESLQLTRNDIGKTLQIGGK--------QLILQRLGSNAVTLRFLGD  567
              + G     LP       +   D G  ++  G+         ++      N +    +G 
Sbjct  2288  PVTGPDGQPLPTDASGNFIK--DDGTPIEKDGEGRPLGPDGNVLPTDASGNFI-YPAVGP  2344

Query  568   RTDLLNVHASNSH--------AEPLREIGFTWQKSGDAFSLRQMFDGNIE  609
                 L   A+N           +PL           D   +     G   
Sbjct  2345  DGSPLPTDANNKPVYPVVGPDGQPLATDSTGAVVGPDGQPIPTDASGKPV  2394


>MSW87328.1 hypothetical protein [Actinobacteria bacterium]
Length=287

 Score = 50.9 bits (118),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 23/93 (25%), Positives = 47/93 (51%), Gaps = 1/93 (1%)

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
            G+   +K     +     + +L+ L    G  + +IPG+   V +     V+  + +  L
Sbjct  98   GIPDVLKSVKTRIWPMLWVGLLVSLATLVGYFIFLIPGIYLGVVWSVVTPVIVVEGLS-L  156

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
             AL +SR LV G+ W +FG  ++L+++ L +++
Sbjct  157  DALARSRNLVKGNGWGVFGFGLVLILLLLVITY  189


>NLN62172.1 hypothetical protein [Myxococcales bacterium]
Length=243

 Score = 50.5 bits (117),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 24/193 (12%), Positives = 55/193 (28%), Gaps = 13/193 (7%)

Query  143  KPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGS  202
                           +++   +    +   ++     +     +      +  G    G 
Sbjct  60   NFDLLFVSFLVCGLLSVVTLGILATPMIAGFLMIVRRVMRNDPNKPTVGDVFQGFEVFGQ  119

Query  203  FTLLLILLILVVGGGSLLL---IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
              LL +LL + +   SL+    I  GLLF  +F +    +  + +  + AL+K   L + 
Sbjct  120  AFLLFLLLSIAIFILSLVPVVKIAVGLLFSPFFAWGMLFVVYERLSAVDALKKIFELTAA  179

Query  260  HWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
              +           + L +  +   I  +G            P  ++   + Y  L    
Sbjct  180  GDFT----------MPLFVGLVAGIIGGLGVIFLGIGIFFTAPLIYIITAVAYETLFGAD  229

Query  320  RGPQHPPIKRQWL  332
                        +
Sbjct  230  EMHDAGDWNDGTM  242


>RLB05769.1 hypothetical protein DRG50_06725 [Deltaproteobacteria bacterium]
Length=76

 Score = 46.7 bits (107),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 7/36 (19%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + C  C  +    ++K+       RC  C    
Sbjct  1   MI-IECKKCRTKYRVDAAKIKTTGIRVRCSRCMHEF  35


>PYV69652.1 hypothetical protein DMG97_21290 [Acidobacteria bacterium]
Length=262

 Score = 50.5 bits (117),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 22/165 (13%), Positives = 48/165 (29%), Gaps = 0/165 (0%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
               A   S      A                ++    LL + W+   + + +    V   
Sbjct  28   YCLAVFISQAPTISAVSKLHMELPISIGGAYSSSRGSLLRVIWIVFLISLIVMALFVISG  87

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
              +      +    L   ++ +V     ++     L + + +     V   +      + 
Sbjct  88   ILIGTVGGALAESNLGPWVIGIVGSVVVVMPAFVILRWMLNWSLVIPVTILEGGWFRAST  147

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
             +S+ LV    W IFG + LL ++    SF+      +       
Sbjct  148  RRSKFLVKDSRWRIFGVYFLLGLLGGVTSFMAQIFLLLLVPVFGV  192


>NQZ85032.1 hypothetical protein [Nanoarchaeales archaeon]
Length=298

 Score = 50.9 bits (118),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 29/199 (15%), Positives = 66/199 (33%), Gaps = 10/199 (5%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                +  +++A     +   +   T     +   +   +   V   +L L  +    +  
Sbjct  25   FIFKMSLLIVAIGVFTTVTDIYSPTSEWTAHIGVEILNIFIYVTVAILALVNVIYLYYKN  84

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                +     +  L    +    +  IL  +++    LLLI+PG++F + + F  Y +  
Sbjct  85   KSSDNKTYSEAESLLFGKILPIIITSILFTILIIPLFLLLIVPGIIFSILWSFYLYAILF  144

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVL-------LLVISLTLSFLTARI---PYVGEA  291
             +     AL+ S  +VSG    +   F+        L ++ +  +     I         
Sbjct  145  RDKKYYSALKYSASIVSGKKLLVLNNFIAYTFNQYRLWLLPILAALFVVIISQDNITLII  204

Query  292  ANLAFSLLLTPFSFLYYYL  310
               A S+ L  ++      
Sbjct  205  LFTAVSICLEIYTLRTVVF  223


>HBY13721.1 hypothetical protein [Rhodobacteraceae bacterium]
Length=122

 Score = 48.2 bits (111),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 9/40 (23%), Positives = 15/40 (38%), Gaps = 0/40 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
            + CP C A+   P   +P      +C  C +T      +
Sbjct  2   LLACPICQAQYEVPEDAIPEAGCEVQCSACGETWFQPHPQ  41


>WP_155987575.1 hypothetical protein [Acidobacterium ailaaui]
Length=298

 Score = 50.9 bits (118),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 33/278 (12%), Positives = 74/278 (27%), Gaps = 5/278 (2%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            +    +  L   L F      +  + +   +      Q    +  +    L    +    
Sbjct  19   FFWAAVVYLFCRLRFTLFDLIVYKQGSIRRSWIKYGSQAWRYVGVMLLASLAFFLVAAIS  78

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
                    +   R+  L   +     LL  +L ++V    L L    ++  V   F    
Sbjct  79   TGPFFLKMLRAMRAEALQGSNPNPLPLLAAMLPVLVVLFLLALCW-MIVDTVLQDFVLPP  137

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT----ARIPYVGEAANL  294
            +A ++     A  +  LL+        G  ++  V+S+ +S++       +  V  A   
Sbjct  138  MALEDAPIGGAFRRFFLLLQEGPAMFLGYLLVRFVLSIGISWVLLTLVGIVLLVAGAGCG  197

Query  295  AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSR  354
              SLLL    +    ++               I    + + +      ++     V    
Sbjct  198  LVSLLLYRTMWHGGVVMQFGFVLIVAVLAVVLIALYLMAMVSIYGTVGVLKESYAVFFYG  257

Query  355  QNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQ  392
               +    L    +  +                   P 
Sbjct  258  SRYARLGDLLEPPEKDRTDVQTVFPQDLSGSLPEASPP  295


>WP_172819116.1 zinc-ribbon domain-containing protein, partial [Corallococcus 
exiguus]NRD50498.1 zinc-ribbon domain-containing protein [Corallococcus 
exiguus]
Length=120

 Score = 47.8 bits (110),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+C  C      P  K+  K    RC +C  T 
Sbjct  1   MI-VQCEQCQTRFKIPDEKVTEKGVKVRCTKCQNTF  35


>NNE45683.1 hypothetical protein [Rhodothermales bacterium]
Length=315

 Score = 50.9 bits (118),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 25/153 (16%), Positives = 45/153 (29%), Gaps = 35/153 (23%)

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
               + IL      +  I   +   V + F    +A ++   L A  +S  LV+G+    F
Sbjct  161  AGAISILGTVAAVVFGIYCVIYLKVRWAFGTTTIAWEDQSVLGAFGRSSELVAGNGMRTF  220

Query  266  GRFVLLLVISLTLSFL-----------------------------------TARIPYVGE  290
            G   L  ++   L  +                                    + I ++  
Sbjct  221  GILALFAILVGILVSVVMTPFQFLMFKDLFLAGLNQAQNVALQGEADILKSLSGIGFLYG  280

Query  291  AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                  SL  T    +Y  ++Y DL+A      
Sbjct  281  VIVALASLGTTTIKSVYLPVLYFDLRARNAEFD  313


>QDT58231.1 hypothetical protein SV7mr_07200 [Planctomycetes bacterium SV_7m_r]
Length=904

 Score = 51.7 bits (120),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 46/330 (14%), Positives = 89/330 (27%), Gaps = 3/330 (1%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             T  CP C +      +K   +    RC +C   L        +     + +      L 
Sbjct  570  FTFSCPVCDSRIT---AKRANEGDKLRCSDCHSDLTIPKPPPIKQAKETSPSELESFALA  626

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                  R       V           +PE     S S           S  +  +    +
Sbjct  627  PESQISRGAENDPWVKNADQLLEEAAKPESTKFVSLSADEDPEGGWMHSIGIRMKDPGVI  686

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              + LLG ++      +  +    T            I+L     I++  +     +  +
Sbjct  687  AHLILLGAMMGAFTFLATSIHWLITVFTLPIIFVILVIVLVASFAIMMATANGHDQIDEW  746

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                    F +M   L  +      + LL  + G  + L++   L      F    +   
Sbjct  747  PTMDPSEWFDTMGAVLSAMAVTVAPIGLLAFLFGFSNTLVLGLSLAAMFSMFPIVLLSML  806

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            D       +  + +    H    +G F LL    L ++                F  L+ 
Sbjct  807  DAQSMTGIVSPTVIKSINHRGEDWGTFYLLSAFMLLVTIGLLVYLSGTSFGMAVFGCLIV  866

Query  302  PFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
               F+Y+ L+    +         P+ +  
Sbjct  867  AVFFVYFTLLGRLARGIQSVVHFEPLGKSK  896


>HCS52016.1 hypothetical protein [Planctomycetaceae bacterium]
Length=299

 Score = 50.9 bits (118),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 11/56 (20%), Positives = 20/56 (36%), Gaps = 0/56 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP  56
           M    CPHC A    P S +  + +  +C +          +   +  T ++   P
Sbjct  1   MIQFECPHCQAVLRVPDSAMGQEGACPKCQKALLIPNPMAQQVSPSAVTQHVTPQP  56


>RDE16307.1 hypothetical protein C4K48_01870 [Candidatus Thorarchaeota archaeon]
Length=161

 Score = 49.0 bits (113),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 24/161 (15%), Positives = 56/161 (35%), Gaps = 9/161 (6%)

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
                I +    +            V          +L      L+ II  L   V     
Sbjct  1    MVYGIILMIPVIPGGLIFAYSAVTVDWMDPSTYGAMLASIPFLLVGIIITLYLAVRLAPT  60

Query  236  QYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI--------P  286
              V+  + +   + +++++  +   H+W IFG   LL+++   +  +   +         
Sbjct  61   IAVVIAEKDKSAVASVKRAWKITGHHFWHIFGGLFLLVIVIALVGMVIGILVAPIALVAM  120

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
             +   A +   + ++PF  ++  ++Y DL++  R  Q    
Sbjct  121  GLVGLAAIIIGIFVSPFPAIFQAVLYRDLESRARITQADWW  161


>OGQ12530.1 hypothetical protein A2138_07660 [Deltaproteobacteria bacterium 
RBG_16_71_12]
Length=288

 Score = 50.5 bits (117),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 11/37 (30%), Positives = 16/37 (43%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V CP C A+       +P   +  RCP C  + +
Sbjct  1   MI-VTCPGCSAKYRVRDEAVPEGGAELRCPTCNASFM  36


>OLS27509.1 hypothetical protein HeimC3_02870 [Candidatus Heimdallarchaeota 
archaeon LC_3]
Length=414

 Score = 51.3 bits (119),  Expect = 6e-04, Method: Composition-based stats.
 Identities = 21/157 (13%), Positives = 47/157 (30%), Gaps = 21/157 (13%)

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALE  251
            +   +  +    +   +L  +   G +  I+PG++          ++  + +I   ++ +
Sbjct  150  ISKVIPKLIPIVIAAFILNFLTLIGLIFFIVPGIIISGLVSLVPVIILSESDISVGKSFK  209

Query  252  KSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY--------------------VGEA  291
            +S  LV G         V + +I   L+ +   I                      +   
Sbjct  210  RSYELVKGFLLKTLALLVFIGIIQFILATIVQLILVNIYVVFSGIDPEQISTSTDPIFVI  269

Query  292  ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
                      P S +   L Y +LK   +     P  
Sbjct  270  LRYVSEAFFAPLSIISLILFYYNLKWENQQKYSQPWN  306


>WP_088217549.1 zinc-ribbon domain-containing protein [Haematobacter massiliensis]OWJ83136.1 
hypothetical protein CDV51_16465 [Haematobacter 
massiliensis]
Length=145

 Score = 48.6 bits (112),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 15/38 (39%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  + C +CGA+     + +P +    +C  C      
Sbjct  1   MRLI-CANCGAQYEISDTAIPPQGREVQCAACGHGWFQ  37


>BAU82378.1 integral membrane protein [Streptomyces laurentii]
Length=143

 Score = 48.6 bits (112),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 25/124 (20%), Positives = 36/124 (29%), Gaps = 30/124 (24%)

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL----------  281
            F         +N G + AL +S  LV G WW IFG   L+ +I   +  +          
Sbjct  3    FSLAPAAAVIENQGPMAALRRSAHLVRGSWWRIFGCVALIGLIVGVVGGMVQEFVSILAM  62

Query  282  --------------------TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
                                   +     A  L   +LL P   L   L+Y D +     
Sbjct  63   VPMTSLASGDHENMRASFSTLWGVMLTAGAVGLVVQILLAPLQPLVVSLLYIDQRIRKES  122

Query  322  PQHP  325
                
Sbjct  123  LAPM  126


>TGV64675.1 hypothetical protein EN792_068835, partial [Mesorhizobium sp. 
M00.F.Ca.ET.149.01.1.1]
Length=200

 Score = 49.8 bits (115),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 21/122 (17%), Positives = 51/122 (42%), Gaps = 0/122 (0%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
               +    +  + +  +  + +  +         + L  +       +L+ L +  G LL
Sbjct  77   PVYLFITAVLQTSVIRAAIVDLRGSKPVFADCFGVALALLFPILGASLLVTLGILIGLLL  136

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            L++PG+L  + +     V+  +  G L ++ +SR L  G  WA+   +++L+V    +  
Sbjct  137  LLVPGILLLLRWAVTMPVMIQERRGILDSMARSRDLTKGSRWALLWLWLILIVTGTLIGL  196

Query  281  LT  282
            + 
Sbjct  197  VI  198


>MBA4366017.1 hypothetical protein [Desulfobacterium sp.]
Length=363

 Score = 50.9 bits (118),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 23/131 (18%), Positives = 38/131 (29%), Gaps = 1/131 (1%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE-SQRTQTTDNIATCPHCGLQRR  63
             CP C        SK+P K  +ARCP+C             +        TCP CG  + 
Sbjct  56   TCPSCQVSFKIDDSKIPEKGCNARCPKCQNRFFLQKKPLIGKKTGPIKTITCPKCGFSQP  115

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
                 +E         +      +    +     +  R+         + F      L+ 
Sbjct  116  KSETCIECGIVFAKYHQPRNDPGIHDTIQPARPDNVRRTSIADFPARPKTFPIITVALIV  175

Query  124  IYLLGIVLAFA  134
            I  +  V  + 
Sbjct  176  IAGIAGVFIYL  186


>WP_152420686.1 hypothetical protein [Haloferax sulfurifontis]
Length=217

 Score = 49.8 bits (115),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 24/171 (14%), Positives = 61/171 (36%), Gaps = 10/171 (6%)

Query  147  WLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL  206
            ++  +         LA+ A  L  L     S+ I      +     +      +    +L
Sbjct  44   YIFHRLGWASVHSFLASWAVALSVLIVTILSLVISFGGLSLVTDGFVSGLADSLPGKIIL  103

Query  207  LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG  266
              ++ +++  G+L+         V   F    +A  + G + A++ S  L  G    +F 
Sbjct  104  GGIVFVLLVPGALI--------GVSLLFTGQEIAVRDKGVISAIKGSWRLTKGVRGQLFL  155

Query  267  RFVLLLVISLTLSFLTA--RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
              ++ +++ + LS+L       ++  + ++  S +++          Y   
Sbjct  156  LTIVPMLVQIPLSYLLFEHLPQFLANSISILESAVVSLVMLAIMARAYKQC  206


>MBR32394.1 hypothetical protein [Spirochaetaceae bacterium]
Length=323

 Score = 50.9 bits (118),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 23/181 (13%), Positives = 52/181 (29%), Gaps = 3/181 (2%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             I  +    + A    A +       N     +   +L        L ++ +  ++   +
Sbjct  1    MIAGMVSYGSGAARGQARIEDLFKPFNNFGGVFLAGLLYFLFLLAGLIVTIIAYTVSAAV  60

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                +            +G+     +L  +V+     L+ +    F         ++ + 
Sbjct  61   FAASLKAVIETGNLGTDLGAGLSEDVLGSIVIFAVFTLIGLTAHYFMARLDLVYPLVYEL  120

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             +G   A  +S  L  GH    F  F   ++I+L  + L     +              P
Sbjct  121  RVGPWDAFVQSWRLTRGHG---FSLFFAKMIIALVPAILVGVPYFALLFGTFFLGTGPDP  177

Query  303  F  303
             
Sbjct  178  L  178


>MBF1339007.1 DUF975 family protein [Mogibacterium diversum]
Length=233

 Score = 50.2 bits (116),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 23/138 (17%), Positives = 54/138 (39%), Gaps = 1/138 (1%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
              +A+  A   +++  ++    + F+     +   F        +  +    +  + L +
Sbjct  6    IAFAVGFAFAIFVINPINVGVHTFFLRNSSGEAEGFHLGDGFKYNYLNVVKTMFFMNLWI  65

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
                LL I+PG++    +    Y+LA++ +I   +AL +S  L+ G+ W  F   +  + 
Sbjct  66   LLWKLLFIVPGIIKSYSYRLVPYILAENPDIDTNEALMRSEQLMRGNKWETFIYDLSFIG  125

Query  274  ISLTLSFLTARIPYVGEA  291
              +   F    +P     
Sbjct  126  WYILSIFTCGILPVFWVM  143


>RME50712.1 hypothetical protein D6795_09580 [Deltaproteobacteria bacterium]
Length=354

 Score = 50.9 bits (118),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 11/43 (26%), Positives = 18/43 (42%), Gaps = 1/43 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
           M  ++CPHC    N P  K+  +   A+C +C         + 
Sbjct  1   MI-IKCPHCETAFNIPDEKISPRGIQAKCYKCSHIFRVRRPDP  42


>RLI93329.1 hypothetical protein DRO89_00300 [Candidatus Altiarchaeales archaeon]
Length=531

 Score = 51.3 bits (119),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 39/252 (15%), Positives = 78/252 (31%), Gaps = 8/252 (3%)

Query  69   LEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLG  128
             E+     + R       ++   E         S            C      L +  L 
Sbjct  63   FEMAHVLRDYRDIFSDVYMESFPEIYPDLIERFSAMGPEIVIISAICLILISALYLLFLV  122

Query  129  IVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG  188
            +           L +  T          +          +  L ++   ++  I    V 
Sbjct  123  LYAYVGGGSIGYLWEGITDGITFKNFLYYGRNCVWRILGIRILLFLLFLIYSIIFFGAVA  182

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
                + +     G  +L  ++++ +V    +L +    L  + FFF +  +  +N G + 
Sbjct  183  ----LIISALLPGHSSLYSLVILFLVIFSFILWLFLWFLISLPFFFVETSIVIENRGIMD  238

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS----LLLTPFS  304
            ++++S  LV  + W I    VL  VI L  S L   +       +L  S    L++  F 
Sbjct  239  SIKRSADLVIKNIWQILLFIVLSTVIWLAYSLLVISLEIPLSLFSLGISPLSALIILFFM  298

Query  305  FLYYYLIYSDLK  316
              +  L   +  
Sbjct  299  NPWMDLAKLNFF  310


>ORB58330.1 hypothetical protein BST43_09855 [Mycobacteroides saopaulense]
Length=192

 Score = 49.4 bits (114),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 18/133 (14%), Positives = 37/133 (28%), Gaps = 24/133 (18%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
               +   +         F     VL  +      AL +S  LV   +W + G  +L  +I
Sbjct  27   FALTFAFLALYFYLIPVFALASPVLILERQTVFGALGRSMSLVRKGYWRLLGILILTSII  86

Query  275  SLTLSFLTARIPYVG------------------------EAANLAFSLLLTPFSFLYYYL  310
               ++ +      +G                            +   ++  PF+     L
Sbjct  87   VGVVAGILGMPFSIGGQIAMGMGTSHSSATSMILGLALTTVGVMVAQIITLPFNAAVNVL  146

Query  311  IYSDLKANYRGPQ  323
            +Y+D +       
Sbjct  147  LYTDQRMRTEAFD  159


>WP_023451178.1 hypothetical protein [Asticcacaulis sp. AC402]ESQ73651.1 hypothetical 
protein ABAC402_18335 [Asticcacaulis sp. AC402]
Length=251

 Score = 50.2 bits (116),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 24/207 (12%), Positives = 52/207 (25%), Gaps = 13/207 (6%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L  +                  +             +A +          L      +  
Sbjct  33   LFFLVPYVAGYFALYAPLIDAGQSYEGSQTLAYARYYAFIAWYFLLRAAALCTAGRLIHE  92

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +   ++   + ++           + ++L + V  G    ++PG+L      F    L 
Sbjct  93   KLKGNNLPFAQMLREAPWLWLRSLPVYLVLSVAVYLGQYAWVVPGMLASFVAAFVLPPLV  152

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA-------------RIPY  287
             +    + A+     L  G +  +     L+   +L   +                    
Sbjct  153  LERQSLIGAIRTGWTLCRGRFLILAVNIALVTGFALAAQWAIGQANTRFSSGMSFVAASI  212

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYSD  314
            + +AA     LL T  S   Y     D
Sbjct  213  LLQAALAVVQLLQTALSLSLYACARDD  239


>OLS15241.1 hypothetical protein RBG13Loki_1127 [Candidatus Lokiarchaeota 
archaeon CR_4]
Length=562

 Score = 51.3 bits (119),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 41/485 (8%), Positives = 116/485 (24%), Gaps = 9/485 (2%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             + C  CG E +    +   +  +  C  C   +     +       +           R
Sbjct  8    RIYCSECGHEIDAEFVERLRQGETVFCEACGSRITVKIQKRVGLPRAEKTQPQTTYTPPR  67

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                     +                       + +   +I        +    +    +
Sbjct  68   TPEPPIKNTRVAPEPTMDDILGAQRGKGGAGSEASASKDNIPDWRTSWSKSDQSKIDRAI  127

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
                         IF  ++L   T L    +   ++  +A+     L  +  + ++    
Sbjct  128  QRLNKLSRSIGILIFVIMVLVNLTALITSVRTASFSWQVASGHIAALACALGSLTVDARF  187

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                     S +          +   +     G G+L+L     +F              
Sbjct  188  RNMIKKKDYSFRG-----IDLIVWGFVGCFGYGVGALVLAKGITIFVYSLHPKSGFPHLT  242

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
                   +     L +      F  +     +++T +        +        +LLL  
Sbjct  243  RTQKFHVI--ISRLNANSASVGFLIWFGTFPLAITTAIAMHAPGLISFVVMGFIALLLDK  300

Query  303  FSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQL  362
            F F          + N        +      +   +   +++ G+ ++ +S  +    + 
Sbjct  301  FLFTPKLKGSLKGRRNVGLGVGMIVVGILGAMDCGMGVVLILKGVAVIVISSISDRQPKP  360

Query  363  LSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTL  422
                   +     +  +       + +     S   + +  ++      +  ++      
Sbjct  361  QVPSSQPKPLPKPEKGEEIIPAAKVTKVASLPSPIPFPIPKTEIPDFIPQKPITPQSTPT  420

Query  423  FADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARI--EIDKVLDDDARDLYDRQHSFEH  480
                             ++    P +S   +  A I   ++ V    +  + DR    + 
Sbjct  421  VVPIKKIPPSIKTTEHSVQEQSVPPISGDFEKQAAIQGYLNHVYTVISSKIRDRIQKLDI  480

Query  481  PAFHW  485
            P    
Sbjct  481  PENEK  485


>HCF56487.1 hypothetical protein [Myxococcales bacterium]
Length=159

 Score = 49.0 bits (113),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 10/35 (29%), Positives = 17/35 (49%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            V CP C    N   +K+PA  ++ +C +C  +  
Sbjct  2   RVACPSCNTVYNVDDAKIPASGANLKCAKCKASFP  36


>KAD5317672.1 hypothetical protein E3N88_17618 [Mikania micrantha]
Length=660

 Score = 51.7 bits (120),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 28/210 (13%), Positives = 70/210 (33%), Gaps = 1/210 (0%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            + +  L ++   A      ++     ++  ++ +   +   T   I L  SW   ++  +
Sbjct  416  IMLVNLFVLGLSALSLIISIVFLVATVSSSSEAYTAKVQNFTEIIIKLKKSWKKPTVTSF  475

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                    F  +      + S      +  ++ G   L + +  +     +     V   
Sbjct  476  YMVLFTMGFFFVCFFCVGMVSVFATGTVAYVLYGVVGLSIAVLWIYISALWMMSLVVSVL  535

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA-ANLAFSLLL  300
            +++GGL A+ ++R L+ G         VL  V      ++T+ I  +  + A    +   
Sbjct  536  EDVGGLGAIVRARELMKGEKVKTSVLVVLFYVAYGVAHWMTSAINLLSMSEAPGVSTTGH  595

Query  301  TPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
              +S L     +   K  +         ++
Sbjct  596  HVWSTLEATYSHDSPKDMHTLCDSLQQSQK  625


>NJM62597.1 hypothetical protein [Oscillatoriales cyanobacterium RU_3_3]
Length=172

 Score = 49.0 bits (113),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 22/102 (22%), Positives = 39/102 (38%), Gaps = 3/102 (3%)

Query  197  LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG-GLQALEKSRL  255
            L  +    L  +        G L+++I  L   V F     V+A +     + A+++S  
Sbjct  64   LAGLPFAVLAALGSPAAAAAGLLVMVIILLYVMVKFILIAPVIAIEGTRNPITAMQRSWR  123

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            L  G+ + I    +LL      ++ L   I  VG   +   S
Sbjct  124  LTKGNSFRIAVFVLLLFFTIGIIAALVTGI--VGVVLSALGS  163


>MAG22987.1 hypothetical protein [Rhodospirillaceae bacterium]HAQ33126.1 
hypothetical protein [Rhodospirillaceae bacterium]
Length=225

 Score = 49.8 bits (115),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 12/38 (32%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M    CP C        + L  +  + RC +C      
Sbjct  1   MIL-TCPSCDTRYQLDMAALRPQGQTVRCFKCKHPWTQ  37


>MBI3392904.1 hypothetical protein [Nitrospirae bacterium]
Length=319

 Score = 50.9 bits (118),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 25/191 (13%), Positives = 59/191 (31%), Gaps = 2/191 (1%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
                  I    +        +       ++                     L    + G 
Sbjct  82   WTVAASIAFFFVQAGTIHCLARSAQDLVSFEMSVFWEGGKKFWWPLTVLASLWFPVVLGI  141

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
            +   +             G+   G+    +++ ++ VG   LL +   LL   ++ + Q 
Sbjct  142  V--LLVGGLGFAAFWWAAGMAQKGNVGGAIVVGVMGVGSSVLLFLAAALLAGAYWVYGQV  199

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            +L  +  GG  A       V+G++    G ++LL+V  +  S   + + +      +   
Sbjct  200  ILVVERTGGFGAFRDGIKFVNGNFLRSIGFYLLLIVGLVAASMAMSIVTFPFSIIPVIGF  259

Query  298  LLLTPFSFLYY  308
             +  P  F++ 
Sbjct  260  FIQIPIQFVFM  270


>NIA23486.1 hypothetical protein [Proteobacteria bacterium]
Length=295

 Score = 50.5 bits (117),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 19/168 (11%), Positives = 61/168 (36%), Gaps = 10/168 (6%)

Query  146  TWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTL  205
             +L   N   +    +A    +L  ++++       +      +  +M    R+     +
Sbjct  119  AYLFVGNVFLKRFFQMAIWFILLFFVTFLFNFALNDLSLFISKIMANMVPIFRYSLQIIV  178

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
               ++ ++V           ++    +FF   ++  +N     A++K   +V    + + 
Sbjct  179  FTFIMGILVLV--------YIISAFLYFFLPAIVIVENCSVKVAMKKIFTIVK--SFKVL  228

Query  266  GRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
              +++  VI + ++F+ +       + ++    L      LY +++  
Sbjct  229  MNYLVFTVIVVFITFVLSIAGMGLVSVSIVVHALYPFLFVLYMFVLIY  276


>TMF60332.1 hypothetical protein E6I20_14375 [Chloroflexi bacterium]
Length=149

 Score = 48.6 bits (112),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 28/138 (20%), Positives = 53/138 (38%), Gaps = 12/138 (9%)

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
            + +L     S + ++  +     +     V+  + +G   AL++S  L  GH W I G  
Sbjct  11   VGVLAFVAASCIGLVVTIYLAASWLVAPVVVTLEGVGPTTALDRSWKLADGHRWRILGIQ  70

Query  269  VLLLVISLTLSFLTARIPYVG------------EAANLAFSLLLTPFSFLYYYLIYSDLK  316
            +LLLV+ + LS L + +  VG            +  N A +++  P  +  + + Y DL+
Sbjct  71   LLLLVLQVVLSGLISALFIVGLSQDQTVQVIVQQLVNFAANIVWAPIQWAAFTVFYYDLR  130

Query  317  ANYRGPQHPPIKRQWLPL  334
                              
Sbjct  131  VRKEAFDLQVAAEALPTP  148


>WP_181040972.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein, partial [Enterococcus faecium]PQG94069.1 
glycerophosphodiester phosphodiesterase, partial [Enterococcus 
faecium]
Length=298

 Score = 50.5 bits (117),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 30/285 (11%), Positives = 69/285 (24%), Gaps = 18/285 (6%)

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            +     +   ++        +     ++    LL+ I      +   F    +   +   
Sbjct  3    LSFNSDLLSKIKIPAFIMDFIFANRWIIVSSFLLVYIFLGYIGIRLIFALPEMILRDRPF  62

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG-----------EAANLA  295
              A+ +S  L      AI G+F+++    L LS L   +  +               +  
Sbjct  63   RAAIRESWSLTKSRLLAITGQFIVIGGTILLLSSLGYIVVILAQSMVEQFFPDYALISAV  122

Query  296  F------SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLL  349
            F       +LL         + Y  +         P I + ++P    +    L    L 
Sbjct  123  FAMTLLQGILLFNIVMSTVGIFYIIVDFMDDEGFLPEIPKWFIPQAPNLRFSALKNTGLT  182

Query  350  VSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKT  409
            +      +              +          ++     +    +              
Sbjct  183  LFAVFFGIGVCLYNMDYLTSAVQTKPVTVSHRGVSTQNGAQNTLEALEKTSRDYHPDYVE  242

Query  410  TSEGGLSLGPVTLFADRFWAD-DQNPHLWLKLELSDFPNLSLAQK  453
                    G   +  D               L L +  NL++ + 
Sbjct  243  MDVQETKDGHFVVMHDANLRHLTGVNGTPQDLTLKELTNLTVTEN  287


>WP_064441743.1 DUF2510 domain-containing protein [Hoyosella altamirensis]MBB3036593.1 
hypothetical protein [Hoyosella altamirensis]
Length=324

 Score = 50.9 bits (118),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 36/216 (17%), Positives = 68/216 (31%), Gaps = 4/216 (2%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
            ++   +      G  +     +     F+ I    LL  A  L          + ++  A
Sbjct  99   MMLLWFGATAALGVWMFSSIDVSAGSFFSTIVLLFLLIFAIELVAFAVIATAVVAVSEPA  158

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII-P  224
                 +   T    I      +  F  +   L         ++L+ + V    + ++   
Sbjct  159  IRGKAIEARTARDKIIASAPRLAGFMVIHGVLTLAPYVVGGMVLVTIPVLLIFVPVVWAF  218

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
             +   V +     V+  +    ++AL +S  L+ G WW   G  V  LV+     F+   
Sbjct  219  TMWLVVRYSLTIPVILLEGATTMRALRRSDQLIQGSWWRSLGIQVAALVLMWCAFFVGYA  278

Query  285  IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
            + + G    L  S     F      L+Y DL     
Sbjct  279  VTF-GLGLFLIGSGW--AFYIAVVLLLYCDLTLRKE  311


>MBI5448614.1 hypothetical protein [Gammaproteobacteria bacterium]
Length=249

 Score = 50.2 bits (116),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 32/210 (15%), Positives = 62/210 (30%), Gaps = 14/210 (7%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
               L      +      LL    +       +   +LL  V   +   + +       I 
Sbjct  36   FIPLLCSQYVSLQTDVALLNANRFDYNDITPFTLCLLLLNVIISMASYATLLTYTRSLID  95

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
               +  +   K              L IL +  G +L I+PG+   V  F    ++  ++
Sbjct  96   TQPLSAWMLAKRIAIKTPLLVFSGFLAILSLFVGFILFILPGIYIFVVLFLFYPLVILED  155

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY--------------VG  289
                +    S  L+  HWW  F    L + +  T S   +++                + 
Sbjct  156  NNPFKDFIHSFSLIKHHWWRTFFSLTLPIFLLTTFSLSCSKLLAYSLTPAHADTLRLSLR  215

Query  290  EAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
                  F  +L P+  +    +  DL+   
Sbjct  216  FLFQTLFMSVLFPWFCIQTLAVLHDLRLRR  245


>TES90920.1 hypothetical protein E3J87_08890 [Candidatus Cloacimonetes bacterium]
Length=307

 Score = 50.5 bits (117),  Expect = 7e-04, Method: Composition-based stats.
 Identities = 19/188 (10%), Positives = 54/188 (29%), Gaps = 0/188 (0%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             L   L+        + ++LL         +   +           + L          I
Sbjct  81   FLLASLVLWAFVSGGVRASLLENILKEKKFEFNTFIANGKKFFGRIVGLWALTGLIFFAI  140

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
            +I    +G          +  S    +++ +L      ++  +   L  V+       L 
Sbjct  141  FIILGGLGTLIFFLCFSLYETSEVGAILIGVLSGFIIFIVYFVTAFLLGVFLTIANTYLI  200

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
             ++   + +++ S   +  +    F   + L +I+  ++F    I           ++  
Sbjct  201  VEDAEVIPSMKGSIRFIKKYPGHTFLVVLFLFLIAFGVAFAYNLITIPVTMIPYVGAMFS  260

Query  301  TPFSFLYY  308
               + +  
Sbjct  261  FILAPVQM  268


>RMD51407.1 hypothetical protein D6827_02300 [Candidatus Parcubacteria bacterium]
Length=218

 Score = 49.8 bits (115),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 26/163 (16%), Positives = 51/163 (31%), Gaps = 2/163 (1%)

Query  134  APIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM  193
               F   LL   +      +  +       +A      + +     +   K  +      
Sbjct  1    MVPFVINLLAFISLNADTARFVEVLTSGIQIALFFWLKAVIIILTAVEHDKKPINTRLLG  60

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
             +         L      ++V  G L +I PG+LF +WF F   ++   N     A   S
Sbjct  61   TVSWYLALPLLLASFAYFILVTIGMLAII-PGILFLIWFCFAPTIIVLKNTTLSNAFRDS  119

Query  254  RLLVSGHWWAIFGRFVL-LLVISLTLSFLTARIPYVGEAANLA  295
            R +  G    +  R V+ + V +     +   + ++  A    
Sbjct  120  RKITRGKELPLIWRIVVGVAVFTTIFMIVLVLMGFLISALQGI  162


>TML15946.1 hypothetical protein E6G39_06320 [Actinobacteria bacterium]
Length=324

 Score = 50.9 bits (118),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 34/216 (16%), Positives = 65/216 (30%), Gaps = 14/216 (6%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            L    L    + A I                Q W   + L  +  + +  + M   + ++
Sbjct  97   LLQAYLARDSSNAAIIEFFSSPGYLGRVFDGQRWSVVVYLLDLMALSVTGALMGKMLALW  156

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                    F  ++   R +       +L+ LV     +      LL          V+  
Sbjct  157  FQGVVPDAFLVLRASGRLIPRALAAFVLVHLVEAVAVIGGGAGTLLVIPLLSLTSPVIGI  216

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP--------------Y  287
            ++ G + AL +S  L   H+       +    +   L  L   +P               
Sbjct  217  EDAGAIAALRRSWTLTRRHYKHSCAFALFGASVVFLLGLLFGWLPTTIAARLGDGGYGWI  276

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
            V   A++ F+ + T F  +   + Y DL+    G  
Sbjct  277  VVGVASVIFATMTTAFIAIGSVIFYLDLRVRSGGMD  312


>MBC7792898.1 zinc-ribbon domain-containing protein [Clostridia bacterium]
Length=122

 Score = 47.8 bits (110),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 9/33 (27%), Positives = 14/33 (42%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           VRCP C  E     +++P      +C +C    
Sbjct  3   VRCPVCSTEYELDDARVPESGLQVKCSKCGHVF  35


>NVM26697.1 zinc-ribbon domain-containing protein [Desulfobacterales bacterium]
Length=69

 Score = 46.3 bits (106),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 11/38 (29%), Positives = 17/38 (45%), Gaps = 0/38 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  + CP C   ++ P  K+P      RCP+C     +
Sbjct  1   MVEITCPQCNYTKSIPPEKIPPGVRWIRCPKCGNRFEY  38


>NNG04887.1 DUF3426 domain-containing protein [Inquilinus sp.]
Length=228

 Score = 49.8 bits (115),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 7/38 (18%), Positives = 13/38 (34%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  + C +C        + + AK    +C  C     +
Sbjct  1   MI-IACENCATRFTVTDAAIGAKGRKVKCSSCGHVWRY  37


>HIB11376.1 hypothetical protein [Dehalococcoidia bacterium]
Length=87

 Score = 46.7 bits (107),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 10/58 (17%), Positives = 22/58 (38%), Gaps = 1/58 (2%)

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA-NLAFSLLLTPFSFL  306
            + +S  LV G+W  + G   +  ++   L  +   +      A     +++  PF   
Sbjct  1    MSRSAELVRGNWSRVAGILFVAGILFGILGGIIGFLVGFIPVAEATIGTIIKAPFLIF  58


>RME62938.1 DUF3426 domain-containing protein [Alphaproteobacteria bacterium]
Length=322

 Score = 50.5 bits (117),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 9/38 (24%), Positives = 14/38 (37%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  + C  CG       +KL     + RC +C  +   
Sbjct  1   MI-ISCTACGTRYVVDPAKLGDAGRTVRCAKCHHSWFQ  37


>NWF77212.1 hypothetical protein [Chloroflexi bacterium]
Length=267

 Score = 50.2 bits (116),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 28/153 (18%), Positives = 53/153 (35%), Gaps = 10/153 (7%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
            +L+          + I    + + R   L +  +       I   + +   ++L+ I  L
Sbjct  117  LLVPAIAGLALYTLMISGMTIVISRYTFLRMGPLVQGASPSIPSTVALILVAVLIFIVTL  176

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
                           +N  G   L  S  ++ G+WW +F  F++   +    S L ++IP
Sbjct  177  YVATRLRLSAPACVLENNFG---LNTSWKVIKGNWWKVFAIFLIFGAM----SALISQIP  229

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
             +G          + P S     LIY  L+   
Sbjct  230  IIGIYLRGVI---VEPLSITAATLIYFQLQEAK  259


>MBJ7525797.1 zinc-ribbon domain-containing protein [Sphingomonadaceae bacterium]
Length=104

 Score = 47.5 bits (109),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 9/39 (23%), Positives = 12/39 (31%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  V CP C      P + +       RC  C      +
Sbjct  1   MLLV-CPSCRTRYVVPDNAVGVDGRKVRCANCKHGWFQE  38


>HBE54169.1 hypothetical protein [Cyanobacteria bacterium UBA11369]
Length=283

 Score = 50.5 bits (117),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 36/230 (16%), Positives = 74/230 (32%), Gaps = 29/230 (13%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAI--LLATVAYILLGLSWMTG  176
            W L+ IY        +   + L            +  +  +   + +   + + +S +  
Sbjct  44   WSLIPIYGWAKACMISAQIARLAFSELVEQPESVKTAEDRVSSRMWSFLGLQILVSIILF  103

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
            +    +      +     L LR+    +  L LL L+    +L +    +         +
Sbjct  104  ATNFGLSIVGSFITAIPALLLRNSSQDSGGLALLALIRLVVNLAIFAAYIWVYSRVMIPE  163

Query  237  YVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE-----  290
              LA + N+    ++ +S  L  G    I G  ++  +I+L + F+   IP +       
Sbjct  164  LPLAIESNVDVGTSISRSWDLTKGSVLRIQGVVLVASLITLPI-FVLVLIPIMLFFPIIA  222

Query  291  --------------------AANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                                  +L   LL  PF      +IY DL++   
Sbjct  223  TSSSPDTVGGAIVLFILSILIVSLLIGLLTMPFWQAIKAVIYYDLRSRRE  272


>MAI61605.1 hypothetical protein [Micavibrio sp.]OUT91955.1 hypothetical 
protein CBB87_03845 [Micavibrio sp. TMED27]
Length=340

 Score = 50.9 bits (118),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 9/44 (20%), Positives = 17/44 (39%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M    CP+C  +   P   +P +    +C +C +     P   +
Sbjct  1   MIL-TCPNCSTKYQFPGDDIPPEGQKVKCTKCGEVWEEFPDPDE  43


>MBD0313472.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Microcoleus sp. T3-bin5]
Length=113

 Score = 47.5 bits (109),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 23/95 (24%), Positives = 33/95 (35%), Gaps = 6/95 (6%)

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
            G + LIIPGL       F  Y    DN   L +L  S  L  G WW +F    L+ ++  
Sbjct  1    GCIALIIPGLYVAYRLIFSLYATVIDNSSALDSLSSSWELTKGRWWLLFRAICLIALVVF  60

Query  277  TLSFLTARI------PYVGEAANLAFSLLLTPFSF  305
                L + +          +  +     L  P   
Sbjct  61   VPIILISLLIDPTAKSVAYQLTSNLLGFLAGPLMN  95


>PPR29948.1 hypothetical protein CFH31_00124 [Alphaproteobacteria bacterium 
MarineAlpha9_Bin1]
Length=105

 Score = 47.1 bits (108),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 15/36 (42%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + CP C  +     +K+P+     +C +C    
Sbjct  1   MI-ISCPKCKIKFRVTENKIPSHGRRVQCSQCSYQW  35


>MBI2615337.1 zinc-ribbon domain-containing protein [Gemmatimonadetes bacterium]
Length=132

 Score = 47.8 bits (110),  Expect = 8e-04, Method: Composition-based stats.
 Identities = 11/31 (35%), Positives = 14/31 (45%), Gaps = 0/31 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           VRCP C        +K+P   + ARC  C  
Sbjct  3   VRCPSCETVFRVDPAKVPEAGTRARCTVCAY  33


>KKW35289.1 hypothetical protein UY81_C0046G0002 [Candidatus Giovannonibacteria 
bacterium GW2011_GWA2_53_7]
Length=513

 Score = 51.3 bits (119),  Expect = 9e-04, Method: Composition-based stats.
 Identities = 32/138 (23%), Positives = 53/138 (38%), Gaps = 2/138 (1%)

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
                L +S +   + I      V     + +G    G    + ++  L++  G + L + 
Sbjct  115  FLSFLLVSVLFSLVLIATLLFLVPGIFFLLVGNSLSGIGATVAMIGTLLLIVGVVALTVT  174

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
             +L+ V FFF  Y L  DN  G  AL+ S  LV GH+W +  R VL   +   +      
Sbjct  175  AVLWGVRFFFATYTLLIDNHKGRDALKASYRLVHGHFWNVVVRLVLPKALFFLVFAFGLF  234

Query  285  IP--YVGEAANLAFSLLL  300
            I         +    L +
Sbjct  235  IANTLATMIISGVAGLNI  252


>SFE08120.1 Uncharacterized membrane protein [Peptostreptococcaceae bacterium 
pGA-8]
Length=367

 Score = 50.9 bits (118),  Expect = 9e-04, Method: Composition-based stats.
 Identities = 20/178 (11%), Positives = 57/178 (32%), Gaps = 2/178 (1%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            + +   +    +  A+ L  A +     ++    I      +     +     +  +I  
Sbjct  89   FSVIASIYTIAVAGAVALGMAHFTLRFLKDKNTDIGNLFFGFTNYLPALGLSLLESFIIT  148

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-N  243
                +   +      VG  +    +   ++    +  +   +   +   F ++ LADD +
Sbjct  149  IPAFIIGGIIGIFIGVGLLSG-SGIAGFLIIILIICFMSYLVYVSLGLAFSEFFLADDLS  207

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            +G + A++ S   + G+   +FG  +     +      T  I  +         + + 
Sbjct  208  VGVIGAIKSSWKHMKGNRPGLFGFLLSYFGWAFLAIIGTGFIGLLIHLITGDGLINVI  265


>HDI61330.1 hypothetical protein [Desulfobacteraceae bacterium]
Length=488

 Score = 50.9 bits (118),  Expect = 9e-04, Method: Composition-based stats.
 Identities = 13/56 (23%), Positives = 18/56 (32%), Gaps = 1/56 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP  56
           M  + C +C A      S L    S  RC  C    +  P +       D   + P
Sbjct  1   MI-ITCENCHASYKLDDSLLDPAGSRVRCTNCGHVFVARPPQEIPPPEDDAPVSAP  55


>MBA2765166.1 hypothetical protein [Thermoleophilaceae bacterium]
Length=303

 Score = 50.5 bits (117),  Expect = 9e-04, Method: Composition-based stats.
 Identities = 32/156 (21%), Positives = 50/156 (32%), Gaps = 14/156 (9%)

Query  200  VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
            +      L L +  +   SLL+ IP     + +     +LA +  GG  AL +S  LV G
Sbjct  145  IPLLIPALWLGVGWITAYSLLIWIPLAWLAIGWIAAFPILAFEGTGGWAALSRSFELVRG  204

Query  260  HWWAIFGRF-----------VLLLVISLTLSFLTAR---IPYVGEAANLAFSLLLTPFSF  305
             WWA FG             ++L +  + + F                   ++L  P   
Sbjct  205  RWWATFGAVLLVSLVLGVAAIVLYIPYVVVLFTGGGAMTAMAASAVIGFVVTVLFYPALA  264

Query  306  LYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
                 +Y DL+   RG    P               
Sbjct  265  SLIASVYFDLRLRKRGVVPTPEGETGFGWEQIYAVP  300


>HDW96168.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=57

 Score = 45.9 bits (105),  Expect = 9e-04, Method: Composition-based stats.
 Identities = 12/42 (29%), Positives = 15/42 (36%), Gaps = 1/42 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
           M   +C  CG +     S L    +  RC  C  T    P E
Sbjct  1   MIL-QCQACGTKYRLEDSLLKPSGTKVRCSRCGFTWRVYPQE  41


>MBI1319779.1 hypothetical protein [Candidatus Hydrogenedens sp.]
Length=279

 Score = 50.2 bits (116),  Expect = 9e-04, Method: Composition-based stats.
 Identities = 26/191 (14%), Positives = 66/191 (35%), Gaps = 10/191 (5%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             L + ++ ++L         L                 +L++ +  +      +   +  
Sbjct  70   WLRVGVILMLLHMCRGNHNDLATMFKGFPYLLWYLVGTLLVSLLYILAFVPGLIAMVLCW  129

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL--LIIPGLLFCVWFFFCQYV  238
             +        +++         FTL+      ++    +L   IIP       +FF  Y+
Sbjct  130  LVMGISFAELQALPGAGLLNAWFTLVSEYGGFLILELVILGIAIIPPFYVSAVYFFVPYL  189

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
            + D  +  L+AL+ S  +  G    +        +  + +S L + + ++     LA ++
Sbjct  190  IVDKGMKPLEALQASAQITQGGRMGL--------INFMVVSSLLSILGFLALVIGLAIAI  241

Query  299  LLTPFSFLYYY  309
             +  F+ +  Y
Sbjct  242  PVVFFATVSLY  252


>MBI1274627.1 hypothetical protein [bacterium]
Length=267

 Score = 50.2 bits (116),  Expect = 9e-04, Method: Composition-based stats.
 Identities = 27/139 (19%), Positives = 49/139 (35%), Gaps = 1/139 (1%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
              A   L    +   +  +     + L               L  ++L + V  GS+L +
Sbjct  123  AFAIGFLIYGILFTYIAKFSRGQFLELSELKARAKDKKKDLFLAGLVLSIPVMVGSVLFL  182

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
             PG+L    F F   ++ +  +G  QA+  SR  V+      FG    L+ I   L  + 
Sbjct  183  APGILVLSLFAFTPLLIMERGMGFKQAMGASRKAVTSMNADNFGILFTLMAILT-LGGVM  241

Query  283  ARIPYVGEAANLAFSLLLT  301
              +P +     + + L   
Sbjct  242  GILPLLVILPMVIYGLTEI  260


>MSR02063.1 hypothetical protein [Gemmatimonadetes bacterium]
Length=30

 Score = 45.1 bits (103),  Expect = 9e-04, Method: Composition-based stats.
 Identities = 9/26 (35%), Positives = 11/26 (42%), Gaps = 0/26 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARC  29
           V C  C        +K+PA    ARC
Sbjct  3   VTCTACQTVFRVDPAKVPAAGVRARC  28


>OYO04906.1 hypothetical protein CGZ95_03285 [Propionibacteriaceae bacterium 
NML 150272]
Length=286

 Score = 50.2 bits (116),  Expect = 0.001, Method: Composition-based stats.
 Identities = 16/139 (12%), Positives = 36/139 (26%), Gaps = 22/139 (16%)

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL----TLSF  280
                 +       ++  + +G + A+++S  L    +W  F   ++  +I+      LS 
Sbjct  147  MTWLSIRLLLSPILVIVERVGIISAIKRSFALTRRAFWITFATILVASLIASTAAQVLST  206

Query  281  LTARIPYV------------------GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGP  322
              + I  +                            L  P+      +IY D +      
Sbjct  207  AVSFILPLVATATGGQEQVPRLAIVTIALTQAITVALTQPYLASIRTMIYVDRRMRAEAY  266

Query  323  QHPPIKRQWLPLTAAIFGW  341
                I           +  
Sbjct  267  DAELIAAHRPAPEGTPWHP  285


>WP_066805025.1 hypothetical protein [Sphingomonas asaccharolytica]
Length=222

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 28/162 (17%), Positives = 53/162 (33%), Gaps = 0/162 (0%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            +    V+          L  A             + + +     +    +  + +  I +
Sbjct  5    FSFSEVIGATFALMVSDLPMALGAIAVIAAASTFLDVTSTTLSNILAIPVLVAQYFLIRR  64

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
                    +   L   GSF LL I+  L V  G +LLI+PG+     +      L  +N 
Sbjct  65   LVERQGLQIADRLGGFGSFFLLGIVTSLGVAFGFVLLILPGIYLSARWSMASAALIAENQ  124

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            G  +A+ +S      H   I   + L++   +   F    + 
Sbjct  125  GSGRAMARSWDASRDHVLPIALTWALIVAPLIAGMFALGGVG  166


>TIU27954.1 hypothetical protein E5W34_02840, partial [Mesorhizobium sp.]
Length=168

 Score = 48.6 bits (112),  Expect = 0.001, Method: Composition-based stats.
 Identities = 30/150 (20%), Positives = 58/150 (39%), Gaps = 25/150 (17%)

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG----------  217
            L+G + ++ +    I      +   ++  L+H+     +   + LVV             
Sbjct  20   LVGQAVLSKAAAGEINGNRPSVASCVQTALQHIFPVLGIGYTIYLVVFVARSGSAAVELS  79

Query  218  ---------SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
                      LLLI+P +++ V       V  ++ +G + ++ +SR L  G+ W IFG  
Sbjct  80   YSPQAGGLVFLLLIVPCIVWGVAISVAVPVSVEEGLGTVASMSRSRDLTKGYRWWIFG--  137

Query  269  VLLLVISLTLSFLTARIPYVGEAANLAFSL  298
                +IS+ +  L   +    E      SL
Sbjct  138  ----LISVVVIGLLLGLRLATELGLTFVSL  163


>HHI88810.1 thioredoxin [Hellea balneolensis]
Length=80

 Score = 46.3 bits (106),  Expect = 0.001, Method: Composition-based stats.
 Identities = 9/37 (24%), Positives = 11/37 (30%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M    CP C          L     + RC +C  T  
Sbjct  1   MIL-TCPDCATRFKIKPEALGPNGRTVRCSQCKATWF  36


>MBI3758476.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=305

 Score = 50.2 bits (116),  Expect = 0.001, Method: Composition-based stats.
 Identities = 34/168 (20%), Positives = 59/168 (35%), Gaps = 10/168 (6%)

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG-----LFRSMKLGLRHVGSFTL  205
                   A+ LA  A   +       ++                    +  +    +F  
Sbjct  84   FGHWVLVALHLAFAAITFMVKLVSLCAIKRIRSGERKARLLNETIEVYREAISLAPAFLW  143

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
            + +L ++ +  G +LL++PGLL  VW +F  Y L  +N     AL  SR L+ G    + 
Sbjct  144  ISLLQVIAIVVGIVLLVVPGLLAMVWLYFSSYALVFENKRSWPALFHSRELMRGRTIKVA  203

Query  266  GRFVLLLVI-----SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYY  308
             R V+ L +                  +G  A  + +L +T F F   
Sbjct  204  VRIVVFLALWSGYNWWVGGAFLGLSLLIGPIAIWSGALCVTIFLFSLI  251


>WP_180272178.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Actinomyces sp. CtC 72]
Length=120

 Score = 47.5 bits (109),  Expect = 0.001, Method: Composition-based stats.
 Identities = 17/104 (16%), Positives = 34/104 (33%), Gaps = 21/104 (20%)

Query  241  DDNIGGLQALEKSRLLVSGHWWAI--------FGRFVLLLVISLTLSFLTARIPYVG---  289
             +N+   + + +S  L  G +W +            ++  +I+  L  +      +    
Sbjct  8    LENVSVWEGIRRSWQLTRGSFWRVLGALLLSALLTGLVSSLIAFPLGLVAGATSVLAPGA  67

Query  290  ----------EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                         +   S ++ PFS     LIY DL+    G  
Sbjct  68   AISAAADVLVTFLSFLLSAVIMPFSAAVVALIYIDLRMRREGLD  111


>NED07194.1 hypothetical protein [Streptomyces sp. SID6648]
Length=67

 Score = 45.9 bits (105),  Expect = 0.001, Method: Composition-based stats.
 Identities = 19/67 (28%), Positives = 28/67 (42%), Gaps = 0/67 (0%)

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
               V F      L  +  G  +A+ +S  LV G WW IFG  +L  VI+  ++ +     
Sbjct  1    WLWVRFSLASPALMLEKQGIKKAMARSVKLVRGSWWRIFGIQLLATVIANVVASIVVIPF  60

Query  287  YVGEAAN  293
                AA 
Sbjct  61   TFLAAAL  67


>OFW87939.1 hypothetical protein A3B66_01025 [Alphaproteobacteria bacterium 
RIFCSPHIGHO2_02_FULL_46_13]OFW98483.1 hypothetical protein 
A3J37_03415 [Alphaproteobacteria bacterium RIFCSPHIGHO2_12_FULL_45_9]
Length=256

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 11/39 (28%), Positives = 15/39 (38%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M    CP C A  N P+  +     + RC +C      D
Sbjct  1   MIL-TCPSCSASYNVPNEAIGIDGRAVRCKKCKHEWFQD  38


>PLY04071.1 hypothetical protein C0624_06265 [Desulfuromonas sp.]
Length=860

 Score = 51.3 bits (119),  Expect = 0.001, Method: Composition-based stats.
 Identities = 42/241 (17%), Positives = 79/241 (33%), Gaps = 4/241 (2%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                 +LL +   + +         + L  ++ LGL  + SF  L     L++ G +L L
Sbjct  530  LVAIGMLLIVVLSSNATLFLFIDQYLPLKPALALGLLRLPSFVWLYFWHSLIMTGATLAL  589

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            ++PG  F  WF      LA +   G  +L +SR    G          L   +   LS L
Sbjct  590  LVPGYFFTRWFSLAPLCLAAEQTRGFSSLLRSRAYCRGREK--LVTRALWPALLFPLSGL  647

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
               +  +     L   L++ PF      ++Y D  A+     +     Q L   A     
Sbjct  648  IG-LMLLPGYWKLLPPLVIPPFF-SVLTVLYQDAAASRTTLNYDNTLSQRLQWPALSMAG  705

Query  342  MLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKL  401
            +L+  ++ + +         + +     +         T       P     +   + + 
Sbjct  706  LLVFTIVSLLVLGPGRIRSSIYTLLLHTELVPTVVDTVTEKGKPIRPVVDSAVFELEIEG  765

Query  402  L  402
             
Sbjct  766  Y  766


>MBI3830845.1 hypothetical protein [Planctomycetes bacterium]
Length=314

 Score = 50.2 bits (116),  Expect = 0.001, Method: Composition-based stats.
 Identities = 37/304 (12%), Positives = 73/304 (24%), Gaps = 1/304 (0%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                C  CG                  C       + D   +      D           
Sbjct  3    IDFDCTQCGKPLYAQDRFAGRTIRCPGCGLAMVIPVPDGFAAANLAPIDASTAKAADHAD  62

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
               P        +                     S      ++  LA  W          
Sbjct  63   TLGPGVSPGEIVQLAFEAFKRNWLLGAAMNFLYGSLLTALILALWLAVGWSALAPSHASS  122

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                  G  +A   +++A  L          +    A+       +    +     + + 
Sbjct  123  NPYGSGGFWVAQLFLWTAASLLTPGLYWAGVKMADQALGAPVRPSLGDFFAGFQRPLTML  182

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                 +    ++   L       L  +   L        L+    ++    F+   +  +
Sbjct  183  GVWFPLFASTNLLYFLWFGSQARLPFLRSTLATAIFQGALLALYGVYFARLFWALPLALE  242

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT-LSFLTARIPYVGEAANLAFSLLL  300
              +G L+AL +S  + +G     FG ++L L + +    F    I + G  A      L 
Sbjct  243  TRLGSLEALSQSWRMTAGRGALAFGVWMLALFLYVAGFCFCMLGICFTGVLAQCLIGALY  302

Query  301  TPFS  304
              F+
Sbjct  303  WKFT  306


>WP_191750981.1 hypothetical protein [Clostridium sp. Sa3CUN1]MBD7916236.1 hypothetical 
protein [Clostridium sp. Sa3CUN1]
Length=289

 Score = 50.2 bits (116),  Expect = 0.001, Method: Composition-based stats.
 Identities = 16/178 (9%), Positives = 62/178 (35%), Gaps = 6/178 (3%)

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
                   +    +        ++  +      +  +    ++L L+ L +G   ++    
Sbjct  110  VVSWKSATRYVWNRVWSAIGLNILAWFMFLGVILVLVFLVIILSLITLGIGAFIVIPCAI  169

Query  225  GLLFCVWFFFCQYV--LADDNIGGLQALEKSRLLVS-GHWWAIFGRFVLLLVISL---TL  278
             ++  +  F   +      +++G + ++ ++ LL   G++W+  G+   +  I +    +
Sbjct  170  AIIVIISPFIKLFNSIFIVNDLGVIDSIREAFLLFKKGYFWSTIGKLAAISGIYIGVIIV  229

Query  279  SFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTA  336
             ++   +P++G    +   +++  +   Y  ++  D   +               +  
Sbjct  230  LWIFELLPFIGFIIAILGQVIMNIYVISYLNILVLDRIGSKDNLFRNNDDSGNSFIDP  287


>XP_008466772.1 PREDICTED: uncharacterized protein LOC103504102 [Cucumis melo]
Length=158

 Score = 48.2 bits (111),  Expect = 0.001, Method: Composition-based stats.
 Identities = 15/101 (15%), Positives = 39/101 (39%), Gaps = 0/101 (0%)

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
            + +V   +++L+       + +   + +   +   G +A+ +S+ LV G    +   F++
Sbjct  1    MTIVIIFTIILLAGEFYLILTWQLSKTIAVLEEFCGFKAMARSKALVKGKMRMVIKLFIV  60

Query  271  LLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
            L      +  + +R+        +   L+L     L   L 
Sbjct  61   LSFPMEVVQLVFSRLLIQSTIIGIVGKLVLIIIWMLLISLF  101


>RMI04990.1 hypothetical protein D6681_08750, partial [Calditrichaeota bacterium]
Length=251

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 33/175 (19%), Positives = 65/175 (37%), Gaps = 8/175 (5%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L+ +  L  ++     +  +L      +  Q  +W+ A+  A     L  +         
Sbjct  2    LMFLGELIFLIGGLLAYVGVLSTGCAEMENQAFSWEDALRRAFSVRSLRLVGATIVMGLA  61

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +    V +   +            +  LL+ V  G               ++     + 
Sbjct  62   VVGVFLVPIVVILLGTALEENILNGVGALLLPVAFG-------VTFYLIFRWYVVSPAIV  114

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS-LTLSFLTARIPYVGEAANL  294
             ++ G LQAL +S  LV GHWW  FG  +L+++++ L +S +T  I +V     +
Sbjct  115  WEDAGVLQALSRSASLVKGHWWRTFGVVLLMMLVTQLAISLITTPISFVAMWGFI  169


>MXW90477.1 hypothetical protein [Rhodospirillaceae bacterium]MYB14388.1 
hypothetical protein [Rhodospirillaceae bacterium]MYI51012.1 
hypothetical protein [Rhodospirillaceae bacterium]
Length=150

 Score = 48.2 bits (111),  Expect = 0.001, Method: Composition-based stats.
 Identities = 10/43 (23%), Positives = 15/43 (35%), Gaps = 1/43 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
           M   RCP+C        + L     + RC  C +    +P   
Sbjct  1   MIL-RCPNCATRFLVDPAALGPAGRTVRCGACGREWRQEPERP  42


>MBK67525.1 hypothetical protein [Rickettsiales bacterium]
Length=254

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 10/44 (23%), Positives = 13/44 (30%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M    CP C          + A     +C  C      DP E +
Sbjct  1   MIL-NCPECSTRFLVNDKAIGATGRRVKCARCKTIWFQDPPEKE  43


>RKU30297.1 hypothetical protein C6497_04880 [Candidatus Poribacteria bacterium]
Length=395

 Score = 50.5 bits (117),  Expect = 0.001, Method: Composition-based stats.
 Identities = 20/238 (8%), Positives = 60/238 (25%), Gaps = 14/238 (6%)

Query  73   SKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLA  132
                       +      +       G    S   + S  +       L+ +      L+
Sbjct  108  YNWTPNHIETTNRKPIFWQILPYPMIGNMDDSVGWSWSINIGIIEYTPLMLLIFTLCPLS  167

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
                      +              +    +   + + +  +   +   + +    ++  
Sbjct  168  IIVASQIQASEIDVPTGTSPLTAHSSWKYTSSKALKVLIIPILFLVMTELSRL---IYVL  224

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
                L  +    L  +  +L +           +   + F      +  +N   L+  ++
Sbjct  225  FSFILPKLTHSYLGTVFTLLQILV--------TIYLLITFSLYNQCIIFENRSILEIFKR  276

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
            S  LV G        ++   +I+   + +++ +          F   L P    +  L
Sbjct  277  SYSLVKGVRLKFLLIYL---LIAWIAALVSSVLMGSALLILSIFFTELVPIQDAFSLL  331


>HIJ38483.1 hypothetical protein [Rhodospirillaceae bacterium]
Length=223

 Score = 49.4 bits (114),  Expect = 0.001, Method: Composition-based stats.
 Identities = 11/37 (30%), Positives = 12/37 (32%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V CP C      P S L     + RC  C     
Sbjct  1   MI-VVCPSCDTHFTLPPSALGPGGRALRCARCGHKWH  36


>WP_191518733.1 DUF4013 domain-containing protein [Candidatus Adamsella avium]
Length=252

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 23/154 (15%), Positives = 55/154 (36%), Gaps = 3/154 (2%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
            ++   S ++G M   +  T + L   +        S  + L+L    +    L + I  +
Sbjct  92   VMPRWSNISGIMLTGLKATIISLLYYIPSLALIAISAMVTLLLNQPNLILLLLPVNIVVI  151

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            +  ++F F   ++    +    AL     L+S           LL+++ L +S L   + 
Sbjct  152  ITSMFFMFMAMIIFIKRLSITDAL--DFSLISKILSQYLVDLFLLMLVMLGISLLIGLVG  209

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
            ++         +L+   +F    +  + +     
Sbjct  210  FLLCITC-IGMILIPFVTFYSQLVFANLMAQFNE  242


>WP_183640183.1 hypothetical protein [Neomicrococcus lactis]MBB5597241.1 hypothetical 
protein [Neomicrococcus lactis]
Length=415

 Score = 50.5 bits (117),  Expect = 0.001, Method: Composition-based stats.
 Identities = 30/316 (9%), Positives = 81/316 (26%), Gaps = 39/316 (12%)

Query  10   GAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRL  69
                +    + PA+       +         A++                      +   
Sbjct  3    QTPNDPNDPQQPAEGWRHPPQQGWGQSASGNAQNYGQYGPPGNIPGAPYYQPSAPVAKPG  62

Query  70   EIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLL--  127
             +  + ++            +    A    L  ++ +++ +   +         + +L  
Sbjct  63   SVGLRPLSLGDIFEGTFKSFKYAPLALVIPLLIVNVIMSVAGVRYFFYWMTHNVLPILEG  122

Query  128  -----GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
                    +        L                  I +      +     ++ S+   I
Sbjct  123  INTADSYTIDSPSDLQQLFPPAFINSFWSLGLVLSLITVVGTFLTVFCYGMVSVSVARGI  182

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLIL--------------------------------L  210
                    ++++L  R +G    L  +                                +
Sbjct  183  AGQKTSFTQAVRLASRKLGGLMGLAGIVSLLYVAVIVVFAATAVAMFNSVLTGSSLLVSM  242

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
             +       ++ I G L  V  F      A +++G L+AL +S  L   ++W + G  +L
Sbjct  243  FVPFFFLIPIVAIAGALITVKLFAAPQAAAIESVGPLRALGRSWELTRFNFWRVLGILLL  302

Query  271  LLVISLTLSFLTARIP  286
            + +I+     + +   
Sbjct  303  MGIITSIAVSIVSAPF  318


>MSR54880.1 hypothetical protein [Gemmataceae bacterium]
Length=428

 Score = 50.5 bits (117),  Expect = 0.001, Method: Composition-based stats.
 Identities = 38/304 (13%), Positives = 76/304 (25%), Gaps = 17/304 (6%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              V CP C A                RCPEC   +I      Q  +   + +        
Sbjct  3    IKVTCPECHATFLVGDEF---AGRPGRCPECTAVIIVAGPNLQAPEIHLDPSPYQAARGV  59

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                      + +     + +       +         L + ++  +  ++        +
Sbjct  60   EAYEDFPDRARRRREEDEQRSDYERDDYDDRAVEFKVDLEARAKAWSRVYKGLGYIQIAV  119

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            + +Y    VL          + P       +       L      +   + W+ G + + 
Sbjct  120  I-LYFFNQVLQSVFFVVHGGMDPKDPNALPDSGELAIGLGVAFIMLFACVFWILGRLSLI  178

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                      +       + S   L  L  L +     L   P      +  F   +   
Sbjct  179  RTPYLPARGWAKGSFFMVLASIASLFGLCCLFMMALGALANGPNPGAAAFLMFAVLMALL  238

Query  242  -------DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
                     + G+ AL K    +     A + R       S+ L F+   +  VG     
Sbjct  239  AMLLVAGGELCGMMALAKICDGLRAPSAANWARM------SMVLMFVLIGVLLVGACGIG  292

Query  295  AFSL  298
             +  
Sbjct  293  IYGA  296


>HCI61646.1 thioredoxin [Erythrobacter sp.]
Length=205

 Score = 49.0 bits (113),  Expect = 0.001, Method: Composition-based stats.
 Identities = 10/39 (26%), Positives = 15/39 (38%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  + CP C      P S +     + RC +C  +   D
Sbjct  1   MI-IACPACSTRYVVPDSAIGIDGRTVRCAKCKHSWFQD  38


>MBI4065111.1 hypothetical protein [Candidatus Gottesmanbacteria bacterium]
Length=224

 Score = 49.4 bits (114),  Expect = 0.001, Method: Composition-based stats.
 Identities = 39/193 (20%), Positives = 79/193 (41%), Gaps = 2/193 (1%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                   + LL  +++F   FS    +               I     A + +       
Sbjct  23   YQAWFFFLGLLPTIVSFLLSFSIAFFQGTFSEIAFVLVLLLIIGALITAIVSIWYYVFLY  82

Query  177  -SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
              + I +    + + +      + +       I+++L +  G +LLIIPGL+F VW++F 
Sbjct  83   RFIVIAVQGESIDIGQLAGQAWKRIAKLITTNIMMMLFILLGLILLIIPGLVFIVWYYFA  142

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
              +   ++   + AL++S+ LV G +W + GR V+   +++  S L  ++          
Sbjct  143  PIIAIIEDPK-IDALKESKNLVRGTFWPVTGRIVIFSCVAIVPSVLLQQLSPYLSLMWQI  201

Query  296  FSLLLTPFSFLYY  308
            F+   T  +FL+Y
Sbjct  202  FAPYFTLLTFLFY  214


>OQW46457.1 hypothetical protein A4S16_10670 [Proteobacteria bacterium SG_bin6]
Length=312

 Score = 50.2 bits (116),  Expect = 0.001, Method: Composition-based stats.
 Identities = 12/92 (13%), Positives = 35/92 (38%), Gaps = 3/92 (3%)

Query  210  LILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV  269
            ++L+V    L +++  +     +         +    + A+ +S +L  G  W + G ++
Sbjct  164  VVLLVALIWLAMVVAVIWLVAMWAATPGASVVERRWAIAAIRRSAVLTRGLRWRMVGLYL  223

Query  270  LLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            ++  + + +  + A    V        S  + 
Sbjct  224  VIFALYIGVGAVLAM---VQGLLLGIDSFSVA  252


>KEO86939.1 hypothetical protein EH30_04705 [Erythrobacter sp. JL475]
Length=210

 Score = 49.0 bits (113),  Expect = 0.001, Method: Composition-based stats.
 Identities = 26/147 (18%), Positives = 47/147 (32%), Gaps = 7/147 (5%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             V     G + M    +            +     + + +F  L  L  +V+G G + L+
Sbjct  47   IVPVGWAGSAIMLFMSYWLTINMLYFGGLAPGGVQKGLFAFLGLTFLYAIVIGLGWVALV  106

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
             P   F + F         +      A  +S     GH+W I G   + LV+S+    + 
Sbjct  107  FPAFYFGIRFSAVLGYGMGETGDFRDAFGRSWNATKGHFWEIAGALFVPLVLSVISFSII  166

Query  283  AR-------IPYVGEAANLAFSLLLTP  302
                     +P     A+  F+     
Sbjct  167  GSWADKSLQVPIGAAIASNFFAAATGA  193


>WP_081969459.1 zinc-ribbon domain-containing protein [Paracoccus sanguinis]
Length=409

 Score = 50.5 bits (117),  Expect = 0.001, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 15/36 (42%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V CP CGA+    +  +PA+     C  C    
Sbjct  1   MRLV-CPRCGAQYEIDAEAIPARGRDVECCACEHVW  35


>WP_099594523.1 zinc-ribbon domain-containing protein [Amylibacter kogurei]PIB23239.1 
hypothetical protein BFP76_09500 [Amylibacter kogurei]
Length=519

 Score = 50.9 bits (118),  Expect = 0.001, Method: Composition-based stats.
 Identities = 10/82 (12%), Positives = 19/82 (23%), Gaps = 1/82 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP+C A+       +P      +C  C      +P       +       P    
Sbjct  1   MRLI-CPNCTAQYEVAEGAIPENGRDVQCANCSHIWFQEPILFLDPVSKVETNLAPDYIP  59

Query  61  QRRIPSDRLEIQSKTVNCRRCN  82
           +                    +
Sbjct  60  EPEPEVAPRHQPLPPETQHFQS  81


>SBW01284.1 membrane hypothetical protein [uncultured delta proteobacterium]
Length=317

 Score = 50.2 bits (116),  Expect = 0.001, Method: Composition-based stats.
 Identities = 33/312 (11%), Positives = 68/312 (22%), Gaps = 23/312 (7%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             + CP C   R    +K+P     A CP+C     F   +    +        P      
Sbjct  2    QIHCPVCDYSREVNLAKVPPTAEFATCPKCRHRFRFRAVDLDAVEHASGPEPDPKHADVW  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                   +  S              + +                               L
Sbjct  62   DAVDSLHDRWSGKDAGDDERGYDDERDDDGETEGEQDDAPRYGHTRRDDVPIPWENPREL  121

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
            G        +   +F       A    P         L+      +L + W +    +  
Sbjct  122  GYGQSFTRTSLWALFQPSSFFAALNRRPALLPALAFYLIFGCFQYVLNVIWTSVLGNLVR  181

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
             +    +               ++  ++   +   ++L +   L   ++       L   
Sbjct  182  DRFVANMGEEAFER--------IVGNVIEHSLLTPAVLSVPFQLAIQLFLTAAVVHLLIR  233

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             I    A      L               +V    + F    IP  G      +  +L  
Sbjct  234  IISPRAA---DFALAFK------------VVAYAGVGFSLVVIPIAGSLIAPVWYFVLLL  278

Query  303  FSFLYYYLIYSD  314
                  + +  +
Sbjct  279  IGCRNAFGLPWN  290


>HGL81338.1 hypothetical protein [Deltaproteobacteria bacterium]HGU35853.1 
hypothetical protein [Deltaproteobacteria bacterium]
Length=118

 Score = 47.1 bits (108),  Expect = 0.001, Method: Composition-based stats.
 Identities = 11/62 (18%), Positives = 19/62 (31%), Gaps = 1/62 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V+C  CG +     + L  K +  RC  C       P   + +   + +        
Sbjct  1   MI-VQCEACGTKYRLDDALLKPKGTKVRCSRCGFIWTIYPDRGELSAFPEKVEGKKAKWY  59

Query  61  QR  62
             
Sbjct  60  LW  61


>WP_038471065.1 hypothetical protein [Candidatus Izimaplasma sp. HR1]AIO19504.1 
hypothetical protein KQ51_01628 [Candidatus Izimaplasma sp. 
HR1]
Length=251

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 24/167 (14%), Positives = 49/167 (29%), Gaps = 0/167 (0%)

Query  134  APIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM  193
              I         T+ N  + +  +   +     + L        +F  I +    +   +
Sbjct  75   LLIQHVRGKNDLTFNNFFHLDKGFYNFIFLRVIVALIFVVALIPVFPVIRELFTQVSIMV  134

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
                  V      +I   +     S LLI+   LF +      +++ D  I    A E S
Sbjct  135  DPHAIRVYLTNSDIIPRFIQSIRLSALLIVIFWLFTIRLQMVPFIIIDKKISLFDAFELS  194

Query  254  RLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
              +  G+++ I       ++  L +         +         L L
Sbjct  195  FKITRGNYFKILIFPFTYILWLLLIFTFVGTFYVIPLIVVGYGYLYL  241


>EGS5729550.1 DUF975 family protein [Clostridium perfringens]
Length=148

 Score = 47.8 bits (110),  Expect = 0.001, Method: Composition-based stats.
 Identities = 15/104 (14%), Positives = 41/104 (39%), Gaps = 7/104 (7%)

Query  225  GLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS------LT  277
             ++  ++FF  +Y++ ++  +G  +A+ K+  ++ GH W +F   +  +  +      + 
Sbjct  21   NIIVSLFFFPVKYIIVEEPELGIWEAVGKAFKMMKGHKWELFVLILSFIGWAILAVLPIV  80

Query  278  LSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
            L  +   +  +       FS+ L  F      +      +    
Sbjct  81   LGSIIIVLMNLSVYILPIFSIGLIWFFVYRDTIYRVYYLSISER  124


>HAQ34615.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=196

 Score = 49.0 bits (113),  Expect = 0.001, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 11/36 (31%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M    CP C       ++   A+    RC  C    
Sbjct  28  MIL-SCPSCATRYRADATAFGAQGRKVRCASCSHVW  62


>HGS39381.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=486

 Score = 50.5 bits (117),  Expect = 0.001, Method: Composition-based stats.
 Identities = 9/35 (26%), Positives = 12/35 (34%), Gaps = 0/35 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
             V CP C         K+P   +  +CP C    
Sbjct  52  VKVTCPQCSTSYRVGDEKVPLGGAQIKCPRCSHQF  86


>HBM67623.1 hypothetical protein [Rhodobacteraceae bacterium]
Length=160

 Score = 48.2 bits (111),  Expect = 0.001, Method: Composition-based stats.
 Identities = 9/44 (20%), Positives = 16/44 (36%), Gaps = 0/44 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRT  46
            + CP C A+   P   +P      +C  C +T      ++   
Sbjct  2   LLACPICQAQYEVPEDAIPETGCEVQCSACGETWFQPHPQATFP  45


>VWX52028.1 conserved membrane hypothetical protein [Novosphingobium sp. 
9U]
Length=215

 Score = 49.0 bits (113),  Expect = 0.001, Method: Composition-based stats.
 Identities = 17/132 (13%), Positives = 39/132 (30%), Gaps = 5/132 (4%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
              +        +  ++F  + K    + R          S+  + +L  + +  G L L+
Sbjct  39   ISSLTSFVSIGVVYAIFRMLLKKKGFIDRE-----GSFRSYFGVGLLSGIAIIVGFLFLV  93

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            +PGL     +      +       ++AL  S        W I    V+     + +    
Sbjct  94   VPGLFLMARWSVASAFVITQGSKSIEALRASWQATRDCAWTIVLILVVTFGAYVAIFATF  153

Query  283  ARIPYVGEAANL  294
              +  +      
Sbjct  154  TIVAALAPVLLG  165


>RPH65632.1 tetratricopeptide repeat protein, partial [Myxococcaceae bacterium]
Length=890

 Score = 50.9 bits (118),  Expect = 0.001, Method: Composition-based stats.
 Identities = 8/32 (25%), Positives = 12/32 (38%), Gaps = 0/32 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
            V CP C    N    ++P   +  +C  C  
Sbjct  2   RVTCPSCQTAYNIDERRIPPGGAKLKCSTCQT  33


>WP_172814629.1 zinc-ribbon domain-containing protein, partial [Corallococcus 
exiguus]NRD57861.1 zinc-ribbon domain-containing protein [Corallococcus 
exiguus]
Length=216

 Score = 49.0 bits (113),  Expect = 0.001, Method: Composition-based stats.
 Identities = 9/35 (26%), Positives = 13/35 (37%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            V CP C    N    ++P   +  +C  C  T  
Sbjct  2   KVSCPSCQTNYNIDDRRIPPGGAKLKCARCQTTFP  36


>NMM43559.1 hypothetical protein [Rhodospirillaceae bacterium KN72]
Length=299

 Score = 50.2 bits (116),  Expect = 0.001, Method: Composition-based stats.
 Identities = 7/47 (15%), Positives = 12/47 (26%), Gaps = 1/47 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQ  47
           M    CP+C  +       +       +C  C       P  +    
Sbjct  1   MIL-TCPNCATKFRVKDDAIGLNGRKVKCRNCAHVWHAMPEGADDDM  46


>MBF0470824.1 hypothetical protein [Gammaproteobacteria bacterium]
Length=449

 Score = 50.5 bits (117),  Expect = 0.001, Method: Composition-based stats.
 Identities = 41/343 (12%), Positives = 82/343 (24%), Gaps = 21/343 (6%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             + L   +     + +       + L           L A           +        
Sbjct  94   PLQLFSTLFDPLTLLAMGGGFLLSLLFLFWFQGASFHLPAASLVSGERAGILAAMKSSLR  153

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG-LLFCVWFFFCQYVLAD  241
               D     S+      +      LI   +      LL      +   +       +   
Sbjct  154  RLPDFINAISLLALFTLLAQLAFTLIGSAIGGMLWWLLASFLFTIWLSLTLILAAPIAIL  213

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY--------------  287
            +N G L  L ++  L  G    + G F+LL ++++ L  L     +              
Sbjct  214  ENRGPLDILSRAWELAEGIRLRMLGHFILLGLVTILLFLLVMVPIWLLSGLLGGGLIIAL  273

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLT------AAIFGW  341
            +    +      +T  S  +    Y + +    G +    +                   
Sbjct  274  LSLITSFLLYFFITFQSLFFIESFYFEARVVKEGWRPGWSETPEESWILSSGEDDYSGEG  333

Query  342  MLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKL  401
                G   + L    L A  +      +Q  L +  +       S        S      
Sbjct  334  RGWRGWGELLLITLILVAINVGGYLLLVQPNLNSATEVNQFSFSSQTHTTPYTSPVVTPP  393

Query  402  LLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSD  444
            L         +G        L  D F+    +   W+K+ +  
Sbjct  394  LSQSLPTLQGDGSPVGVTARLVRDVFFEQQDSASFWIKVAVEG  436


>MBN35005.1 hypothetical protein [Rhodospirillaceae bacterium]
Length=331

 Score = 50.2 bits (116),  Expect = 0.001, Method: Composition-based stats.
 Identities = 10/40 (25%), Positives = 12/40 (30%), Gaps = 0/40 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
           + CP CG     P   +  K    RC  C       P   
Sbjct  3   LTCPSCGTRFRLPDGAVGDKGRKLRCASCKHVWFQAPETH  42


>PKK92391.1 hypothetical protein CVV62_00570 [Tenericutes bacterium HGW-Tenericutes-7]
Length=177

 Score = 48.6 bits (112),  Expect = 0.001, Method: Composition-based stats.
 Identities = 21/113 (19%), Positives = 42/113 (37%), Gaps = 8/113 (7%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV-LADDNIGGLQALEKSRLLVSGHWWA  263
            L  +L+ +      LL IIPGL+    +    Y+   D ++   +A+  S  L  G+   
Sbjct  33   LTGLLVFIYTFLWLLLFIIPGLIKAYAYSMWLYLLDKDPSLLANEAITLSNKLTKGYKLR  92

Query  264  IFGRFVLLLVISLTLSFLTARI-------PYVGEAANLAFSLLLTPFSFLYYY  309
            +F   +  + I + L      +       P+      +   ++   F +  Y 
Sbjct  93   LFLMDLYFVFIYVALMIFFYILFRSSSMNPFFFVLIFILLLVVFIGFIYPKYM  145


>WP_073630711.1 hypothetical protein [Scytonema sp. HK-05]OKH58728.1 hypothetical 
protein NIES2130_12670 [Scytonema sp. HK-05]BAY43396.1 
hypothetical protein SAMD00079811_09760 [Scytonema sp. HK-05]
Length=235

 Score = 49.4 bits (114),  Expect = 0.001, Method: Composition-based stats.
 Identities = 25/164 (15%), Positives = 59/164 (36%), Gaps = 9/164 (5%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             I +L I  +    F   L+   + L      +     L  ++   +         +  +
Sbjct  16   LISVLPIYKSLLIFFIPSLILSLSKLVIPKILFSIIDELYFLSIGAVFFEAALFFCYKKL  75

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
             + ++ + +S++  + +      L I L  ++    L   +  +   V            
Sbjct  76   NQEEITISQSLQKAIENFPKILFLRICLSPLLLILFLFPRLYFVFSLV---------IIK  126

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            ++      ++   L  G+ W IF  F+ L++IS  L F+++ I 
Sbjct  127  DLSTRDVFKRCWQLTKGYGWQIFWNFLTLILISNVLDFISSLIS  170


>WP_176874803.1 hypothetical protein [Parasphingopyxis sp. CP4]QLC21745.1 hypothetical 
protein HFP51_05870 [Parasphingopyxis sp. CP4]
Length=232

 Score = 49.4 bits (114),  Expect = 0.001, Method: Composition-based stats.
 Identities = 23/157 (15%), Positives = 48/157 (31%), Gaps = 0/157 (0%)

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG  215
             +++       +L    +    M       D          +          IL  + + 
Sbjct  52   IFSVQNLFGFVVLYLQIYAIVWMARRRGLLDNRSHDDGSPTIGAYFRVVGQSILASIAII  111

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
             G  LL++PG+     +F    +L  +N   +    +SR LV+ ++W +     L L   
Sbjct  112  FGLALLVVPGIWLMTIWFVILPILLIENEPVMDCFGRSRELVTPNFWKVLVLLALWLGAY  171

Query  276  LTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
            +  +          E  +   +  L         L +
Sbjct  172  VGAAGALVFFAPFPEEYSFTANFPLNFLVQAVTTLGW  208


>WP_155875301.1 zinc-ribbon domain-containing protein [Desulfuromonas sp. AOP6]BCA78972.1 
hypothetical protein AOP6_0759 [Desulfuromonas 
sp. AOP6]
Length=487

 Score = 50.5 bits (117),  Expect = 0.001, Method: Composition-based stats.
 Identities = 13/66 (20%), Positives = 20/66 (30%), Gaps = 1/66 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  ++C  C A       K+ A+ +  RC +C       P ES+ T              
Sbjct  1   MI-IQCSECQARFKLADEKIKAEGTKVRCSKCRHIFTVYPPESEETSMPAETEESFTASP  59

Query  61  QRRIPS  66
                 
Sbjct  60  PEPEQP  65


>WP_102908277.1 hypothetical protein [Streptomyces sp. 13K301]PNG22737.1 hypothetical 
protein C1J00_07715 [Streptomyces sp. 13K301]
Length=312

 Score = 50.2 bits (116),  Expect = 0.001, Method: Composition-based stats.
 Identities = 22/119 (18%), Positives = 37/119 (31%), Gaps = 0/119 (0%)

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
                      +    V LF  + + L  +       ++    V    LL +       V 
Sbjct  110  VLGVTLFMYLLIAIPVALFFLVWVSLIALLVAAESPLMAPGFVFLLGLLALPMVTWLVVS  169

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            F F       ++ G + AL++S  LV G WW  FG   L   +     ++         
Sbjct  170  FSFAPAAAVLESAGPITALKRSFRLVRGAWWRTFGILSLTWGMVGVTGWIVQIPLLFAG  228


>MBI5639594.1 zinc-ribbon domain-containing protein [Nitrospirae bacterium]
Length=294

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 9/32 (28%), Positives = 9/32 (28%), Gaps = 0/32 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
              CP C      P  KL       RC  C  
Sbjct  2   KFTCPKCQTRLVLPDDKLKPGGMKFRCSRCGA  33


>WP_171778011.1 DUF975 family protein [Bacillus megaterium]QJX80027.1 DUF975 
family protein [Bacillus megaterium]
Length=261

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 26/219 (12%), Positives = 62/219 (28%), Gaps = 42/219 (19%)

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
             L        +F  ++L  ++W+   +++         + +I   L      +      T
Sbjct  20   PLYKTAGLYLLFYIIILALSSWIYDYSEDLSDLFQTIGLFFIFPILFANIAIIASSDKGT  79

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP---------------------  224
               +               L  ++  L V    L +                        
Sbjct  80   GERISGFHFYKNNKFVKTLLTRLIPNLFVFLWILPICFLLGIGIAILYEAKTTFFVAIVV  139

Query  225  ---------GLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
                      +   + +    Y++A++ +I    AL++S+ L  G+   +F   +  +  
Sbjct  140  LLVLALIVTSICKSLQYSLTSYIVAENTDINPYDALKESKRLTKGNLGKLFLLNLSFIGW  199

Query  275  SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
             + + F    I           S+ L P+     +  Y 
Sbjct  200  VILIPFTLTLI-----------SVYLIPYYNAAIFEFYM  227


>NLE48147.1 tetratricopeptide repeat protein [Sandaracinaceae bacterium]
Length=1258

 Score = 50.9 bits (118),  Expect = 0.001, Method: Composition-based stats.
 Identities = 12/36 (33%), Positives = 14/36 (39%), Gaps = 0/36 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V CP C A      ++LP      RCP C    
Sbjct  1   MLKVECPSCDAPYELDPARLPDAGMRMRCPACAAIF  36


>MBF0613188.1 zinc-ribbon domain-containing protein [Magnetococcales bacterium]NGZ27751.1 
hypothetical protein [Magnetococcales bacterium]
Length=276

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 9/39 (23%), Positives = 13/39 (33%), Gaps = 0/39 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  + CP+C  + N     L     + RC  C       
Sbjct  1   MVIIECPNCSTKFNVDPQVLRPAGKNLRCSRCKTMFFQP  39


>NLE42758.1 hypothetical protein [Lentisphaerae bacterium]
Length=252

 Score = 49.4 bits (114),  Expect = 0.001, Method: Composition-based stats.
 Identities = 25/202 (12%), Positives = 63/202 (31%), Gaps = 6/202 (3%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
               +  + + +L  ++     F  +    A +  P        I    +          +
Sbjct  40   PMTFVPVVLAVLLWMVTAWIPFINIGTTIALFTLPVWLADGRKISPIEIFLAEHRAKLES  99

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLL-----ILLILVVGGGSLLLIIPGLLFCV  230
              +   +          +  G+     F   +         + +       + P  +   
Sbjct  100  FLLLSAVFTAVSLAAWGLFSGMGVAPRFCGTMSRCSAFWFQMRLLLAMAAALTPLAMVGT  159

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
             +    ++L +  +G L+A++ S  L  G    I   F L + + + L  + + +PYV  
Sbjct  160  AWSMAYFLLLEKGLGPLEAIQASAELTRGSRLKILVVFGLPVALGIVLCMIFSFVPYVRW  219

Query  291  AANLA-FSLLLTPFSFLYYYLI  311
               LA    +++  + L   + 
Sbjct  220  ILMLATLLAMVSVLAKLAGTVY  241


>NIQ37400.1 DUF3426 domain-containing protein [Proteobacteria bacterium]
Length=391

 Score = 50.2 bits (116),  Expect = 0.001, Method: Composition-based stats.
 Identities = 12/65 (18%), Positives = 19/65 (29%), Gaps = 1/65 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V C  C A      S++P++    RC +C    +  P       +        H   
Sbjct  1   MI-VTCDSCNANFKVDDSRIPSEGIKVRCSKCKHVFMVTPEVPDDFLSEFKDFENFHRDH  59

Query  61  QRRIP  65
                
Sbjct  60  IETEH  64


>WP_182297868.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Sandaracinobacter sp. M6]QMW24045.1 glycerophosphoryl 
diester phosphodiesterase membrane domain-containing 
protein [Sandaracinobacter sp. M6]
Length=290

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 21/223 (9%), Positives = 64/223 (29%), Gaps = 13/223 (6%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
                G       ++ ++L    +    + +    L    +     +L        + L+ 
Sbjct  57   MMTEGAPPWLFPMIAVILLINLLAIVPITRVVLGLVAPQETVGAMLLDTLSRLPRMILAG  116

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +   +        + +  ++ L +  + +   +      V       ++   L       
Sbjct  117  LWLVLLYLGVTLLLSIPFAIILTVVAMAAPKAVTTTGGPVTVVLIAAMLAVLLYLGARVA  176

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI--------  285
                VL  +N+G    + ++  L  G    I    + L   ++ +S L   +        
Sbjct  177  VWFPVLIAENLGARATVRRAWSLTRGQASRILLILLALTFAAILVSALFFALGSAVGVVG  236

Query  286  -----PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                   +G       +   +    L++Y++++ +        
Sbjct  237  QTAGGASIGLLLFQLVNSGFSALFGLFFYILFALMYRRLAQLD  279


>OWJ84013.1 hypothetical protein CDV51_14660 [Haematobacter massiliensis]
Length=1199

 Score = 50.9 bits (118),  Expect = 0.001, Method: Composition-based stats.
 Identities = 9/61 (15%), Positives = 13/61 (21%), Gaps = 0/61 (0%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              + CP C A        +P      +C  C             T       + P     
Sbjct  194  VKIECPGCKAIYEVGGELIPPDGRDMQCSNCNHEWFHPSDTCDETDPPQTNPSAPEQPRV  253

Query  62   R  62
             
Sbjct  254  H  254


>EAH0363985.1 DUF975 family protein [Listeria monocytogenes]
Length=217

 Score = 49.0 bits (113),  Expect = 0.001, Method: Composition-based stats.
 Identities = 29/228 (13%), Positives = 54/228 (24%), Gaps = 14/228 (6%)

Query  229  CVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
               +    ++L D+ NI  L A+ +SR +++GH   +FG  +  L+           IP 
Sbjct  2    TYSYSQTFFILRDNPNISALDAITESRHMMNGHKGRLFGLSLTFLLWY--------LIPL  53

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGL  347
                A                    +D                 + + A+    + I   
Sbjct  54   AVAIAGTVIVAGGMA-----TTSYTADPAEVLSALAAGATFGGLVLILASWLITLGISLY  108

Query  348  LLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQR  407
            +   L          L A  +      T   +         E          +    +  
Sbjct  109  VYPYLITSIAVFYDDLYAATEGTFTEETVIVEEEVTPFGTTEADPFAEDTHPEGFGPEAT  168

Query  408  KTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGS  455
            K      +   P            + P      E  + P         
Sbjct  169  KEPETPVVPEAPEAPETPETPEVPETPEAPETPEAPETPEAPKDNNEP  216


>WP_083768574.1 zinc-ribbon domain-containing protein [Geobacter lovleyi]
Length=58

 Score = 45.1 bits (103),  Expect = 0.001, Method: Composition-based stats.
 Identities = 15/38 (39%), Positives = 22/38 (58%), Gaps = 0/38 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  V CPHC   R+   S+LP + + A CP+C ++  F
Sbjct  3   MIRVSCPHCSFSRDVLDSQLPDQPTKATCPKCKKSFDF  40


>MBI3506340.1 zinc-ribbon domain-containing protein [Proteobacteria bacterium]
Length=298

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 20/211 (9%), Positives = 39/211 (18%), Gaps = 3/211 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES--QRTQTTDNIATCPHC  58
            M    CP C      P   +     S RC  C  + +  P ++  +        A     
Sbjct  1    MIL-ACPRCATRFRVPDEAMGDAGRSVRCGSCGNSWVQRPRQAILEVNPRFPKTAKRMRA  59

Query  59   GLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRG  118
            G    +                        P           R                 
Sbjct  60   GADDGMEMPAAPRGRTMAEPTYAPPPPPPPPPPMPPTPPPRPRPPMPEPEMPDPEAELPT  119

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
                      +  A   + S          +  +++    I     A     +     + 
Sbjct  120  KSFAESMQAALAAADEAVPSKPASDKEGTEHRPHRSIAAVIGWLLFALTTTLIGVAVFAQ  179

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLIL  209
               + +          L          L ++
Sbjct  180  GEIMARFPEARPIYQALHFPIPQPGEGLEVV  210


>KPL81400.1 hypothetical protein SE18_22410 [Herpetosiphon geysericola]
Length=293

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 27/204 (13%), Positives = 66/204 (32%), Gaps = 26/204 (13%)

Query  138  SALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGL  197
              +L  P   + P           +      + ++    ++  +  +        +   L
Sbjct  72   MWVLEYPRADIAPYIWGLLGVGFFSIFIMRNVAMAASVIAVGEHYRQAKPRWLNVVLSSL  131

Query  198  RHVGSFTLLLILLILVVGGGSLL----------LIIPGLLFCVWFFFCQYVLADDNIGGL  247
            RH+ S      L +L +                 +     F V       ++  + I   
Sbjct  132  RHLPSLFAWGCLCVLALFIALFFALAQLPGLVLGLGLLWFFGVRLSLVPQIIVAERINVF  191

Query  248  QALEKSRLLVSGHWWAIFGRFV-LLLVISLTLSFLTARIPYVGEAANL------------  294
            +A+  S +L  G +  I   ++ L++++ +  S++T  I  +  A               
Sbjct  192  RAIHHSWVLTKGAFGRISNIWITLVVLLGVLASYVTTIISAIAMALFGETSALTIALTQG  251

Query  295  ---AFSLLLTPFSFLYYYLIYSDL  315
                 S++  P +++ + L+Y D 
Sbjct  252  LSTVTSIMTLPMAYIGFTLLYYDQ  275


>WP_130630585.1 hypothetical protein [Janibacter limosus]QBF47395.1 hypothetical 
protein EXU32_14740 [Janibacter limosus]
Length=133

 Score = 47.5 bits (109),  Expect = 0.001, Method: Composition-based stats.
 Identities = 16/102 (16%), Positives = 33/102 (32%), Gaps = 1/102 (1%)

Query  200  VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
             G   L  +++ +    G +L IIPG++     ++  Y +  +      AL  S   V  
Sbjct  24   WGQALLAGLIMWVATTIGFVLCIIPGIIVLFLLYYTNYAV-LEGRSATDALGASFTFVKD  82

Query  260  HWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            H        ++ + +S+           V        +    
Sbjct  83   HLGENLLLMLVAIGLSILAICTCGIGFLVVTPVMSIATAYTW  124


>HGA37708.1 hypothetical protein [Candidatus Aenigmarchaeota archaeon]
Length=252

 Score = 49.4 bits (114),  Expect = 0.001, Method: Composition-based stats.
 Identities = 29/226 (13%), Positives = 74/226 (33%), Gaps = 27/226 (12%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
            +F         I L       + +FS++L +    L+  + +   A+ +      L+   
Sbjct  11   IFSFLRKHPKVILLAFTFSILSTLFSSMLAQSIKTLDLVSISILVALGIILFFVSLIVAG  70

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI----------  222
                +++  + K +V + ++++          L  +L  L+ G     +           
Sbjct  71   VYQITVYQAVKKGEVEIMKAVQNSFGIFFKILLTSLLTFLIFGVPVFFVFLLALLLLKSS  130

Query  223  ---------------IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
                                  +  F    +L  +N   +++++ S  +   +  +I   
Sbjct  131  LIIFIVILVIVLCILFILFYVSLRLFLTFPILVIENKDPIESIKSSWKITKDNSISILAF  190

Query  268  FVLLLVISLTLSFLTARIPYVGEAANL--AFSLLLTPFSFLYYYLI  311
             +LL +I   +SF  + +  V E   +     +     S +   L+
Sbjct  191  LLLLGLIIGVISFPFSILKVVFEILKIDQISLIFEVIISTISGALL  236


>WP_199539243.1 hypothetical protein [Desertihabitans brevis]
Length=514

 Score = 50.5 bits (117),  Expect = 0.001, Method: Composition-based stats.
 Identities = 28/212 (13%), Positives = 58/212 (27%), Gaps = 37/212 (17%)

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI----SLTLSFLTARIPY---------  287
             + +GG+ A++++  L   ++W   G  ++  +I       ++F+   I +         
Sbjct  212  VERLGGMTAVKRAWRLTDRNFWRTLGYTIVAQLIPQAVIYVVTFVGLAIGFGVMAGGLAA  271

Query  288  -----------------------VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
                                   V  A  L  +L + PF   +  +   D  A  +  + 
Sbjct  272  AGDSLSSGSPDAATVAPIIIGVVVMYALPLVAALFVAPFLNCFSTVYSMD-LARRKATEL  330

Query  325  PPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLN  384
            P          A  +  +        S  +               Q   G Q  Q     
Sbjct  331  PSPAPGTGGWVAQPYPDLAQQWGGYPSAQQWGQQDPYGGQQSGQQQAYGGQQWNQQQAYG  390

Query  385  RSLPEEPQRLSSADYKLLLSKQRKTTSEGGLS  416
               P   Q+ + A  +    +      + G  
Sbjct  391  AQQPWGQQQDAPAGQQWGQQQDPPADRQWGQD  422


>MBO85307.1 hypothetical protein [Deltaproteobacteria bacterium]HCH61675.1 
hypothetical protein [Deltaproteobacteria bacterium]
Length=274

 Score = 49.4 bits (114),  Expect = 0.001, Method: Composition-based stats.
 Identities = 32/195 (16%), Positives = 58/195 (30%), Gaps = 6/195 (3%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            + LF       + +            F+ L     T           A L A        
Sbjct  68   FFLFSWIYGASMIVVTALTFALAGLNFTDLFDPMLTTQITAELTSNGAYLAAASLLTAPV  127

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL----VVGGGSLLLIIPGL  226
             + +  S++    +   G        L      + +L++L L    ++           L
Sbjct  128  NALLYLSVWNVGLRLASGQEVRFADTLPARAMGSAVLVMLCLAPLGMLPILHPTAGYLSL  187

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL--TLSFLTAR  284
               +   +   +L D +IG  +AL  S  L   +  AI G  V LLV      ++     
Sbjct  188  PVWMLTMWSLPLLLDRDIGVGEALRTSFHLTRHNILAILGLGVALLVGYFAAVITCGIGL  247

Query  285  IPYVGEAANLAFSLL  299
            + ++  A     S  
Sbjct  248  VWFLPVATGTLGSAW  262


>WP_156260253.1 zinc-ribbon domain-containing protein [Oceanobacillus sp. HTM 
045]
Length=315

 Score = 49.8 bits (115),  Expect = 0.001, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 14/34 (41%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP CG + +  +  L     + RC +C    
Sbjct  2   LISCPSCGTQYDVAAEALGTAGRTVRCEKCGHKW  35


>MAJ22042.1 hypothetical protein [Marinovum sp.]OUU07406.1 hypothetical protein 
CBB98_11970 [Rhodobacteraceae bacterium TMED38]
Length=225

 Score = 49.0 bits (113),  Expect = 0.001, Method: Composition-based stats.
 Identities = 10/35 (29%), Positives = 18/35 (51%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            ++CP+C A+   P+  +PA     +C  C +T  
Sbjct  2   LIKCPNCNAKYEVPNDIIPATGRDVQCSNCSKTWF  36


>HGQ55783.1 hypothetical protein [Candidatus Aenigmarchaeota archaeon]
Length=238

 Score = 49.0 bits (113),  Expect = 0.001, Method: Composition-based stats.
 Identities = 21/207 (10%), Positives = 60/207 (29%), Gaps = 10/207 (5%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             + +         +     L                 +   ++   +G+     +    +
Sbjct  30   ILIVSMNYCMQKVLPFVTSLPYVFGYVYTLLLLILLGISYIISIFFVGVMIHISNKKASL  89

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF----FFCQYV  238
             K+   + +     +     + L  ++ IL       L     L+   +F     F  Y 
Sbjct  90   SKSFEFIEKRFGALVLGNLLYLLFFVVGILAYIILEKLEFFIFLIIETFFVTKIIFFPYA  149

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL------TARIPYVGEAA  292
            +  +N    ++ ++S  +   HWW  F   ++ + +   ++           +  +    
Sbjct  150  IVIENKKASESFKRSYEITENHWWDTFALILIFIAVPFLITSAYTYFSGLYFLNEIYFIL  209

Query  293  NLAFSLLLTPFSFLYYYLIYSDLKANY  319
                +LL+ P+    +   Y   +   
Sbjct  210  FFFVNLLVIPWQVASFIHAYKSFRDVK  236


>PZP85762.1 hypothetical protein DI582_04970 [Azospirillum brasilense]
Length=220

 Score = 49.0 bits (113),  Expect = 0.001, Method: Composition-based stats.
 Identities = 8/41 (20%), Positives = 14/41 (34%), Gaps = 1/41 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
           M    CP C A      +++P    + +C +C         
Sbjct  1   MIL-DCPGCHARFLVADAQIPPAGRTVKCGKCAHHWHVQHP  40


>OYV43294.1 hypothetical protein B7Z75_08885 [Acidocella sp. 20-57-95]OYV58184.1 
hypothetical protein B7Z71_10905 [Acidocella sp. 21-58-7]
Length=260

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 10/34 (29%), Positives = 15/34 (44%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP C  E + P +KL  +    RC +C    
Sbjct  2   RISCPGCQTEYDVPDAKLAGRTRRMRCAQCGHEW  35


>MRG70484.1 hypothetical protein [Alphaproteobacteria bacterium HT1-32]
Length=314

 Score = 49.8 bits (115),  Expect = 0.002, Method: Composition-based stats.
 Identities = 8/35 (23%), Positives = 13/35 (37%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            V CP+C A+ N     +       +C  C  +  
Sbjct  2   LVSCPNCAAKYNIRDELIGPAGRKVKCARCEHSWH  36


>AWX92788.1 hypothetical protein DPM13_05265 [Paracoccus mutanolyticus]
Length=311

 Score = 49.8 bits (115),  Expect = 0.002, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 14/37 (38%), Gaps = 0/37 (0%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
             + CP C AE   P + +PA      C  C    +  
Sbjct  84   RLTCPRCRAEYEIPDTAIPAAGREVECSSCAHVWLQM  120


>TMA38529.1 hypothetical protein E6J82_17630 [Deltaproteobacteria bacterium]
Length=195

 Score = 48.6 bits (112),  Expect = 0.002, Method: Composition-based stats.
 Identities = 10/64 (16%), Positives = 17/64 (27%), Gaps = 0/64 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            VRC  C A      +++  +  S RC +C       P  +   +               
Sbjct  2   EVRCDKCQARYRVDDARIGPQGLSMRCGKCQNVFRVMPPGAAAAEPPQKPVPPAPRCASS  61

Query  63  RIPS  66
               
Sbjct  62  SEEH  65


>RME48588.1 hypothetical protein D6795_12775 [Deltaproteobacteria bacterium]
Length=406

 Score = 50.2 bits (116),  Expect = 0.002, Method: Composition-based stats.
 Identities = 50/363 (14%), Positives = 92/363 (25%), Gaps = 59/363 (16%)

Query  4    VRCPHCGAER-NTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP CG      P +         RC  C           +  +  D            
Sbjct  62   LTCPRCGGRLQEIPFAAQGHL-KLDRCTACHGIWFDRDEIIEFDRLADRQREFEGIEAAI  120

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                                     +            RS   ++ +S  L+        
Sbjct  121  DA----------------------YRDAEYPAFRPLPSRSAWGIVRESIHLYQENFLAFF  158

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYIL-LGLSWMTGSMFIY  181
             I LL ++   A      L++               +       +  +    +T ++   
Sbjct  159  TILLLPVIAIVAVGVVQALVEGVDSRFQAPALEGRLLAGFVSLLLGQIATGALTHAVSER  218

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII------------------  223
                 V +  + ++    +     + +   LV   G  ++ I                  
Sbjct  219  YVGRPVRILPAYRIAFVRLFPLLFVGLSFGLVTFLGIGIMGICVGVLRPILIARGMSSLL  278

Query  224  ---------PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
                       L F + +     V+  +  G L A+E+S  LV G      G  +LL  +
Sbjct  279  FLLLLPGVGFTLWFALRWLLSAPVIVLEGRGPLAAMERSADLVRGFRCHALGVLLLLGFV  338

Query  275  SLT----LSFLTARI---PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
                   L  L   +     V     L    +L P  F+   L+Y DL+          I
Sbjct  339  LSVAPHSLGALLTFLQQNTLVAGVGTLLLEAILLPLFFVALVLLYYDLRVRKEDFGVEEI  398

Query  328  KRQ  330
            +  
Sbjct  399  RGW  401


>MBL91501.1 hypothetical protein [Myxococcales bacterium]
Length=1310

 Score = 50.5 bits (117),  Expect = 0.002, Method: Composition-based stats.
 Identities = 11/34 (32%), Positives = 20/34 (59%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP C A+ N   S++P + +S +CP+C  + 
Sbjct  2   RIVCPSCSAKYNLDDSRVPPQGASIKCPKCKHSF  35


>HBO69709.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=174

 Score = 48.2 bits (111),  Expect = 0.002, Method: Composition-based stats.
 Identities = 13/105 (12%), Positives = 27/105 (26%), Gaps = 1/105 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  V C  C        S++  + +  RC +C + +I       R              L
Sbjct  1    MI-VECQACHTRFRLDESRIQGRGARMRCRKCGEAIIVMKDPGDRPAPPPKNLFDLRSML  59

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQ  105
            ++         +      +  + +   +PE               
Sbjct  60   RQPEGRTARPKEHPPEAEKPPSHAAEWRPEAPEPEIPPAHADXPW  104


>HAS10806.1 hypothetical protein [Acidimicrobiaceae bacterium]
Length=337

 Score = 49.8 bits (115),  Expect = 0.002, Method: Composition-based stats.
 Identities = 30/175 (17%), Positives = 58/175 (33%), Gaps = 13/175 (7%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                 + L    +   +  +    DV  + +  + LR   +  L  IL+ ++ G G L  
Sbjct  152  VGTVSLALLAGALGVLVDGWYRGRDVSAWEAAGVALRRSWALVLGTILVHVLEGIGLLAF  211

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
             +   L           +  + +G L A+ +S  L             L+ ++ L + F 
Sbjct  212  GVGAYLAMALCHVVSPAVTVEGLGPLAAIRRSVQLTRRRIGPALTVPGLVGLVGLLVGFG  271

Query  282  TARIP-------------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
               +P              V  A  +A  L++ PF+     L + DL+       
Sbjct  272  FQSVPELATTIVPDDWDWLVRAAGQIASQLVVVPFTAGVAVLFHLDLRIRLEAHD  326


>MBC8206348.1 hypothetical protein [Kiritimatiellaeota bacterium]
Length=252

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 29/231 (13%), Positives = 66/231 (29%), Gaps = 29/231 (13%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
             + SW++F      +L   L+  +L     +    + P T         +  +       
Sbjct  16   FSVSWKVFYSNIKYILVFSLVAGLLVRVADYLLFSVLPETDNLMVEAYSKNILGSLFGYL  75

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL--------------  212
              +    +       +      LF S++          L+  +L+               
Sbjct  76   FGVFSITIVERFVNGLTVCWTVLFESLRKYFWLAFWVHLIESVLMFLSRAPRIVSWGADS  135

Query  213  ---VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV  269
                +       +   +   V F+F    +   +  G +A+  S  +V G WW +FG  +
Sbjct  136  SPSWLVFLEFAGMFAFIAMTVCFYFTMQAVVLRDQRGFKAICYSCRVVKGRWWLLFGYSI  195

Query  270  LLLVISLTLSFLTAR------------IPYVGEAANLAFSLLLTPFSFLYY  308
            +L ++     F                + Y     +      +  F  + +
Sbjct  196  ILSIVIGIFLFCFTFPLSRVLNCPVSDVAYWAYPVSSVLGSYIMVFFTVIF  246


>WP_198926574.1 efflux RND transporter permease subunit, partial [Acidithiobacillus 
thiooxidans]
Length=655

 Score = 50.5 bits (117),  Expect = 0.002, Method: Composition-based stats.
 Identities = 30/366 (8%), Positives = 81/366 (22%), Gaps = 4/366 (1%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
            + +  + S                  +  +    A  ++ +  +    F       + + 
Sbjct  288  IDWISVPSGNHATIKWGGAWTVTYKTFRDMGIAFAVAIVLIYMLVVWEFGNFVIPAIIMV  347

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
                  +  +    L           G + L    +   +           + +  + AL
Sbjct  348  PIPLTLIGILPGHWLFGASFTATSMIGFIALAGIIVRNSILLVDYSQQRVAEGMPVMDAL  407

Query  251  EKSRLLVSGHWWAIFGRFVLLL---VISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLY  307
             ++          I    ++L    +I+  +          G   +   +LL+ P   + 
Sbjct  408  IEACA-TRTRPIVITALALMLGSAEIITSPIFRGMGISLLFGVMISTILTLLVIPLGCVS  466

Query  308  YYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGK  367
                +     +    Q            +          L    +    L  +       
Sbjct  467  GRAAFCPAGMDLGDTQPNNPPPFPFAPASMQVSLSPRDSLARSDIQSAALVQDTPPKDAG  526

Query  368  DIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRF  427
             +   L    Q      R++ +    L    +  L+ +               +      
Sbjct  527  PLTMTLEMIGQDIRAFFRAITKILLALLGQLWAWLMKRPAAHNKPEAPGPQSPSGPVSPP  586

Query  428  WADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVG  487
               D +    +    S   + S + K     + + V    A          +    H   
Sbjct  587  QGPDGSSQGPVNPHASTSESTSGSAKSPVSGQSNPVATKAAETTAPPSPRAKARGIHLRS  646

Query  488  INQTDE  493
                 +
Sbjct  647  SGSKPK  652


>WP_117591346.1 hypothetical protein [Haloprofundus halophilus]
Length=259

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 20/134 (15%), Positives = 41/134 (31%), Gaps = 2/134 (1%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
                 +         + G + +            +   L  +    +   LL LV     
Sbjct  104  FTTLGLELTAFLAVGVAGWLTLSRVLGGRARSAQLFSYLGLLALLHVAFRLLGLVGTVSG  163

Query  219  LLLIIPGLLFCVWFF--FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
             + ++  +     F   F   VL         A  +S  L  GH W +FG  +++ + + 
Sbjct  164  PIGLLLFVGTLAVFVRVFLVPVLVVAGEDISTAFGRSARLTRGHRWTVFGLVLVVGLATF  223

Query  277  TLSFLTARIPYVGE  290
             L  ++   P +  
Sbjct  224  ALGSVSPVGPVLSA  237


>RYZ02686.1 hypothetical protein EOO73_30980, partial [Myxococcales bacterium]
Length=53

 Score = 44.8 bits (102),  Expect = 0.002, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 15/34 (44%), Gaps = 1/34 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M  + C  C A+ + P +K+  +K    C  C  
Sbjct  1   MI-IVCDSCQAQYSVPDAKVRGRKVRVTCKHCGF  33


>TMK30199.1 hypothetical protein E6G69_10350, partial [Alphaproteobacteria 
bacterium]
Length=78

 Score = 45.5 bits (104),  Expect = 0.002, Method: Composition-based stats.
 Identities = 9/44 (20%), Positives = 14/44 (32%), Gaps = 1/44 (2%)

Query  3   TVRCPHCGAERNTPSSKLP-AKKSSARCPECCQTLIFDPAESQR  45
            + CP C        + +  A     RC  C     F P+  + 
Sbjct  2   LLTCPSCETRYQVDEAAIDRAAGRQVRCANCGYLWHFAPSLEEH  45


>WP_171183858.1 zinc ribbon domain-containing protein [Alienimonas chondri]NNJ24658.1 
hypothetical protein [Alienimonas chondri]
Length=391

 Score = 50.2 bits (116),  Expect = 0.002, Method: Composition-based stats.
 Identities = 37/300 (12%), Positives = 77/300 (26%), Gaps = 8/300 (3%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            V CP CGA+    +          +     +         +R  T     T         
Sbjct  72   VNCPMCGADNPRGAKTCSVCGERLQTETPPEVDPRGTRFRRRVGTVSLGETFSRGWELCT  131

Query  64   IPSDR-------LEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCR  116
                        + +      C        +             R+  +       +   
Sbjct  132  EHFGILLGGAALVGVLLMVAGCFIGGPLGVVAQVANGNFGQPPRRASVEDALVMTGVNQI  191

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                   I  L          +      A   +  N+   WA    T     + L+    
Sbjct  192  AQVLTTAITALLFAGYVRLRLNLARRGRAKIDDLFNEREVWASAALTSIVYNVLLAMPGV  251

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             ++      D G+       +   G   +++     V+   SL++ +  ++  V+F+   
Sbjct  252  ILWAIASAGDGGMRAIGFDPIMQPGIPGMVMFDFDPVLAPISLVVFLVQMVIGVFFWPYL  311

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR-IPYVGEAANLA  295
            ++  D  + GL  L  +     G   A+ G  ++  +I    S      +   G      
Sbjct  312  FLCVDYKMSGLTPLSAAWEATKGSRLALLGLTIIQGIIMFVASLPCGLGLLLAGPYICAL  371


>PKL14862.1 hypothetical protein CVV50_02165 [Spirochaetae bacterium HGW-Spirochaetae-6]
Length=253

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 27/207 (13%), Positives = 67/207 (32%), Gaps = 8/207 (4%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
             R +    + +  +      +  +++ +       +         L       +  + + 
Sbjct  28   WRKYHTSLLMVFFLYFVPLLVILSVVFRHELAYTGEQMGLLQYHALVLAFLQKVFSAALL  87

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
              +  +  K  +      +     +    + + +    +  G +L I+PG+    +F F 
Sbjct  88   LFVIAFFEKKIIRPDIFFRRMFYLLIPLLMTISMNFFFILSGLMLFIVPGIFAAFFFIFS  147

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY--------  287
              ++  +NI G  AL +S  LV   W  +F    L+ + S  + F    I          
Sbjct  148  DLLVFHENIFGFWALRESFRLVKSFWLRVFSITFLVEIFSQGMEFGLIIISGQLNWPRPE  207

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYSD  314
            +           +  F FL + + + +
Sbjct  208  IQFFVTYLLIYFIKFFGFLLFAVFFYN  234


>NJK90111.1 hypothetical protein [Myxococcales bacterium]
Length=152

 Score = 47.5 bits (109),  Expect = 0.002, Method: Composition-based stats.
 Identities = 10/38 (26%), Positives = 15/38 (39%), Gaps = 0/38 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
             +C  CGA+      KL    +  RC +C   +   P
Sbjct  2   KFQCEQCGAKFLIADEKLGPAGARVRCKKCQHLMHIPP  39


>NBR40721.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=139

 Score = 47.5 bits (109),  Expect = 0.002, Method: Composition-based stats.
 Identities = 8/41 (20%), Positives = 12/41 (29%), Gaps = 0/41 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
            + C  C        S + A     RC  C      +P + 
Sbjct  2   LLTCEQCQTIFRIDDSAIAATGQQVRCSVCQHVWHVEPHQP  42


>WP_172472239.1 DUF975 family protein [[Clostridium] cocleatum]GFI40806.1 hypothetical 
protein IMSAGC017_00843 [[Clostridium] cocleatum]
Length=281

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 28/182 (15%), Positives = 56/182 (31%), Gaps = 7/182 (4%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
              +             L                 I L       L +     S    +  
Sbjct  78   ISVEHEGLAGLKRFKELFFTYFLQTAFLMVIMLLICLVLFLIAKLVIDESVFSNLGMLFS  137

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
                    +   L+          +  +VV G ++++++  +   + F    YVL  + I
Sbjct  138  QAGIYTNDVTAYLQDPAFIQAAASISGIVVLGLTVMVVVGVMYTLI-FALTPYVLEKNKI  196

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT------LSFLTARIPYVGEAANLAFSL  298
             G++A+  S  L+ G+   +F  ++  L   +       +  +   IP V E    A S+
Sbjct  197  YGVKAMSYSAHLMKGYKGTLFVLYLSYLGWYILTIVITAVVQVFLPIPLVIEILMSALSV  256

Query  299  LL  300
             L
Sbjct  257  YL  258


>MBG77668.1 hypothetical protein [Alphaproteobacteria bacterium]HCQ71146.1 
hypothetical protein [Rhodospirillaceae bacterium]
Length=335

 Score = 49.8 bits (115),  Expect = 0.002, Method: Composition-based stats.
 Identities = 11/41 (27%), Positives = 16/41 (39%), Gaps = 1/41 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
           M    CP C  + + P++ L  +    RC  C  T    P 
Sbjct  1   MIL-TCPQCKTKFSVPTTALGEEGRKVRCTLCEHTWFQTPN  40


>PWU13708.1 hypothetical protein C5B50_18770 [Verrucomicrobia bacterium]
Length=585

 Score = 50.2 bits (116),  Expect = 0.002, Method: Composition-based stats.
 Identities = 20/150 (13%), Positives = 51/150 (34%), Gaps = 0/150 (0%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            +     ++   +           ++       ++  +  +      L  +   +   I  
Sbjct  389  FWRITGISALVLVLIGAATALGEVSRSVGKMTFSTPILGLVLDAPLLGGLYFYLLKRIRG  448

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
              V    +     R      L   +  +++G   L  ++PG+   + + F   ++ D  +
Sbjct  449  EPVRAETAFAGFSRCPLQLFLAGFVTQVLIGLSLLCFLLPGIYLFIAWRFTLPLVIDKRL  508

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
                A+  SR ++S HWW     F +++ I
Sbjct  509  EFWTAMRLSRKIISRHWWKFLAFFFVVVAI  538


>TNN37383.1 hypothetical protein EYF80_052451 [Liparis tanakae]
Length=228

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 18/219 (8%), Positives = 43/219 (20%), Gaps = 0/219 (0%)

Query  40   PAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSG  99
                              C +   +    L        C                     
Sbjct  1    MWFCLLWFCLMWFCLLWFCLMWFCLLWFCLMWFCLLWFCLLWFCLLWFCLMWFCLLWFCL  60

Query  100  LRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAI  159
            +          W            +    ++      F  +      +         + +
Sbjct  61   MWFCLLWFCLLWFCLMWFCLLWFCLLWFCLLWFCLLWFCLMWFCLLWFCLMWFCLMWFFM  120

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
            L   + +  +    +     ++ C     +     L    +    L   LL   +    L
Sbjct  121  LWFYLLWFYMLWFCLLWFCLLWFCMLWFCMLWFYLLWFYMLWFCLLWFCLLWFCMLWFCL  180

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
            L         +WF+   + +    +     L    +LV 
Sbjct  181  LWFYMLWFCMLWFYMLWFYMLWFCLLWFYLLRHLIVLVR  219


>WP_083225497.1 zinc-ribbon domain-containing protein [Neptunicoccus sediminis]
Length=608

 Score = 50.2 bits (116),  Expect = 0.002, Method: Composition-based stats.
 Identities = 9/41 (22%), Positives = 14/41 (34%), Gaps = 0/41 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
            + CP C A+     + +P      +C  C      D  E 
Sbjct  2   RIVCPRCVAQYEVDEAAIPESGREVQCANCDNIWFQDYIEM  42


>NCO61614.1 hypothetical protein [bacterium]OIP37934.1 hypothetical protein 
AUK25_13750 [Desulfobacteraceae bacterium CG2_30_51_40]PIP45047.1 
hypothetical protein COX16_15240 [Deltaproteobacteria 
bacterium CG23_combo_of_CG06-09_8_20_14_all_51_20]PIY25649.1 
hypothetical protein COZ11_04765 [Deltaproteobacteria bacterium 
CG_4_10_14_3_um_filter_51_14]PJB36600.1 hypothetical 
protein CO107_07305 [Deltaproteobacteria bacterium CG_4_9_14_3_um_filter_51_14]
Length=267

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 39/301 (13%), Positives = 73/301 (24%), Gaps = 37/301 (12%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             + CP CG  R     K+P     A CP C          S+   +    A C       
Sbjct  4    ELTCPACGFTRPVDDKKVPENARRAICPRCK----SRFPFSRGEISYVQSAPCAASLGGS  59

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                        +        +  +      R   S       L           G G +
Sbjct  60   GDRDISRGQGDDSAGFFGLAIAATISAACSPRRFFSSSAIKYSLRESFAVGLLMGGAGAM  119

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
              +L+ ++          +        P              +  ++  S       + +
Sbjct  120  LAFLVRVLFYGPVDAFFGMSPDGRGTGPAWLIRGLMAGPVYASANIVASSVFLHICLLIV  179

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                 G   + +           +  L         +  +   L   V + +  Y++   
Sbjct  180  RGGKNGFGATFR-----------VAALSQFPYLLSFVPYVGYWL--AVLWSWVLYLVGLK  226

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             I  +           G+   + G FV    +          I  +G A N+   +L T 
Sbjct  227  EIHSI-----------GYGRVVAGLFVAFGFL---------GIFALGAALNMILGILRTA  266

Query  303  F  303
            F
Sbjct  267  F  267


>NLB53179.1 hypothetical protein [Syntrophomonadaceae bacterium]
Length=236

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 32/197 (16%), Positives = 73/197 (37%), Gaps = 16/197 (8%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
            +  ++    I   L+L   +    +      A  +       L +  +   + +Y  + +
Sbjct  17   IAAMVLPVGILVGLILLGISNSRNEALQLMMAFNVPGWLLSPLWIGPLIYGLSLYESEQE  76

Query  187  VGLFRSMKLGLRHVGSFTLLLILLIL--------------VVGGGSLLLIIPGLLFCVWF  232
                  ++ G+R      ++ +++I+                   SLL  +  +     +
Sbjct  77   FSYKTVLRNGVRVWWKLAIVALIVIVIKFIPVAGGEIFPQQQVLFSLLTWVIAIFLFAIY  136

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
             F   VL  ++ G L+ L +   L  G   +IF  F+ L ++   +SF+   +  +G+AA
Sbjct  137  VFIWPVLVLESHGPLETLRRCAALSKGRRLSIFFEFIALWLLIYVVSFIMMVL--LGQAA  194

Query  293  NLAFSLLLTPFSFLYYY  309
            NL   +     +F+   
Sbjct  195  NLIGGMASIYINFILIM  211


>HBD83533.1 hypothetical protein [Dehalococcoidia bacterium]
Length=173

 Score = 47.8 bits (110),  Expect = 0.002, Method: Composition-based stats.
 Identities = 21/168 (13%), Positives = 58/168 (35%), Gaps = 14/168 (8%)

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC-VWFFFCQ  236
            +  Y       +  ++   +  V +    L+L  + V   S  +    +L   +      
Sbjct  1    VDCYRRLIWRLVSMAILAAIFAVTTIMGWLLLSFVGVTLYSFAVATVLILGAYIVPALVG  60

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE------  290
             V+  + +  + AL ++  LV  +   +    ++  +++  L F+      +        
Sbjct  61   PVVIIEGVKTIAALPRAFELVRQNVLRMIWDLLVFFLVAFGLGFVLFTPFILFFPSETDA  120

Query  291  ------AANLAF-SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
                    +L   ++++ P   +   L+Y DL+    G     + ++ 
Sbjct  121  LSRTLVVVSLIAPAVVVPPVLSIAITLLYYDLRVRNEGFNIEKLSQEM  168


>WP_129204334.1 DUF4129 domain-containing protein [Xylanimonas allomyrinae]QAY63335.1 
DUF4129 domain-containing protein [Xylanimonas allomyrinae]
Length=614

 Score = 50.2 bits (116),  Expect = 0.002, Method: Composition-based stats.
 Identities = 29/184 (16%), Positives = 55/184 (30%), Gaps = 19/184 (10%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            +              +  S+   +    VG        L  + +      L +LVV    
Sbjct  183  VRDVWARVGPRVWLLLAWSLLQTVALLVVGCVLLFVTFLLTIAAEQASTALAVLVVIPML  242

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            L  +       +        LA +       + ++ +L    +W  FG ++L  VI   +
Sbjct  243  LGTVAVLAWLGIRLLLVPPALALERAPLWATVRRAWILTRRSFWRTFGIYLLAYVIVSVI  302

Query  279  S-FLTARIPYVGEAANL------------------AFSLLLTPFSFLYYYLIYSDLKANY  319
            +  +   +  VG  A+L                    + + T F      L+Y DL+   
Sbjct  303  AQIIAGALGVVGGVASLSSQSVGMAVFVTTVVTTVISTAISTIFLAGVVSLMYIDLRMRR  362

Query  320  RGPQ  323
             G  
Sbjct  363  EGLD  366


>WP_027175982.1 zinc-ribbon domain-containing protein [Desulfovibrio aminophilus]
Length=305

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 33/317 (10%), Positives = 69/317 (22%), Gaps = 25/317 (8%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             V CP C   R  P  K+PA+   A CP+C     F     +  +T            + 
Sbjct  2    LVTCPQCQFSRELPEDKIPARAQVATCPKCKHKFRFRDLPPEEAETAPAAPAEDVPTAEA  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                         +  R  +        +  +       +  +             +G  
Sbjct  62   PETPPPAAPAEDDIWERLGSIQPEQAEPQPGQTPDDPFAADPERPEVDVPFERLDQFGFF  121

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
                  I  A           P   L               V      + +        +
Sbjct  122  PGITQTIRRAMFSPQLFFQAMPQRGLGKPL--------TFAVLLGQFQIFFQLLWSMTGL  173

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                  +             F L++ L++  +     L +   +       F       +
Sbjct  174  LGEKPEVA-------PGTMGFGLVMALVLAPLFLSVFLFLETAVFHFCLILFRSANKGFE  226

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF----------LTARIPYVGEAA  292
                + A   + L++S    A      L  +    +            +      +    
Sbjct  227  GTFRVMAYSNAPLVLSFVPVAGPILAYLWGLGITVVGARHMHGASLGRVLGAFALLLVIV  286

Query  293  NLAFSLLLTPFSFLYYY  309
                 L+    + +   
Sbjct  287  GGILGLMYYAAATIPPA  303


>HHB90317.1 hypothetical protein [Anaerolineae bacterium]
Length=338

 Score = 49.8 bits (115),  Expect = 0.002, Method: Composition-based stats.
 Identities = 28/196 (14%), Positives = 64/196 (33%), Gaps = 36/196 (18%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL--  219
             +   +L+ ++ +       +     G+  S + GLR    +  +++L  L +    +  
Sbjct  86   LSGIVVLITMTAVIWMADRLLHGESPGVIESWRAGLRFFLRYLAIILLFALAIVLSMIGL  145

Query  220  ---------------LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
                           +  I      V        L  ++ G ++A++ +     G +W I
Sbjct  146  IIVLLIPCLGAIMLLVGFILYFYLYVRLILTPVALVVEDCGPIEAIQTAWRTSRGFFWRI  205

Query  265  FGRFVLLLVISLTLSFL-------------------TARIPYVGEAANLAFSLLLTPFSF  305
             G  +LL +++L    +                      + +         ++L TP   
Sbjct  206  AGYALLLGLLALVFYLVPLGLMQFFLIANAFEPSNTLLMVSFGMSTLVSIINILWTPVYL  265

Query  306  LYYYLIYSDLKANYRG  321
                ++Y DLK  +R 
Sbjct  266  TGLLILYYDLKLRHRP  281


>WP_162658013.1 hypothetical protein [Tuwongella immobilis]VIP02885.1 : zinc_ribbon_5 
[Tuwongella immobilis]VTS02743.1 : zinc_ribbon_5 [Tuwongella 
immobilis]
Length=336

 Score = 49.8 bits (115),  Expect = 0.002, Method: Composition-based stats.
 Identities = 19/184 (10%), Positives = 41/184 (22%), Gaps = 5/184 (3%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              V CP C    + P +         RC  C  T+          +              
Sbjct  3    IPVTCPGCQTVYDVPDTI---AGKRIRCKACGATI--PVDAPIELEFDPTSRDSKPAFEV  57

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                  +     +  +    +     +   +    G               LF R  + +
Sbjct  58   VADDFTKKRSWERDPDSPDSDPFQHERYTFKPVGIGINGCYEIGDAYGEPILFARAPYRI  117

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +  + +      + S   +           +      + TV ++ + L   T    I 
Sbjct  118  SILGWIALFFLGPAMASCFCVSMVFQAAQFAGHQGDNSGILTVFFLGMALVSSTFFGMIG  177

Query  182  ICKT  185
                
Sbjct  178  GRGK  181


>NJO84766.1 hypothetical protein [Blastochloris sp.]
Length=316

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 21/221 (10%), Positives = 52/221 (24%), Gaps = 2/221 (1%)

Query  91   REFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNP  150
                     +   S    +++            + +    L        L       L  
Sbjct  52   WHAAVELPPMIGYSFGFFEAFANITSAELLFSLLIIARYFLLIPLAIGVLTYVSVALLAD  111

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
            Q  +   A  L       L  + +   +   I    +  F    L   +   +       
Sbjct  112  QTYSIADAYRLMIKRAPPLVFASLAPFVCSGISIVLMSAFSLASLSWYNGLGWGYTSGSF  171

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
            ++       L     ++  +      ++   +    L+   +S  L       +   + L
Sbjct  172  LVRQILYWYLPAFLLVMLFIRLSLAPHIAILEKQSPLRCWRRSWWLTKDISGRLLIIWAL  231

Query  271  LLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
            L  +    S + A +  +        S     +      +I
Sbjct  232  LGFLVFISSTVFAAL--IYFLLTFCASAFPGMWELHNVAMI  270


>MYE60404.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=225

 Score = 48.6 bits (112),  Expect = 0.002, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 12/37 (32%), Gaps = 1/37 (3%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            M    CP+C A        L A     RC +C     
Sbjct  87   MIL-ACPNCIARYRVEVEALGAMGRRVRCQKCGHVWH  122


>MBA3502893.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=280

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 22/212 (10%), Positives = 58/212 (27%), Gaps = 24/212 (11%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
               +    F       L I     +   + +  AL       L   +       +     
Sbjct  14   WGRNLPRFFLLTVVCYLPIVGWYALHEISAVKDALHNYLYEPLFRLHPMLHPEAVGLGFI  73

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG---------  216
             + +  +     +   +      ++R + + LR + +  ++ ++  L   G         
Sbjct  74   PLAVFAAAAAVCIVATLRDERASIWRGLAVALRRLPALVVIALVTRLATTGIASVIRIVR  133

Query  217  ---------------GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW  261
                             +LL +  ++   +F         +  G   A+ +   L  G+W
Sbjct  134  YDPDRPYASPDTTYPWLVLLAVIWIVAWSFFISAIPAATLERRGPFSAIARGFALARGNW  193

Query  262  WAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
              I    ++  V+   L    +++        
Sbjct  194  IKILAVVLVHYVLVFALYMTVSQLMLPWVFGG  225


>HBW19175.1 hypothetical protein [Actinobacteria bacterium]
Length=277

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 24/135 (18%), Positives = 44/135 (33%), Gaps = 0/135 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
            A      + LS     +         G   +               +L++L    G +LL
Sbjct  77   AVSILGTVFLSGFLCRLVGAAEHGRRGASIAEVARSLPWWRLVRADLLVVLFTTVGIILL  136

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            IIPGL+  V       V+  ++     AL +S  LV  ++W +    + L++    +  +
Sbjct  137  IIPGLVALVLLAIAGPVIELEDRPVWAALRRSAHLVRPYFWKVALLTLPLMIAGSEVESI  196

Query  282  TARIPYVGEAANLAF  296
                   G       
Sbjct  197  APHPDGPGTIVAALV  211


>MAJ63489.1 hypothetical protein [Alphaproteobacteria bacterium]MAS46674.1 
hypothetical protein [Alphaproteobacteria bacterium]MAX94769.1 
hypothetical protein [Alphaproteobacteria bacterium]MBN53779.1 
hypothetical protein [Alphaproteobacteria bacterium]OUT42660.1 
hypothetical protein CBB62_05360 [Micavibrio sp. 
TMED2]
Length=290

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 12/37 (32%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V CP C      P S +  +    RC  C     
Sbjct  1   MI-VACPACNTRYELPPSSISGEGRQVRCARCGNQWF  36


>RYD76756.1 hypothetical protein EOP53_14130 [Sphingobacteriales bacterium]
Length=311

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 20/143 (14%), Positives = 43/143 (30%), Gaps = 5/143 (3%)

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
            +     TG           GL   +  G           +L++ V   G   +    + F
Sbjct  116  VWRGAKTGIWKSIGIGLGFGLIMMIIAGAYAFIFRLGGAVLMVFVFLIGFFAV----IYF  171

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
               F +       D     ++  +S  L   H+WA FG  ++  ++   +  +     ++
Sbjct  172  IFRFVYTFPAALVDEYSVAESFSRSWKLTYDHFWANFGIALVFSLLISMVGLVAMIPYFI  231

Query  289  GEAANLA-FSLLLTPFSFLYYYL  310
                     S  ++P       +
Sbjct  232  LTFMTAMHGSAGISPIWKFVVIV  254


>HAO79507.1 hypothetical protein [Verrucomicrobia subdivision 3 bacterium]
Length=329

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 29/170 (17%), Positives = 53/170 (31%), Gaps = 11/170 (6%)

Query  143  KPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGS  202
              A +     Q +    +   +    L    +   + ++     +  F  ++ G      
Sbjct  162  MFAGFRRSFGQLFLGTFVQGLLVLACLIPFLIILLVKLFPILPQISQFSHLQPGATPDKE  221

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
                L+  +L      LL  IP     V + F   ++ D  +    A++ S  +V+ HWW
Sbjct  222  TVNALVSALLTGLPVGLLCAIPATYLGVCWKFTLPLIIDQQMDFWTAMKTSWKMVNKHWW  281

Query  263  AIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             IFG              L + +   G  A     L   P       + Y
Sbjct  282  QIFGLV-----------ILISLLNVAGLCACCVGLLFTIPIGIAALMIAY  320


>MBI1322290.1 hypothetical protein [bacterium]
Length=283

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 21/118 (18%), Positives = 38/118 (32%), Gaps = 11/118 (9%)

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR  254
              +   G+  L  ++    +  G    I             Q+++ D     LQ+L+ S 
Sbjct  151  QWIVIFGTLILGELIGPFALMIGIFAGIPLLYAVYFSLSQFQFLVVDRETKPLQSLQLSW  210

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             L+ GH        +           +   I  VG  A     L+  P  F+ + + Y
Sbjct  211  ELMRGHRLEYLYLNL-----------ICGVINIVGFLACGFGILITLPLQFVSFAVFY  257


>HHN39454.1 AgmX/PglI C-terminal domain-containing protein [Deltaproteobacteria 
bacterium]
Length=730

 Score = 50.2 bits (116),  Expect = 0.002, Method: Composition-based stats.
 Identities = 10/34 (29%), Positives = 17/34 (50%), Gaps = 1/34 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M    C +CGA+      ++PAK +  +C +C  
Sbjct  1   MI-FTCSNCGAQYKISDDRIPAKGAKVKCKKCGN  33


>WP_074826113.1 zinc-ribbon domain-containing protein [Paracoccus sanguinis]SDW40232.1 
MJ0042 family finger-like domain-containing protein 
[Paracoccus sanguinis]
Length=309

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 15/36 (42%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V CP CGA+    +  +PA+     C  C    
Sbjct  1   MRLV-CPRCGAQYEIDAEAIPARGRDVECSACEHVW  35


>OQA96685.1 hypothetical protein BWY22_01645 [Bacteroidetes bacterium ADurb.Bin217]
Length=246

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 23/173 (13%), Positives = 58/173 (34%), Gaps = 0/173 (0%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
              + F +    L+ IY   +    A   S   ++          ++ W  LL      ++
Sbjct  25   FIQQFGKEYVFLVFIYAAPLFALSAYFASQANIEIHASQQLFYSSYLWYSLLVDFFADVI  84

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
                   ++ ++I    +     M    ++      + I+  +++  G +L I+PG++  
Sbjct  85   VNGVTFSALIMFIQTGKIVREDVMTYFNQNFLFILGVTIVANVIISLGFILFIVPGIISL  144

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            V      +           +  +S  L   +    +G   L+  +   + F+ 
Sbjct  145  VPMSLFVFDRFLHKQSFEISFLRSFELTRSNIALSYGVIFLMYAVIFIVKFVF  197


>KAB2879169.1 hypothetical protein F9K33_10095 [bacterium]
Length=278

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 35/276 (13%), Positives = 79/276 (29%), Gaps = 23/276 (8%)

Query  77   NCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPI  136
                 +    +   R             +    +        +  L +            
Sbjct  1    MNPTHHHPDFIVARRPMIDMIDAGVVFYRASYQTVTQIVLTFYLPLFLIKYLFFRFDLIP  60

Query  137  FSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLG  196
               +      +          A+++   +   +        +F      ++G    +K  
Sbjct  61   IERVFNSFGFFQTGSYTLENVALMVLDGSVNSVTGGATCFFLFEKAKGHEMGASLLIKKM  120

Query  197  LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL-EKSRL  255
                G    +  +  ++   G     IPGL   + F F   ++  +N+    A+ ++SR 
Sbjct  121  AGKTGPLLWITAITTMITLIGLGFCFIPGLWIGIIFLFSTQLVVIENVSSFDAIWKRSRY  180

Query  256  LVSGHWWAIFGRFVL------------LLVISLTLSFLTARIPYVGEAA----------N  293
            LV   WW +   F L              +IS     L  ++ ++               
Sbjct  181  LVLSEWWRVLVYFALTTLLLFVLGLSLTFLISGVYQILLEQLSWLTGLIPDLKMIEIAGG  240

Query  294  LAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
               SLLL P   ++  ++Y D+++   G     + +
Sbjct  241  AVASLLLIPMQVIFITILYFDVRSRKEGFDLEMMLQ  276


>OGV63804.1 hypothetical protein A2498_10680 [Lentisphaerae bacterium RIFOXYC12_FULL_60_16]
Length=221

 Score = 48.6 bits (112),  Expect = 0.002, Method: Composition-based stats.
 Identities = 24/181 (13%), Positives = 53/181 (29%), Gaps = 1/181 (1%)

Query  105  QLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATV  164
             +       F       + + L+     +   F    L  + +   +  +   A      
Sbjct  1    MIRQAFRTSFAVMRDQWVTLLLILGFTYWPLGFLKAYLMDSAYEPYELLSSLSATRFWAQ  60

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
              +++  + +      +I     G   S+  G  + G   +      L      LL++ P
Sbjct  61   FILIIPDAAILYVGLNHIAGEPAGFGESLGYGFLNYGRMWITRFFNYLSWLTLLLLVV-P  119

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            G+     +   +  +  D   G+ AL +S  +  G    IF    +       L      
Sbjct  120  GVYGLTRWSLSEITVVSDRTIGVGALHRSWQMTRGRVGQIFFALCIGGGTYAVLWGAITA  179

Query  285  I  285
            +
Sbjct  180  L  180


>NIV28282.1 hypothetical protein [Anaerolineae bacterium]
Length=162

 Score = 47.5 bits (109),  Expect = 0.002, Method: Composition-based stats.
 Identities = 21/100 (21%), Positives = 33/100 (33%), Gaps = 6/100 (6%)

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            +   V F     V+A +    + AL +S  LV   WWA  G  +L+ +I    + L   +
Sbjct  46   IWLSVSFSMVTSVVAIEKRTPVAALRRSLALVRPRWWASLGYLLLVGLIGSIAAQLIQVL  105

Query  286  PYVGEAA------NLAFSLLLTPFSFLYYYLIYSDLKANY  319
                             SL    F  +    I +     Y
Sbjct  106  AIPLTVVGDATSGFTLASLFGVVFQGILIAGIAAMYTRWY  145


>TGN46554.1 hypothetical protein E4L95_19345, partial [Paracoccus liaowanqingii]
Length=60

 Score = 44.8 bits (102),  Expect = 0.002, Method: Composition-based stats.
 Identities = 11/44 (25%), Positives = 16/44 (36%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  + CP CGA+   P+  +P       C  C       P   +
Sbjct  1   MELI-CPGCGADYALPAGAIPPAGREVECSRCGHVWQATPPAPE  43


>PYP10035.1 hypothetical protein DMD59_06995 [Gemmatimonadetes bacterium]
Length=366

 Score = 49.8 bits (115),  Expect = 0.002, Method: Composition-based stats.
 Identities = 11/35 (31%), Positives = 13/35 (37%), Gaps = 0/35 (0%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
              V CP C        +K+PA    ARC  C    
Sbjct  101  VNVTCPSCETVYRVDPAKVPAGGLRARCSVCSNVF  135


>MBI5228627.1 hypothetical protein [Candidatus Micrarchaeota archaeon]
Length=356

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 33/321 (10%), Positives = 85/321 (26%), Gaps = 23/321 (7%)

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI  167
               +       + L+ + +   +      F   +L                 ++A + + 
Sbjct  30   HFLFLCVAGFFFLLVSLLVFSSIFESLQSFDPSVLLKLISGVFLLLVLVIVFVVALIVFS  89

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII----  223
            +L  +    S   ++      L +S+ +      S      L+ L++    LL+ +    
Sbjct  90   ILVTAAFIDSARDFLEGRPFNLRKSLGVAWSKTLSLLFASFLVFLLICLAFLLVFLGFLT  149

Query  224  ---------------PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
                             L   + F F  Y +   + G ++ ++ S  L  G+   +F   
Sbjct  150  KNIVLFIVLLVFSILFILFISLGFAFVSYYILIGDTGVVEGVQASFNLFKGNPLTVFVVC  209

Query  269  VLLLVISLTLSFLTA----RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
            +L+ + S  ++ + +     +  +  AA+ +      P       +       N      
Sbjct  210  MLVGISSYVIALVGSIPYNFLSNLALAASFSPLYFYIPLFLFAGLIYAVVYAFNSLFCIG  269

Query  325  PPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLN  384
                      +         P   L+  S                   +     +   L 
Sbjct  270  FMTSAFLQLASPPHQPMTAQPATQLLPQSPAASLELASKPFVPLPLPAIEAAQPEAKPLA  329

Query  385  RSLPEEPQRLSSADYKLLLSK  405
            +      +R +    +     
Sbjct  330  KRRAFAFRRKALDRKQPAKKP  350


>NNF76756.1 hypothetical protein [Rhizobiales bacterium]
Length=306

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 7/44 (16%), Positives = 14/44 (32%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  + CP C      P + +  +    +C  C +         +
Sbjct  1   MI-ITCPDCATSYELPDTVIGDEGKLVQCEGCRKIWTHMEPLPE  43


>QLH43106.1 hypothetical protein HWD59_10515 [Coxiellaceae bacterium]
Length=186

 Score = 47.8 bits (110),  Expect = 0.002, Method: Composition-based stats.
 Identities = 16/176 (9%), Positives = 49/176 (28%), Gaps = 24/176 (14%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             +       +         +         + +L  +       +  +L L+V      + 
Sbjct  1    MLLVSWYFYACFLYKANDNLTSRTATYQETFRLIFKRFHHIIAVYFILFLLVVALFSTMQ  60

Query  223  IPGLL-------------------FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
            +   +                         +    +  DN   + A + S  +V G+W  
Sbjct  61   LSHFITNAQARSIAAIIMALAALWLLWTLMYANTRVVLDNANAIAAFKFSFQIVKGNWIR  120

Query  264  IFGRFVLLLVISLTLSFLTARIPYV-----GEAANLAFSLLLTPFSFLYYYLIYSD  314
             F  ++ ++++ + + F    +  +        A +    L+ P       ++  +
Sbjct  121  NFLVWLAVMLVFILIGFGIVALFNLFHAKYLMIATVVILALIMPIVINSIVIVQFN  176


>KKT24208.1 hypothetical protein UW09_C0001G0271 [candidate division TM6 
bacterium GW2011_GWF2_43_87]HBL98363.1 hypothetical protein 
[Candidatus Dependentiae bacterium]
Length=262

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 32/222 (14%), Positives = 65/222 (29%), Gaps = 9/222 (4%)

Query  102  SISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILL  161
             +   L        +    +L I   G V       +     P   +          + +
Sbjct  28   WVVVSLVKLVVFIVQWVSLMLPIGFAGRVFLGTVPETWAFAYPEAMVAHYATYKIVFLGI  87

Query  162  ATVAYILLGLSWMTGSMFIYICK---TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
              + +  L    +   +                 +   +R      +  +L +L+V  G 
Sbjct  88   VFLLFFELLGGIIRMGVTRIALGQYDGQRSGADLLLSQVRLAFRHLIATVLYVLIVFSGL  147

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG------RFVLLL  272
            +  I+PG+ F +  +F + +L D     L AL++S  L  G   A+F          +  
Sbjct  148  VFFIVPGVFFAIRLWFYRQILIDKQCCPLNALKESARLTCGKAMALFMSLLLILIINIPF  207

Query  273  VISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
            +  L L  +    P  G    +     L          ++  
Sbjct  208  LQGLFLFAVLNTHPVHGFFIMVLLLAGLLTIPVATLANVFMY  249


>CCZ83064.1 putative uncharacterized protein [Ruminococcus sp. CAG:254]
Length=268

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 18/120 (15%), Positives = 45/120 (38%), Gaps = 8/120 (7%)

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL-ADDNIGGLQALEKSRLLVSGHW  261
            +  L  L  L+   G ++  I  ++  + + F  Y+L    +I  L AL++S  +  G+ 
Sbjct  137  WLFLWALFALIPIVGWVIAPIMLVIKGISYSFTPYLLLRKKDISPLDALKRSMEMTKGYR  196

Query  262  WAIFGRFVLLLVI-------SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
              IF  +++              ++ +      +     L   +    F  +    ++++
Sbjct  197  GKIFLTYLICYAALLVVGLLLFLIAKIGGFAVVLAGLIGLVIGIFCPLFFGVLRAAMFTE  256


>MBE6365153.1 hypothetical protein [Lentisphaerae bacterium]
Length=338

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 32/282 (11%), Positives = 72/282 (26%), Gaps = 12/282 (4%)

Query  6    CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
            CPHC      P  KL    + + C             + +    ++   C  C     + 
Sbjct  68   CPHCMQFYELPRRKLNTSVTCSNCDNDFVVEKTITCPNCQNICRESETICSVCETNLDVC  127

Query  66   SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIY  125
                 +            SF L                +    +    + +  +    I 
Sbjct  128  KSNQNLTGWKKILNCNVDSFFLYFNPPALLKAVRENMENCWQEEETASWKKVLFWPFAIL  187

Query  126  LLGIVLAFAPIFSALLLKPATW-LNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
             +   L  + +   L          P    + W I+   +A + L L  +   +   + +
Sbjct  188  TILFALCISVVPYLLFNFGYVEKCPPVGILFVWFIIHIVLATLPLMLIRVGQGLSCKLTE  247

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
              +         +          + ++  V     +LI  G+L  + + F          
Sbjct  248  KVLKWPYYFSAAIYWGIVAIGAAMWVVNKVFQIEHVLIFLGILAILMYSFFVTYSGIGE-  306

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
                        V       FG   +++   L +  + + + 
Sbjct  307  ----------RQVFKIQCKAFGFTFVIMAAVLVIIRIWSALT  338


>TMQ50188.1 hypothetical protein E6K71_03085, partial [Candidatus Eisenbacteria 
bacterium]
Length=132

 Score = 46.7 bits (107),  Expect = 0.002, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 12/34 (35%), Gaps = 0/34 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           V C  C         K+P + +  RC +C     
Sbjct  3   VECNSCNTRYTIADEKIPPQGARVRCRKCQAVFQ  36


>MBB71708.1 hypothetical protein [Legionellales bacterium]
Length=265

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 36/232 (16%), Positives = 72/232 (31%), Gaps = 33/232 (14%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            LL    L +V   A  F    +              +  +       L  L+ +    F 
Sbjct  32   LLMCIGLILVYVTAGFFLVSAVGEILHQPTLATILFYLYVAFYFVVALFCLTAIYNKAFD  91

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG------------------------  216
             +   D+      ++ L+     T   I+L L +                          
Sbjct  92   TLQGRDLAYGSMWRIALKRCWRMTGAYIILALAIFLPGAVWALITHFGFAAFPALHKVAY  151

Query  217  --GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              G+ ++++      V F     ++A D    +Q L+ S  +  G W   F  F+L +++
Sbjct  152  SLGAAIIVVVFAFIAVRFLLTLPLIAIDGEHLIQGLKHSFQMTKGLWLRYFCVFLLGVLL  211

Query  275  -SLTLSFL---TARIP---YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
              + +S L      IP   ++  A     ++  T        ++ +D K   
Sbjct  212  PCVVISSLADYLQAIPNGEFLATAVQYLVTIFGTVLLVGGNIVLLNDAKLRN  263


>MBI83248.1 hypothetical protein [Planctomycetaceae bacterium]MBP63430.1 
hypothetical protein [Planctomycetaceae bacterium]
Length=358

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 31/337 (9%), Positives = 74/337 (22%), Gaps = 37/337 (11%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                C  C     T       +     C +           S    +    +        
Sbjct  3    IEFPCSGCQQLLRTADGTAGKQAQCPECSQIQTVPSAGSPSSMTADSGYAASPESDSFEA  62

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                S     ++           F                  S+               +
Sbjct  63   VPGGSVANPDRAPAFELSSTEMGFSTGEGTLQPTRIEFGEIFSRTWERFSAQLGTCVLFV  122

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQN---WQWAILLATVAYILLGLSWMTGSM  178
              +  +     +    +  +L  A     +         A L+ ++              
Sbjct  123  FCLAGVHCAAWYISTEATGMLAAAGEQWGEPTMVKLIPMASLVWSLFVGSFVTCITVRFG  182

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL------------  226
               + +    L    K+G   +    L +++ ++V     +  +  G+            
Sbjct  183  LNLLHRRPSPLGDMWKVGPYFLRVLLLHVLIFVVVAAASVVCALPVGIAASTQDDTAMLI  242

Query  227  ---------LFCVWFFFCQ--------YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV  269
                     +  + F F          + + D     L++L  S   + G+    F    
Sbjct  243  GATISGVLAIPAIMFAFIYGYGILLAGFFIVDQENDVLESLRNSIRYMYGNKLTAFCIKF  302

Query  270  LLLVISLTLSFLTARIPYVG-----EAANLAFSLLLT  301
            ++  ++  +  LT  +  V              L  T
Sbjct  303  VVGGLTALILLLTCGLALVFAPSYYALLMAVIYLSAT  339


>RMC22904.1 hypothetical protein DUI87_00090 [Hirundo rustica rustica]
Length=1156

 Score = 50.2 bits (116),  Expect = 0.002, Method: Composition-based stats.
 Identities = 13/128 (10%), Positives = 24/128 (19%), Gaps = 2/128 (2%)

Query  7     PHCGAERNTPS--SKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
             P C    + P     +P      R PEC +  + +                 H       
Sbjct  1012  PECHRCHHVPECHRPVPECHRRHRVPECHRHGVPECHRHCVPTCHRRRVPKCHRHRVPEC  1071

Query  65    PSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI  124
                 +    + +    C    C    R              +          +    +  
Sbjct  1072  HCHCIPTCHRRIPECHCPVPECHHRRRVPECHRHIPECHRHIPECHHCHCVPKCHRCIPK  1131

Query  125   YLLGIVLA  132
                 I   
Sbjct  1132  CHHPIPKC  1139


>MAQ70683.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=287

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 11/44 (25%), Positives = 15/44 (34%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M    CP C  +    S KL  +    RC  C      +P   +
Sbjct  1   MIL-TCPKCSTQFKLSSEKLGNEGRKVRCSNCAHIWFQEPEPVR  43


>KIP51965.1 hypothetical protein SD72_12085 [Leucobacter komagatae]
Length=318

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 14/80 (18%), Positives = 28/80 (35%), Gaps = 0/80 (0%)

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
             L IP L           V+  +      ++ +S  L     W I G  ++L +I+    
Sbjct  193  ALCIPLLWLAARISLAVSVIVFEGTPVWASIRRSWALTRHSAWRIVGLQLVLGLIAGAAM  252

Query  280  FLTARIPYVGEAANLAFSLL  299
            F+   +  +       F++ 
Sbjct  253  FVLVYLTTLITNTAFPFAIW  272


>WP_004059610.1 hypothetical protein [Haloferax mediterranei]AFK19889.1 hypothetical 
protein HFX_2200 [Haloferax mediterranei ATCC 33500]AHZ23268.1 
hypothetical protein BM92_11745 [Haloferax mediterranei 
ATCC 33500]ELZ99433.1 hypothetical protein C439_12804 
[Haloferax mediterranei ATCC 33500]QCQ73870.1 hypothetical 
protein E6P09_00690 [Haloferax mediterranei ATCC 33500]
Length=291

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 26/169 (15%), Positives = 58/169 (34%), Gaps = 4/169 (2%)

Query  154  NWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV  213
               + +  AT+  +L           I++    +G+  +  +       + +       +
Sbjct  113  FVLYRLGRATLHLLLGSWITFVLGAIIFLLSAGIGVAGAAAVTTVTASDWLVATWPGRGL  172

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            +   SL+L++P  +F +   F    +A  + G   AL  S  L  GH   +     + LV
Sbjct  173  LVVASLVLLLPTAVFGIGVAFFSQEVAIRDKGVRGALVGSWRLTRGHRLRLGVLIFVPLV  232

Query  274  ISLTLSFLTARIPYVGEAANLAF---SLLLTPFSFLYYYLIYSDLKANY  319
            I   L  + + +   G  +       + + T        + Y ++    
Sbjct  233  IHGILGMVLSLLAT-GALSQGIVVVETAIATVLIQGIMAVAYLEISGIN  280


>MBF1069313.1 hypothetical protein [Prevotellaceae bacterium]
Length=227

 Score = 48.6 bits (112),  Expect = 0.002, Method: Composition-based stats.
 Identities = 25/139 (18%), Positives = 48/139 (35%), Gaps = 1/139 (1%)

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
            +    L    +  S+   + +          L +     F +   +  L V  G    I+
Sbjct  63   LGINALVTVSIFASILKLLRENGGSYSFDHGLSVSVYVKFAVCQFIYGLAVVLGFAFFIV  122

Query  224  PGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            PG+   V + F    L D  N G  +AL+ S     G +W +FG  +   +++++   L 
Sbjct  123  PGVFIAVRWVFAPLYLIDHPNAGIGEALQASWNKTEGLFWPLFGLGLAATIVTISGILLC  182

Query  283  ARIPYVGEAANLAFSLLLT  301
                Y          ++  
Sbjct  183  CIGIYFTMIIAYVAQVMTY  201


>PKL41312.1 hypothetical protein CVV44_01370 [Spirochaetae bacterium HGW-Spirochaetae-1]
Length=414

 Score = 49.8 bits (115),  Expect = 0.002, Method: Composition-based stats.
 Identities = 35/294 (12%), Positives = 82/294 (28%), Gaps = 1/294 (0%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            +RC  C  + + P  ++  K+    C  C   +     + +   +        H      
Sbjct  33   IRCGKCNQKYHLPDDQVDDKRVYFFCENCGHRI-VVNRKKEAWFSYHVPEPLSHGVDILE  91

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
                    ++  +         C+       AS +     + +    + +F      +  
Sbjct  92   GIYLSFNRKNFFITFFLLLFWTCIFAIAALTASRTMTFFSTHIFLGGFIIFIMAMLFMWT  151

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
              +   +L+    F     K   +   +         LA ++  +  +  +       + 
Sbjct  152  FDVHLYLLSKNLFFRIKNGKNLIFSGARADIVHDMPSLAFISMGIPAIFILVILPVYLMK  211

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
                  +     G   V +  +LLIL    +    + L    L+  V  F    ++ + N
Sbjct  212  SEFGAAYAGFFHGFMLVCALFILLILHCKNILMAFIALRPRSLVHTVGGFSRFLIVENIN  271

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            I     +     L       +     L +      + L + +P+ G       S
Sbjct  272  IPVYLGIISIVTLFFAGMIFLLLAGALTVTTLTAGTMLPSLLPHSGIFPGFLGS  325


>KPJ81436.1 hypothetical protein AMJ58_05070 [Gammaproteobacteria bacterium 
SG8_30]
Length=269

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 18/130 (14%), Positives = 39/130 (30%), Gaps = 13/130 (10%)

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
                     +V    L  ++      V        +     G + +L  S  L  G +W 
Sbjct  134  WAWQGGSYSLVVSVVLAALVALSWISVRLALVTAEVVLRPAGPVASLRSSWRLTRGFFWH  193

Query  264  IFGRFVLLLVISLTLSFLTARIPYVGEAAN-------------LAFSLLLTPFSFLYYYL  310
            I   + +LL++ + +  +   +  V                  +  S+ L P       +
Sbjct  194  IGMVYGVLLLMMIAVFLVAGIVTAVLTVLAPALAVFLSSIVSVVLLSVFLLPLGAAVVVV  253

Query  311  IYSDLKANYR  320
            ++ DL+    
Sbjct  254  VWYDLRLRAG  263


>KAA8535468.1 hypothetical protein F0562_030471 [Nyssa sinensis]
Length=285

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 27/185 (15%), Positives = 58/185 (31%), Gaps = 8/185 (4%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            +                 S +  +       F    L    V +      +  + +    
Sbjct  93   LKDLLSGIARSWKRLFITSFYTALLGLGYTFFVLATLIPLLVIAADHPFAVKYIFIIFII  152

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            L  ++   L  VWF      + ++N  G++AL K+ +L+ GH    F    L +++   +
Sbjct  153  LATLLYLYLSVVWFLALVISVTEENCYGIEALGKAVVLIKGHRLNGFALNFLSILLIFIV  212

Query  279  SFLTARI----PYVGEAANLAFSLLLT----PFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
                  I        +     FS+        F F+ Y ++Y   K ++       +   
Sbjct  213  FPGVRMIKVGQSLALQVVTGLFSVNCIFLLSMFQFIAYTVLYFQCKKSHGEEIEMQVGMG  272

Query  331  WLPLT  335
            +  + 
Sbjct  273  YSKIP  277


>WP_011240330.1 zinc-ribbon domain-containing protein [Zymomonas mobilis]AAV89038.1 
MJ0042 family finger-like protein [Zymomonas mobilis 
subsp. mobilis ZM4 = ATCC 31821]AHB10162.1 Protein of unknown 
function (DUF3426) [Zymomonas mobilis subsp. mobilis str. 
CP4 = NRRL B-14023]AHJ70469.1 MJ0042 family finger-like domain 
protein [Zymomonas mobilis subsp. mobilis NRRL B-12526]AHJ72324.1 
MJ0042 family finger-like domain protein [Zymomonas 
mobilis subsp. mobilis str. CP4 = NRRL B-14023]AVZ25390.1 MJ0042 
family finger-like protein [Zymomonas mobilis subsp. mobilis]
Length=346

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 10/44 (23%), Positives = 13/44 (30%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M    CP C  +       + A     RC  C  + I  P    
Sbjct  1   MIL-TCPACHTDYEVQDGMITANGRRVRCASCGHSWIAYPDSRD  43


>HCP47820.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=55

 Score = 44.4 bits (101),  Expect = 0.002, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + CP C        +KL  K +   CP C  + 
Sbjct  1   MV-ITCPSCSERYRLNPNKLKGKGARITCPSCAHSF  35


>MBC7792681.1 hypothetical protein [Clostridia bacterium]
Length=189

 Score = 47.8 bits (110),  Expect = 0.002, Method: Composition-based stats.
 Identities = 23/148 (16%), Positives = 53/148 (36%), Gaps = 5/148 (3%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             +   +   L      M           F  + LG + V    L  ++ +L V       
Sbjct  46   LSSTILGGPLLVGFLRMTEKSLNGKTIDFADLGLGFQKVSGPMLAWLVYVLAVTVCLTAF  105

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            ++PGL   + + F  + +A D+     +++ +  L +    A      +++ +++ L  +
Sbjct  106  VLPGLFIAIAWMFAFWFIARDDCLSSDSIKHAWRLFAKAPGAC-----VVIALTVVLVNI  160

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYY  309
               +  VG   ++  SL+     F    
Sbjct  161  VGALTIVGILVSVPVSLVFMTLCFHGLT  188


>WP_016670306.1 hypothetical protein [Propionibacterium sp. oral taxon 192]EPH00427.1 
hypothetical protein HMPREF1531_02539 [Propionibacterium 
sp. oral taxon 192 str. F0372]
Length=326

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 24/182 (13%), Positives = 60/182 (33%), Gaps = 13/182 (7%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            + +  +  + +   +  A +              + A      +   +GL  +   +   
Sbjct  93   INLCWVVSIWSVLLLSVATVQVDRGGAPTMGSVMRDAKGSFARSAGAVGLIMIMAIIGYA  152

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            I    V L   ++ G     +  L+L L  L                   F     V+A 
Sbjct  153  ILGAIVALSIYLESGWPMAIAVLLVLALFGLQFVFI------------ARFGLVAQVMAI  200

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-PYVGEAANLAFSLLL  300
            +++G + AL++S  +  G    + G  + + +I    S +   +  ++   ++   ++  
Sbjct  201  ESLGPVSALQRSWQITRGQGLRVLGYTIAVSLIISAASSIIQLLTNFMTSISSNLATIQT  260

Query  301  TP  302
             P
Sbjct  261  PP  262


>HHU43896.1 DUF975 family protein [Clostridiales bacterium]
Length=117

 Score = 46.3 bits (106),  Expect = 0.002, Method: Composition-based stats.
 Identities = 20/110 (18%), Positives = 39/110 (35%), Gaps = 12/110 (11%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
              G  LLIIPG+     +    Y+ AD  +I   +A+ +S  ++ G+   +F   +  + 
Sbjct  1    MIGLFLLIIPGIYLAFKYSMIDYIFADKKDIKYAEAMRESGEIMKGNKARLFVLVLSFIG  60

Query  274  ISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                         +VG        + + P+        Y D++       
Sbjct  61   WF-----------FVGVITLCLGFIYVIPYVETTLAAFYLDIRKPLEEQP  99


>NQW61576.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=457

 Score = 49.8 bits (115),  Expect = 0.002, Method: Composition-based stats.
 Identities = 12/36 (33%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + CP C A       KLPA     +CP C    
Sbjct  1   MI-IECPSCQARYRIREEKLPAGGGGIKCPNCAHVF  35


>WP_088259214.1 hypothetical protein [Fimbriiglobus ruber]OWK36161.1 Cytoplasmic 
axial filament protein CafA and Ribonuclease G [Fimbriiglobus 
ruber]
Length=261

 Score = 48.6 bits (112),  Expect = 0.002, Method: Composition-based stats.
 Identities = 7/30 (23%), Positives = 12/30 (40%), Gaps = 0/30 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCP  30
           M    CP CGA  +   ++   +    +C 
Sbjct  1   MIRFTCPVCGAAYSADDARAGKRGKCPKCQ  30


>MAU40852.1 hypothetical protein [Kordiimonas sp.]
Length=302

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 9/39 (23%), Positives = 11/39 (28%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M    CP C         KL       RC +C  +    
Sbjct  1   MIL-TCPECSTRYVVDPKKLLPSGRVVRCAKCSHSWQEP  38


>NOZ85968.1 DUF3426 domain-containing protein [Deltaproteobacteria bacterium]
Length=584

 Score = 49.8 bits (115),  Expect = 0.002, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V CP C A      +K+    S  +C +C    
Sbjct  1   MV-VSCPKCKARYKVDPAKIGETGSKIKCNKCGTMF  35


>HHS12604.1 hypothetical protein [bacterium]
Length=125

 Score = 46.3 bits (106),  Expect = 0.002, Method: Composition-based stats.
 Identities = 11/71 (15%), Positives = 21/71 (30%), Gaps = 0/71 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             + CPHC         K+P     A C +C +        + + +T           + 
Sbjct  28  VRIACPHCQTIYKVAVDKIPVSGGEATCRKCGKKFPMRYIAAGQFKTGAERKDALRKPIL  87

Query  62  RRIPSDRLEIQ  72
           +       + Q
Sbjct  88  KCPKCGHRQNQ  98


>NUQ65953.1 hypothetical protein [Pirellulales bacterium]
Length=96

 Score = 45.5 bits (104),  Expect = 0.002, Method: Composition-based stats.
 Identities = 7/28 (25%), Positives = 11/28 (39%), Gaps = 0/28 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARC  29
             + CP C A  +   + +  K    RC
Sbjct  4   IEISCPKCSARYSVDETHMGRKGLCKRC  31


>NOQ52509.1 hypothetical protein [Desulfuromonadaceae bacterium]
Length=228

 Score = 48.2 bits (111),  Expect = 0.002, Method: Composition-based stats.
 Identities = 9/46 (20%), Positives = 16/46 (35%), Gaps = 1/46 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRT  46
           M  + CP C  +    S ++P   +  RC  C      +    +  
Sbjct  1   MVII-CPECSTKFRVNSERIPTSGTKVRCARCKHVFFTEKPVEETM  45


>OGY84421.1 hypothetical protein A3F54_00590 [Candidatus Kerfeldbacteria 
bacterium RIFCSPHIGHO2_12_FULL_48_17]
Length=263

 Score = 48.6 bits (112),  Expect = 0.002, Method: Composition-based stats.
 Identities = 29/189 (15%), Positives = 58/189 (31%), Gaps = 0/189 (0%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L  I  L I +    +   +            +     + L          + +   + I
Sbjct  68   LGLIVFLAIFVTLVALGIQIFSFTLIVQILSQKKETPWVQLIYNTMERFWPTVLLALVLI  127

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +      L   +   L    +         L      +L  +P LL  +   F  + + 
Sbjct  128  LVITLLAVLTDLIAALLLTATAGVSESFFGGLASAFILVLSALPILLITLLLMFMPFEVI  187

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
             +    + A   +  L+ GH++   G  ++L      L+ L   +P++G A N   S +L
Sbjct  188  LNKANIMDAFRTNVTLLRGHFFRTLGYAIMLYFAIFGLTLLLQLLPFIGSAINFLLSSVL  247

Query  301  TPFSFLYYY  309
                    Y
Sbjct  248  LVSYVFVMY  256


>WP_108261701.1 zinc-ribbon domain-containing protein [Mangrovicoccus ximenensis]
Length=174

 Score = 47.5 bits (109),  Expect = 0.002, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 13/38 (34%), Gaps = 0/38 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
            + CP C A  + P + + A     +C  C        
Sbjct  2   RIVCPACQAAYDVPQAAISAGGRDVQCSACGHNWFQLW  39


>MBI3183634.1 zinc-ribbon domain-containing protein [Myxococcales bacterium]
Length=500

 Score = 49.8 bits (115),  Expect = 0.002, Method: Composition-based stats.
 Identities = 14/70 (20%), Positives = 20/70 (29%), Gaps = 1/70 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V+C  C      P  K+  K    RC +C  T     A ++           P    
Sbjct  1   MI-VKCSQCQTRFKIPDEKVTEKGVKVRCTKCQHTFRVKRAPAEAQPAVAPPPARPAPAP  59

Query  61  QRRIPSDRLE  70
                 +  E
Sbjct  60  VSPGRFNPFE  69


>BAL53766.1 hypothetical conserved protein [uncultured Acetothermia bacterium]BAL59464.1 
hypothetical protein HGMM_OP4C100 [Candidatus 
Acetothermum autotrophicum]
Length=225

 Score = 48.2 bits (111),  Expect = 0.002, Method: Composition-based stats.
 Identities = 26/196 (13%), Positives = 52/196 (27%), Gaps = 1/196 (1%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                G L ++ L    +                  +               I    + +T
Sbjct  27   WAMLGGLLVWNLFAYFSGLSWLILAPSSRLPKSFIEIIVDFLLFPALNAIVIRFVYTVVT  86

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
             S  + + +T           +     F +   +  +      LL+ IP +       F 
Sbjct  87   KSQMLSLTETIFSALGRFPTLVGLHAIFVIAGYIFTITPDIPKLLVAIPAIYVWTKLIFA  146

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
               +        +AL  S  L  G+WW +F   ++  +I +  +   +     G      
Sbjct  147  YQEIVAREADVWEALSTSWKLTEGNWWRMFFLALIPGLIMIPFTVEGSSQIVEGTIGW-I  205

Query  296  FSLLLTPFSFLYYYLI  311
             S  L       Y  +
Sbjct  206  SSFWLWCIVTYAYAQL  221


>TXD32378.1 hypothetical protein FRC96_17635, partial [Bradymonadales bacterium 
TMQ2]
Length=232

 Score = 48.2 bits (111),  Expect = 0.002, Method: Composition-based stats.
 Identities = 11/39 (28%), Positives = 14/39 (36%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  +RCP C    N P  ++  K    RC  C       
Sbjct  1   MI-IRCPECSTGFNLPDERVSEKGVKLRCSRCSHVFRVR  38


>WP_116716332.1 hypothetical protein [Euzebya tangerina]
Length=305

 Score = 49.0 bits (113),  Expect = 0.002, Method: Composition-based stats.
 Identities = 14/101 (14%), Positives = 33/101 (33%), Gaps = 13/101 (13%)

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
              L  ++    +A+  +  L    +W + G  +L   +   ++ L A +  +   A+   
Sbjct  194  PALMLEDRPATEAISHAVRLARRGYWRVVGLILLGGFVFNLIAGLLAGVASLFAVASGLG  253

Query  297  SLLL-------------TPFSFLYYYLIYSDLKANYRGPQH  324
               +              P +     L++ DL+    G   
Sbjct  254  FAWVLVGLSNVLSLLIQGPLTAAAMVLLHQDLRVRQEGLDF  294


>NLN63514.1 hypothetical protein [Myxococcales bacterium]
Length=365

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 11/64 (17%), Positives = 19/64 (30%), Gaps = 1/64 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  ++CP+C    N P   +  K    RC +C          +Q         +      
Sbjct  1   MI-IKCPNCETGYNIPDEVVGDKPRRMRCTKCKTMFTVARHSAQPPVGYVEYTSDHSLPP  59

Query  61  QRRI  64
           +   
Sbjct  60  EFAF  63


>WP_020949270.1 zinc-ribbon domain-containing protein [Paracoccus aminophilus]AGT07631.1 
hypothetical protein JCM7686_0522 [Paracoccus aminophilus 
JCM 7686]
Length=395

 Score = 49.4 bits (114),  Expect = 0.002, Method: Composition-based stats.
 Identities = 11/39 (28%), Positives = 16/39 (41%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  + CP+C A+    S+ +P K     C  C Q     
Sbjct  1   MRLI-CPNCDAQYEIDSTLVPPKGRDVECSSCGQVWFQP  38


>HBY73909.1 hypothetical protein [Candidatus Kerfeldbacteria bacterium]
Length=229

 Score = 48.2 bits (111),  Expect = 0.002, Method: Composition-based stats.
 Identities = 32/211 (15%), Positives = 68/211 (32%), Gaps = 3/211 (1%)

Query  105  QLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATV  164
                 +          L  + +L ++      +              N   Q A+     
Sbjct  14   WRQFLNRFDIILFSQTLFNLPVLAVIWYVKQWYPLPETALTIQDYWPNLAIQLALDYGAS  73

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
                + +  +  +M + + +            LR      LL ++ +L+   G  L ++P
Sbjct  74   VITSITVIMIVLAMQLAVKRQPTNFAYIFGAALRLYPWVVLLSLVELLLTVLGLSLFVLP  133

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA-IFGRFVLLLVISLTLSFLTA  283
            GLL  + F           +   QAL +S  LV  H+W   F   +  L++++ +   T 
Sbjct  134  GLLVTILFAMMVPAYVWYKLSPWQALLRSVQLVRKHFWMNAFYILLTQLLVTMVVMLTTW  193

Query  284  RIP--YVGEAANLAFSLLLTPFSFLYYYLIY  312
             +P        +     +   F  +Y  ++ 
Sbjct  194  GLPTTLWFNIFSAWVGAICASFYTVYVTILM  224


>ESW95477.1 hypothetical protein X769_28690 [Mesorhizobium sp. LSJC268A00]
Length=161

 Score = 47.1 bits (108),  Expect = 0.002, Method: Composition-based stats.
 Identities = 25/160 (16%), Positives = 60/160 (38%), Gaps = 15/160 (9%)

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            +GL   + + L  +      +++ I       ++L++ G+ + +       VL  + +G 
Sbjct  1    MGLIVFLAMILGLIVMGIAGMLVPIFGAIIAFVVLLVAGVRWLLGISVSVPVLMQERLGV  60

Query  247  LQALEKSRLLVSGHWWAIFGRF---------------VLLLVISLTLSFLTARIPYVGEA  291
              A+ +SR L  G  W +FG                 +L+ +I    S L+      G  
Sbjct  61   FGAMSRSRALTKGSRWPMFGVLLILFLAALAFQMMFALLIGLIFAFFSGLSTVALIFGAF  120

Query  292  ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
             ++    +++  + +   + Y +L+    G     +   +
Sbjct  121  GSVLIVTVVSTVASVAIAVAYVELRQVREGTSVDELAEIF  160


>WP_191978909.1 hypothetical protein [Lactobacillus fructivorans]
Length=233

 Score = 48.2 bits (111),  Expect = 0.003, Method: Composition-based stats.
 Identities = 30/188 (16%), Positives = 61/188 (32%), Gaps = 5/188 (3%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
              +     + +         ++L+GI       FS   +        +     + +++  
Sbjct  5    HMIANAMTKSWEVVKKHWWMMFLMGIPAGIVGAFSDPSMLLKLQGMQRALFSIFELIVGL  64

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
            V  +      M  +  +                  +     L+ +L+ L     +LL I+
Sbjct  65   VGILFSASLAMGYAKSMQTGSFKFKNAFVAFNRKYNWPVIILVGLLMALAEAFATLLFIV  124

Query  224  PGLLFCVWFFFCQYVLAD-----DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            PG++  V +    Y   D     D +G  QA   +     G+ W  FG  +L   + + L
Sbjct  125  PGVILSVGWGLWMYSYQDNIEAGDEVGITQAFTDAWKTTKGYKWNFFGLQLLFFCMYMVL  184

Query  279  SFLTARIP  286
            + L   I 
Sbjct  185  TILLTAIT  192


>MBE7075657.1 hypothetical protein [Clostridiales bacterium]
Length=303

 Score = 49.0 bits (113),  Expect = 0.003, Method: Composition-based stats.
 Identities = 24/205 (12%), Positives = 55/205 (27%), Gaps = 9/205 (4%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
            +    +   +L               ++ + +            +    I      +   
Sbjct  90   IYAIIVLFLILPFLVNIGKYTFCEMLYSYMTSKTKIGFFSAMIKSLKKSIPFSLCRIVYN  149

Query  191  RSMKLGLRHVGSFTLL----LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
                  +  V    +L      +   +V     +L+I   L  +            ++  
Sbjct  150  WLFLAIIGGVVYALVLPTNEFFVKYCLVFVLYAVLVILFALNKIILLGWTPASIVFDVNV  209

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFL  306
              A +K    V  H+WAIFG  VL   +   L+FL      +         L++   S  
Sbjct  210  FSAFKKGIKAVKRHFWAIFGTTVLYFGLFWLLTFLLGAYSLIPMVV-----LMMALLSIY  264

Query  307  YYYLIYSDLKANYRGPQHPPIKRQW  331
               + +      +    +  +  + 
Sbjct  265  NMTVFFMSQGMRFYINDNKILTPKK  289


>WP_124329045.1 zinc-ribbon domain-containing protein [Desulfonema ishimotonii]GBC61802.1 
rod shape-determining protein [Desulfonema ishimotonii]
Length=459

 Score = 49.4 bits (114),  Expect = 0.003, Method: Composition-based stats.
 Identities = 14/78 (18%), Positives = 21/78 (27%), Gaps = 0/78 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           +RCP C         K+  K   ARC  C            +     +      C    +
Sbjct  3   IRCPECKTRYRVDDEKISGKTVYARCARCQTRFAIKKNRHGQKPPCSDAPQTLFCNRCGQ  62

Query  64  IPSDRLEIQSKTVNCRRC  81
               R  +  K V  +  
Sbjct  63  KSDRRFAVNGKPVCEKCY  80


>NIQ94740.1 hypothetical protein [Desulfuromonadales bacterium]
Length=38

 Score = 44.0 bits (100),  Expect = 0.003, Method: Composition-based stats.
 Identities = 8/39 (21%), Positives = 12/39 (31%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  ++C  C         KL       RC +C +     
Sbjct  1   MV-IQCSSCDTRFKLADDKLKPGGVKVRCSKCKEVFTVM  38


>WP_188660446.1 zinc-ribbon domain-containing protein [Terasakiella brassicae]GGF53045.1 
hypothetical protein GCM10011332_02910 [Terasakiella 
brassicae]
Length=321

 Score = 49.0 bits (113),  Expect = 0.003, Method: Composition-based stats.
 Identities = 13/42 (31%), Positives = 20/42 (48%), Gaps = 1/42 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
           M  + CP+C A  + P + L  +  + RC +C  T    P E
Sbjct  1   MI-ISCPNCSAHYSVPIAALGEEGRTLRCAKCAHTWEQPPYE  41


>HIF44654.1 hypothetical protein [Dehalococcoidia bacterium]
Length=359

 Score = 49.4 bits (114),  Expect = 0.003, Method: Composition-based stats.
 Identities = 33/312 (11%), Positives = 84/312 (27%), Gaps = 31/312 (10%)

Query  30   PECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQP  89
                 T               +                              +     +P
Sbjct  1    MSQQGTPPDRNQTRNAYCMCGDFGPNNGRCDVCGFNFTSFRQPKPQSITLGPSGPISNRP  60

Query  90   EREFRASGSGLRSISQLLADSWELFCRRG--WGLLGIYLLGIVLAFAPIFSALLLKPATW  147
            +    +    + +  +L   SW +            + ++  ++  A      L      
Sbjct  61   DGFLASIAYSINNSWRLHKVSWGVLSHDKELIIFPLLAIVSFLVVLAFGIVIQLSFLMDA  120

Query  148  LNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLL  207
                +      I        +   + + G+  I     D  L   ++    ++ S  L  
Sbjct  121  PITVSGTLFGLIYFLVFFIFVYFEAAVIGAARIRFHGGDPNLSDGIRTSNANLKSLILWA  180

Query  208  ILLILVVGG-------------------------GSLLLIIPGLLFCVWFFFCQYVLADD  242
            ++   V                            GSL++ +   ++    +    V+  +
Sbjct  181  VVSGTVFLILTSLRTLARSLQGGRGLYGLVLRIVGSLVIWLLDAIWKGTTYLVLPVIVYE  240

Query  243  NIGGLQALEKSRLLVSGHWWAI----FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
             +  + A+++S  LV   W  I    FG  ++  ++ L L+  +A +  +  ++    + 
Sbjct  241  QVYPITAIKRSTNLVRHTWGEIIAGEFGFGIIFFLLMLPLTLNSALLGLLAGSSFGFGAG  300

Query  299  LLTPFSFLYYYL  310
            L T F  +   +
Sbjct  301  LSTFFIMIVTTV  312


>RWX46182.1 MJ0042 family finger-like domain-containing protein [Candidatus 
Electrothrix aarhusiensis]
Length=403

 Score = 49.4 bits (114),  Expect = 0.003, Method: Composition-based stats.
 Identities = 19/147 (13%), Positives = 45/147 (31%), Gaps = 0/147 (0%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             +RC  CG   +   SK+PAK + A+C +C   +I  P  + + + ++            
Sbjct  2    KIRCEKCGKSYSVNESKIPAKGAKAKCKDCGILIIIPPKIAVQPEISNFKLCPKCGTKNE  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                D        +  ++       + + + +   S  +   +L              + 
Sbjct  62   ATSEDCFSCGIVFLKYQKLQEKNKRKKDLDEKKYTSNQKINKRLQELEAINLYCDKEEIE  121

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLN  149
             +  +             ++    WL 
Sbjct  122  FLPEIINHDEVIIDIMTAMIGNTNWLI  148


>WP_067487144.1 hypothetical protein [Actinomadura hibisca]
Length=464

 Score = 49.4 bits (114),  Expect = 0.003, Method: Composition-based stats.
 Identities = 28/195 (14%), Positives = 52/195 (27%), Gaps = 22/195 (11%)

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
                  + + I    +            + +      L  L    G    I   +   V 
Sbjct  238  VGALLVVSLAIMGAALLGLLVPAGPFLALLAVDAHPALSALAAIVGVPAGIALMVWLYVL  297

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE-  290
                   +  +     +AL ++R L  G WW   G  +L L++++ + F   RIP++   
Sbjct  298  LVLAAPAVVLERQPVGRALARARHLSRGRWWRTCGTLLLALLVTVFMGFFALRIPFLLAQ  357

Query  291  ---------------------AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
                                    +    ++ PF      L+Y D +    G       R
Sbjct  358  LIFFGDAEGGGAALGALALDTVGRIVSWSVVLPFDAGVIALLYMDRRMRREGMDLDLRTR  417

Query  330  QWLPLTAAIFGWMLI  344
                       W   
Sbjct  418  SRAGADGFFELWRPA  432


>OQA80769.1 hypothetical protein BWY31_03911 [Lentisphaerae bacterium ADurb.Bin242]
Length=646

 Score = 49.8 bits (115),  Expect = 0.003, Method: Composition-based stats.
 Identities = 26/278 (9%), Positives = 67/278 (24%), Gaps = 17/278 (6%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              + CPHC  E     S L   +    C  CC+       +   +  T N      C   
Sbjct  381  IEIVCPHCHQEYKVSESDL---QQEIECAMCCRKFTITQTKYCSSCGTPNPMQAFSCWSC  437

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
            +        I  K            +Q      + G G +    + +    +  +     
Sbjct  438  QASFYMTKPIPEKEK------IPEPIQRFSSADSLGIGYKIAFFIYSFCILVGYKSDKSD  491

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +      +                +   +     +               M   + I 
Sbjct  492  SSLLQSIFGIVILLYVLCSW-----KVFCLSHMITKSYRKIFWTTFRKISIGMVLIIQIL  546

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +       F + +     + S  L + +    +    + +    L          +    
Sbjct  547  LLLLSPFSFFAFREDFNFLASVILTVFVFSYYLFTTFMTIKFIQLFNLRAKEDVDF---V  603

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            ++   +  + +    +   +  I G  ++  ++   + 
Sbjct  604  EDKTFIAFIAEHNNKIIIKFCLITGICLISCIVLCIIW  641


>EKD32722.1 hypothetical protein ACD_76C00161G0021 [uncultured bacterium]HBD05396.1 
hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=238

 Score = 48.2 bits (111),  Expect = 0.003, Method: Composition-based stats.
 Identities = 34/220 (15%), Positives = 62/220 (28%), Gaps = 20/220 (9%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
                +                +    + L         A+    +  +L     +     
Sbjct  18   HFKTLIGYAAWPLAPYFLLISVQLFGSPLGRFQPYADTALNAIVLVSLLWISLLLLIYTD  77

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
              + K  +          +   +F    IL+ L    G +LLI P ++F  WF F   + 
Sbjct  78   SILNKKAISARDLSASANKRFPAFVATAILVALAEISGLMLLIFPAIIFAGWFAFAPAIS  137

Query  240  ADDNIGGLQALEKSRLLVSGH----WWAIFGRFVLLLVISLTLSFLTARI----------  285
            A      L+A+ +S  L  G      W +   F  +  +    +F               
Sbjct  138  ALTGAWPLRAMGQSMELAKGRLVAVTWRLLLGFFSIFAVYTIATFAIIIPISILTGEITA  197

Query  286  ------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
                  P   +  + A S L  PF   Y  ++  +L    
Sbjct  198  DLSKNAPVWMDIVSTAISTLFIPFGISYMLILLRELMKPK  237


>WP_158820944.1 MULTISPECIES: hypothetical protein [unclassified Streptomyces]
Length=392

 Score = 49.4 bits (114),  Expect = 0.003, Method: Composition-based stats.
 Identities = 15/73 (21%), Positives = 31/73 (42%), Gaps = 0/73 (0%)

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            +           V   +  G + AL +S  LV G WW IFG  ++  +I++++++L    
Sbjct  175  VWLGTRLSLAPAVAVMEKAGPVAALRRSAALVKGAWWRIFGITLIGSMIAMSVAYLIQMP  234

Query  286  PYVGEAANLAFSL  298
              +     +   +
Sbjct  235  FQLVGMFGMIPLM  247


>MBM85706.1 hypothetical protein [Rhodospirillaceae bacterium]
Length=290

 Score = 49.0 bits (113),  Expect = 0.003, Method: Composition-based stats.
 Identities = 13/62 (21%), Positives = 18/62 (29%), Gaps = 0/62 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP C    N P   + AK    RC  C       P  + +        T    G + 
Sbjct  12  RIECPECSMAFNVPDGAIKAKGRKLRCSRCEHQWTQYPTLTDQKPQKKKPQTEALNGGEP  71

Query  63  RI  64
             
Sbjct  72  DA  73


>TMA21638.1 hypothetical protein E6J85_07075 [Deltaproteobacteria bacterium]
Length=256

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 15/34 (44%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            VRC  C A      +++  +  + RC +C  T 
Sbjct  2   EVRCDKCQARYRVDDARIGPQGLTMRCGKCQNTF  35


>AHE52920.1 hypothetical protein NX02_05925 [Sphingomonas sanxanigenens DSM 
19645 = NX02]
Length=256

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 27/153 (18%), Positives = 50/153 (33%), Gaps = 3/153 (2%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
                 L  L  M G+  +      +G    +   +           ++   +G   L+ +
Sbjct  81   AFVAALPRLGAMIGATLLIALAGTLGALPLVGAIVMGAADLQAQPPVVPPWIGVYMLVFL  140

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            I  L           ++A   +G + A+  S +   G +W IFG  +L L++S   S   
Sbjct  141  IVALCVWARLILITPLVAV-GMGPVAAIRASVVSTRGCFWRIFGTLLLYLLVSGVASLAM  199

Query  283  ARIPYVGE--AANLAFSLLLTPFSFLYYYLIYS  313
                 +           LL   F+ +   LI  
Sbjct  200  GSALGLVFRLLVPGLAWLLAPAFTAIASALISM  232


>TMQ73111.1 hypothetical protein E6K81_05550 [Candidatus Eisenbacteria bacterium]
Length=185

 Score = 47.5 bits (109),  Expect = 0.003, Method: Composition-based stats.
 Identities = 11/34 (32%), Positives = 14/34 (41%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           TV CPHC      P   +    +  RCP+C    
Sbjct  49  TVHCPHCSTGYLLPDHLVGPGGARVRCPQCAGDF  82


>PIE89581.1 hypothetical protein CR997_10220 [Acidobacteria bacterium]
Length=327

 Score = 49.0 bits (113),  Expect = 0.003, Method: Composition-based stats.
 Identities = 12/40 (30%), Positives = 16/40 (40%), Gaps = 1/40 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M  + CP C A  N   S +  K  + RC +C       P
Sbjct  1   MI-ITCPECKARYNVKDSLVAEKGKNVRCKKCKTAFRTYP  39


>MBI2410695.1 hypothetical protein [Candidatus Kerfeldbacteria bacterium]
Length=298

 Score = 49.0 bits (113),  Expect = 0.003, Method: Composition-based stats.
 Identities = 27/175 (15%), Positives = 64/175 (37%), Gaps = 4/175 (2%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICK--TDVGLFRSMKLGLRHVGSFTLLLILLILVVGG  216
            I      Y++  +      +  +       + +   ++   ++        +L+ L++G 
Sbjct  95   ISWLLSFYLMAAIILGVWRVAKHAPGDQEQLVVQDILRDAKQYWWPLFWTQLLMSLLLGF  154

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
              +L IIP ++F V++ F    +   N   + AL+ SR +V G+WW    RF L+  +  
Sbjct  155  FFVLFIIPAIIFSVYWAFAGVAVVVTNRKYMHALQYSRRIVRGYWWPTAARFFLVGAMVS  214

Query  277  TLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
             +  + +    V   A    ++     +      ++  +   Y       +    
Sbjct  215  IVVTIISIP--VATLAGQFSAIQAVLVAISELMGLFISVWTVYYFQDLQRVYESR  267


>WP_097279092.1 zinc-ribbon domain-containing protein [Caenispirillum bisanense]SOD94803.1 
MJ0042 family finger-like domain-containing protein 
[Caenispirillum bisanense]
Length=332

 Score = 49.0 bits (113),  Expect = 0.003, Method: Composition-based stats.
 Identities = 12/42 (29%), Positives = 14/42 (33%), Gaps = 1/42 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
           M  V CP+C +    P   L       RC  C       P E
Sbjct  1   MI-VSCPNCDSRFTLPDGALGVAGRKMRCARCEHVWHQMPPE  41


>MBI3755643.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=125

 Score = 46.3 bits (106),  Expect = 0.003, Method: Composition-based stats.
 Identities = 9/33 (27%), Positives = 14/33 (42%), Gaps = 1/33 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECC  33
           M  V+C  C  +     SK+  +    RC +C 
Sbjct  1   MI-VQCDGCNTKFRLDDSKVKGQGVRVRCTKCQ  32


>NJL62767.1 hypothetical protein [Methylacidiphilales bacterium]NJR19027.1 
hypothetical protein [Calothrix sp. CSU_2_0]
Length=183

 Score = 47.5 bits (109),  Expect = 0.003, Method: Composition-based stats.
 Identities = 22/151 (15%), Positives = 54/151 (36%), Gaps = 1/151 (1%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             +L +  +              L  +T +  Q  N        +  +  + +  +   ++
Sbjct  33   FILIVLSVRFPANVLTEIIVHNLPKSTDVLMQTANEVRVSNFISAIFDPIYVGAIIYCVW  92

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                        +M +G+++ G    + +L  + +  G +  IIPG++  + + F   ++
Sbjct  93   QIKRGGAFNYNNAMSVGVQNWGKLFTVNLLAGIFIVLGLIAFIIPGIILAIRYCFATAIV  152

Query  240  ADDNIGGLQ-ALEKSRLLVSGHWWAIFGRFV  269
              +  G     L++S  L  G  W +F    
Sbjct  153  IAEGYGNSSLVLKRSADLTKGKRWELFFITF  183


>WP_165167952.1 hypothetical protein [Nordella sp. HKS 07]QIG48598.1 hypothetical 
protein G5V57_13200 [Nordella sp. HKS 07]
Length=349

 Score = 49.0 bits (113),  Expect = 0.003, Method: Composition-based stats.
 Identities = 31/168 (18%), Positives = 63/168 (38%), Gaps = 0/168 (0%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
              + + LL I+     I  ++    +                       L    +  +  
Sbjct  35   WRVLVPLLLILTFIPVIQFSIEFDLSQGYGEIGSWITVFFDFLRTFCSWLTAVAVVVATD  94

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                +  V L+++     R    + + L+L+ L+   GS+LL++PGLL  +  +   Y  
Sbjct  95   ATSERRSVSLWKAYSTAARRFWPYIVTLVLVNLIALEGSVLLVVPGLLLALALYPAIYAT  154

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
              + +   +AL +S  L +G   +I G  VL  +  L L ++   +  
Sbjct  155  IIEGLRPREALARSHFLTNGQKGSILGSVVLFWIFCLALLWIPLIVVV  202


>WP_162938119.1 hypothetical protein [Kiloniella sp. EL199]
Length=287

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 20/126 (16%), Positives = 48/126 (38%), Gaps = 1/126 (1%)

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
                + +           +L        + +  P  +F + FF    ++  +     +AL
Sbjct  133  YIYDVWVDVFLKVDSSGTILREYPVVLDIAIGAPSYIFSILFFILIPLMVIEKTVFFKAL  192

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA-RIPYVGEAANLAFSLLLTPFSFLYYY  309
            +++  L  G+++AI G F+L ++++     +      ++ E   L   L  T   F+   
Sbjct  193  KRTLELSRGNYFAILGLFLLNMIVASIFGAVIWAFFTFLIEFPALFNLLNDTSTLFVQST  252

Query  310  LIYSDL  315
            +I+   
Sbjct  253  VIFMQW  258


>TAK29963.1 tetratricopeptide repeat protein [Myxococcaceae bacterium]
Length=1324

 Score = 49.8 bits (115),  Expect = 0.003, Method: Composition-based stats.
 Identities = 13/68 (19%), Positives = 21/68 (31%), Gaps = 0/68 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  VRCP C         ++P    + RCP+C  T +    E+         +       
Sbjct  1   MFEVRCPGCQNPFELDERRVPRNGMTMRCPKCQTTFVVKRPETIGFAGAPPSSDTGRSVP  60

Query  61  QRRIPSDR  68
                +  
Sbjct  61  LPMPFAPH  68


>MBI2931399.1 zinc-ribbon domain-containing protein [Planctomycetes bacterium]
Length=75

 Score = 44.8 bits (102),  Expect = 0.003, Method: Composition-based stats.
 Identities = 12/34 (35%), Positives = 15/34 (44%), Gaps = 0/34 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M T+ CP C         KLP    + RCP+C  
Sbjct  1   MQTLTCPECKMGFTVDPKKLPKGSMNVRCPQCGG  34


>HAJ85267.1 hypothetical protein [Rhodobacteraceae bacterium]
Length=109

 Score = 45.9 bits (105),  Expect = 0.003, Method: Composition-based stats.
 Identities = 9/32 (28%), Positives = 14/32 (44%), Gaps = 0/32 (0%)

Query  6   CPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           CP C AE       +P++    +C  C +T  
Sbjct  5   CPKCEAEYKVSDDIIPSEGRDVQCSSCNETWF  36


>GFS46009.1 hypothetical protein Acr_00g0099540 [Actinidia rufa]
Length=214

 Score = 47.8 bits (110),  Expect = 0.003, Method: Composition-based stats.
 Identities = 26/195 (13%), Positives = 56/195 (29%), Gaps = 18/195 (9%)

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
                    +  L       F       + L  +       +    L L +  L+     +
Sbjct  29   PCLRALVTVFHLKVFFIGSFFLFSAILISLGITCIDHPLVLLPIVLTLGVFSLIFLMYLM  88

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            ++   G +  V          ++N  G++AL K+  LV G     F    L  +I   + 
Sbjct  89   VVWELGSVISV---------IEENCYGIEALGKASELVKGKRLLGFALNFLFTLIMALVG  139

Query  280  FLTARI---------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
                 I           V     + F+ L     ++ + ++Y   K  +          +
Sbjct  140  CGFTMIRNDQKWVSTHIVIVFLLIGFNGLAMMLLYMSFTVLYFQCKKIHGEEIELQGSLE  199

Query  331  WLPLTAAIFGWMLIP  345
            +  + +       +P
Sbjct  200  YSKIPSNHLVAEDLP  214


>MBI3818273.1 hypothetical protein [Planctomycetes bacterium]
Length=270

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 33/236 (14%), Positives = 68/236 (29%), Gaps = 20/236 (8%)

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI  167
               + L+    +  +    + + +A    F       +  +      W   I       +
Sbjct  21   FRRFWLYVGMSFFGILPAAIFVGIAAIVAFIVANSAGSGNVTNSFFTWFAIIGGVGGLIL  80

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
            L+ L W TG                 +  +R            +++V     +      +
Sbjct  81   LIPLMWWTGVWTGATYHALAMEATGRRATMRESFEAGRKRPFSMIMVDFVLGISACFCCI  140

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL-----------LLVISL  276
               +F+     +A ++I    AL +S +L  G  W +FG F++           L     
Sbjct  141  PIPFFWPAPPAVALEDINWTGALGRSSMLTEGRRWQVFGLFLVMLLLGMVLTMPLTAPIA  200

Query  277  TLSFLTA---------RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
             +  +            +         A S ++  F+     L Y DL+    G  
Sbjct  201  IIEVVFGRNGLTPGLLILTVFLRMIQFAVSYIMGIFTRAAQTLCYLDLRVRKEGLD  256


>MBI4836386.1 hypothetical protein [Candidatus Abawacabacteria bacterium]
Length=290

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 24/152 (16%), Positives = 50/152 (33%), Gaps = 8/152 (5%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              + +++  L  +   +   +  T V                + L  +L +V     L  
Sbjct  128  IAIYFLIPMLLVIAVMILYAVLNTIVSSGVM------DYSFASGLNSILGIVGTIALLWA  181

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I+  +L      F  + LA       +A ++S  +  G W  +    +   +++  + F+
Sbjct  182  ILSIILRGPRTIFAFHFLAIKKYSAKEAFQESLQITKGRWGKMILHILAFGIVAFVVQFI  241

Query  282  TARIP--YVGEAANLAFSLLLTPFSFLYYYLI  311
               IP   +          LL     +Y YL 
Sbjct  242  VNLIPDGTLAMILQAIVEALLLSTWVIYTYLF  273


>KXI13466.1 cation diffusion facilitator family transporter [Peptostreptococcus 
anaerobius]
Length=358

 Score = 49.0 bits (113),  Expect = 0.003, Method: Composition-based stats.
 Identities = 23/192 (12%), Positives = 61/192 (32%), Gaps = 3/192 (2%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
            ++   G  ++ + L+          +      A       + +  +     +  +++   
Sbjct  74   IYITLGMSIISVILVTGQTNLTFNIAKGKEWRAGNFFVGIKEYLRSFGYNILIGLMVVAI  133

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
             +     I        + +   L L           + + V     ++ I+  +      
Sbjct  134  EIVFVFLIVSNSVAALIGKPSNLDLAQSIKHMHPENIGLAVGFLLVMVFILILIDLMYSM  193

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
                 +  + +IG +++++ SR L+      +FG ++  +  S  LS LT  I  +    
Sbjct  194  VIYIILEKNSDIGIIKSMKYSRKLMKKRKAKLFGLYLSFIGWS-LLSILTLGIGIL--FL  250

Query  293  NLAFSLLLTPFS  304
            N      +  F 
Sbjct  251  NSYIMTSMGIFY  262


>NUS34197.1 hypothetical protein [Gemmatimonadaceae bacterium]
Length=61

 Score = 44.4 bits (101),  Expect = 0.003, Method: Composition-based stats.
 Identities = 12/31 (39%), Positives = 14/31 (45%), Gaps = 0/31 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           V CP C +      SK+PA    ARC  C  
Sbjct  3   VVCPECRSLFRVDPSKVPAASVRARCSVCGG  33


>WP_146842793.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Cellulomonas composti]GEL95139.1 hypothetical 
protein CCO02nite_17970 [Cellulomonas composti]
Length=477

 Score = 49.4 bits (114),  Expect = 0.003, Method: Composition-based stats.
 Identities = 18/130 (14%), Positives = 40/130 (31%), Gaps = 18/130 (14%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
             V     +  ++  +   V        L  +       +++S LL    +W + G ++L 
Sbjct  336  FVGLVAGVAYLLALVWVSVRTLLVTPALMLEERPFWPTIKRSWLLTRRSFWRLLGIYLLT  395

Query  272  LVISLTL-------SFLTARIPY-----------VGEAANLAFSLLLTPFSFLYYYLIYS  313
             ++           + +   I +           V    ++    L T F      L+Y 
Sbjct  396  SIMVGIAAEIIVYPAAIIGLIAFDGDPTSLGAIAVNSVGSVIAQTLTTVFLSSVVALLYI  455

Query  314  DLKANYRGPQ  323
            D++    G  
Sbjct  456  DVRMRREGLD  465


>XP_026450656.1 uncharacterized protein LOC113350748 [Papaver somniferum]
Length=221

 Score = 48.2 bits (111),  Expect = 0.003, Method: Composition-based stats.
 Identities = 25/182 (14%), Positives = 58/182 (32%), Gaps = 2/182 (1%)

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG  215
               I+       +  +     S FI    T + +   + L    +         ++ +V 
Sbjct  12   FKKIMSVVPKVWMRLMITFFWSFFIVFVSTLLTVGLMVLLAFIIIEPDEGEGRAILFLVL  71

Query  216  GGSLLLIIPG--LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
             G + L      +     +     +   + I GL+AL+KS+ L+ G  W     FV+L +
Sbjct  72   AGIVALTYLVGLVYIGAVWNLACVISVLEKIYGLKALKKSKNLIKGRIWVSSVIFVMLEI  131

Query  274  ISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
                +  + + +   G +  +   ++L    +L   ++                      
Sbjct  132  SFFGILGVFSAVVIHGSSVGIFGKVVLGLLCYLLMTILIHFYLVIQTMIYFVCKSYHHEN  191

Query  334  LT  335
            + 
Sbjct  192  ID  193


>WP_191390328.1 DUF975 family protein [Candidatus Alangreenwoodia gallinarii]
Length=329

 Score = 49.0 bits (113),  Expect = 0.003, Method: Composition-based stats.
 Identities = 25/183 (14%), Positives = 58/183 (32%), Gaps = 13/183 (7%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                +L +Y+      F+    +L L+         +            ++   L  +  
Sbjct  95   MMSSVLSLYIFITAGPFSLSLCSLCLRILRNRKYSTKTIFSGFSEFGKGFLTYLLVAIFT  154

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             ++  +      +  S  +      +  L  +L  ++V           ++F + +    
Sbjct  155  FLWTLVFMIPGSMVISAGIVAGSSFAIFLSSLLFFIIVIAL--------VIFLMRYELAF  206

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
            ++ AD+NI   +A+ KS  L+ G+    F   +     S     L   +P          
Sbjct  207  FIAADENITVREAVSKSVRLMKGNISNYFLMML-----SFLPWILLTAVPVFFALVTFFL  261

Query  297  SLL  299
            +L 
Sbjct  262  ALT  264


>OQX56310.1 hypothetical protein B5M53_02150 [Candidatus Cloacimonas sp. 
4484_209]
Length=264

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 13/72 (18%), Positives = 18/72 (25%), Gaps = 1/72 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V CP C  +     +K+P   +  +CP C             T T            
Sbjct  1   MI-VECPKCKKKYKIDETKVPVGGAPVKCPNCLNIFTVYREPLDITLTPIETEKTAVPQE  59

Query  61  QRRIPSDRLEIQ  72
                    E  
Sbjct  60  PFAEHIKETETP  71


>TMB02236.1 hypothetical protein E6J64_17535 [Deltaproteobacteria bacterium]
Length=628

 Score = 49.4 bits (114),  Expect = 0.003, Method: Composition-based stats.
 Identities = 8/32 (25%), Positives = 11/32 (34%), Gaps = 0/32 (0%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECC  33
              V+CP C         K+  +    RC  C 
Sbjct  149  VVVQCPSCQTRFRVADEKVGDRGVRVRCSSCK  180


>KAF8675194.1 hypothetical protein HU200_047860 [Digitaria exilis]
Length=1238

 Score = 49.8 bits (115),  Expect = 0.003, Method: Composition-based stats.
 Identities = 34/265 (13%), Positives = 73/265 (28%), Gaps = 11/265 (4%)

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            L  +  +  + I    + +     L +  V + T   + ++L+     +   +    F +
Sbjct  135  LLTLAFTYVLEITYLVLIVGMVAFLTIVFVMTMTKHYLAMLLLDSLLIIAACVFFAYFSI  194

Query  231  WFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS--------FL  281
                   V   +    G  A+ K+  L+ G  W      V+  V +   S        + 
Sbjct  195  ILSLSTVVAVAEPGCHGAGAVVKAWRLMKGKRWRAILLIVVTGVPAAAFSPVHTLAKTYA  254

Query  282  TARIP--YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIF  339
             + I    +           L  F+       Y + K +        IK  + P T    
Sbjct  255  LSNIASGLLLGFLYTILMGALGLFATCAMTAFYYECKGSTEASAMEYIKLGFEPATFPSR  314

Query  340  GWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADY  399
                         S +     +   A      R   + + +   +    + P R++    
Sbjct  315  DLAAFLPFARAKSSSRAAFLMKFPLASPRTALRAPPRREGSRTPSYGSGKSPLRVAGGKQ  374

Query  400  KLLLSKQRKTTSEGGLSLGPVTLFA  424
              LL       +       P++   
Sbjct  375  DALLRVGGSGWAVQVSIAPPLSSAP  399


>MBF0622691.1 zinc-ribbon domain-containing protein [Magnetococcales bacterium]
Length=321

 Score = 49.0 bits (113),  Expect = 0.003, Method: Composition-based stats.
 Identities = 9/77 (12%), Positives = 24/77 (31%), Gaps = 1/77 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V+C +C +        +       +C +C         E+++  + D+  +      
Sbjct  1   MI-VQCDNCSSRFRLDDGLIGPNGRELKCAKCRHVFFQPSPEAEKKPSDDSPTSDSEQKQ  59

Query  61  QRRIPSDRLEIQSKTVN  77
            +  P    +   +   
Sbjct  60  PQPEPKTSEQETHQEHP  76


>WP_175478831.1 zinc-ribbon domain-containing protein [Rubrimonas cliftonensis]SEA37439.1 
MJ0042 family finger-like domain-containing protein 
[Rubrimonas cliftonensis]
Length=195

 Score = 47.8 bits (110),  Expect = 0.003, Method: Composition-based stats.
 Identities = 8/27 (30%), Positives = 10/27 (37%), Gaps = 0/27 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARC  29
            + CP C A    P   +P      RC
Sbjct  2   QITCPACAAVYEAPDDAIPEGGRMVRC  28


>WP_146675064.1 hypothetical protein [Pirellula sp. SH-Sr6A]AMV30655.1 hypothetical 
protein VN12_00970 [Pirellula sp. SH-Sr6A]
Length=560

 Score = 49.4 bits (114),  Expect = 0.003, Method: Composition-based stats.
 Identities = 39/312 (13%), Positives = 75/312 (24%), Gaps = 11/312 (4%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            VRCP C +E   P  K+P K+ +         L  + A+ +  ++  N          R 
Sbjct  234  VRCPDCFSEFIVPEPKIPKKQRTV-------VLDHEIADVKFVRSEGNSVREASSSKART  286

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
                           +           + + +          L+                
Sbjct  287  DEMLEKARAEVDAEQKELEGLTASFDSQRWISLLFWFCRDPLLVFIMIFFGLFCAIWFPM  346

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            +     +         +L      +           L   V   +               
Sbjct  347  VTAAPELFQATERLGVILQAVIFLVPCVPVFGSLLALGMCVLSTVANQYHRIQDWPFTRT  406

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
                G    +   L        ++   +  + G  +   + GL+     F    +   DN
Sbjct  407  GEMAGEVIMVLASLAIASIPGGMVGSGLASIAGSHITTALFGLISTWLLFPFFLLSMADN  466

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP----YVGEAANLAFSLL  299
                +   K+           +G   L  +++    F+          VGE A   F   
Sbjct  467  NAITEPFSKNVFESFKAKPDAWGAMYLQTMLAYGFLFVLNAWALRDGIVGEIAIGLFMPF  526

Query  300  LTPFSFLYYYLI  311
               F F  Y L+
Sbjct  527  TVLFVFNQYGLL  538


>MBF0530743.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=98

 Score = 45.5 bits (104),  Expect = 0.003, Method: Composition-based stats.
 Identities = 11/32 (34%), Positives = 15/32 (47%), Gaps = 0/32 (0%)

Query  6   CPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           CP CG   + P +K+P K +  RC  C     
Sbjct  5   CPQCGHIYDIPDNKIPEKGTFGRCRLCGTRFP  36


>NMO23222.1 hypothetical protein [Pyxidicoccus fallax]
Length=193

 Score = 47.5 bits (109),  Expect = 0.003, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+C  C      P  K+  K    RC +C  T 
Sbjct  1   MI-VKCERCQTRFKIPDEKVTEKGVKVRCTKCQNTF  35


>QNJ06529.1 putative membrane protein [Synechococcus sp. MEDNS5]
Length=101

 Score = 45.5 bits (104),  Expect = 0.003, Method: Composition-based stats.
 Identities = 23/100 (23%), Positives = 33/100 (33%), Gaps = 11/100 (11%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              G L   +PG+   V + F    + D     + AL +SR LV+ HW+ I          
Sbjct  1    MLGLLTFAVPGIYLAVAYIFSGMAMVDRPQSFVDALNQSRRLVTPHWFDI----------  50

Query  275  SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
                      I  +G  A L    +  P  F      Y  
Sbjct  51   -GLFLLAVVGIVLLGYLACLVGGFVSVPVGFCMIGAAYDQ  89


>KIF00650.1 hypothetical protein PL81_40040 [Streptomyces sp. RSD-27]
Length=105

 Score = 45.5 bits (104),  Expect = 0.003, Method: Composition-based stats.
 Identities = 17/80 (21%), Positives = 29/80 (36%), Gaps = 2/80 (3%)

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            + +G + AL +S  LV G WW         +V +   ++       +  A +    L   
Sbjct  13   EGLGPVAALRRSAGLVRGAWWRTLAVTAPAVVFTGAAAYALLLWFGLPGALSALVLLPAF  72

Query  302  PFSFLYYYLIYSDLKANYRG  321
            P   L   L+Y D +     
Sbjct  73   P--QLPAGLLYVDRRIRREH  90


>NCU25591.1 hypothetical protein [Candidatus Nomurabacteria bacterium]
Length=274

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 18/168 (11%), Positives = 48/168 (29%), Gaps = 0/168 (0%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L   + +G       I   +                +  L      +L     +  +   
Sbjct  32   LFTAFPIGEQPDIQAILVFIQSGELKDFPSVTPANIYYALSFLGFSMLTAFFSVIYATCF  91

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
               K      + +   +R + S  + +++LI+     ++   IP +      F    ++ 
Sbjct  92   IAEKDGFPARKGIIDSVRKIPSLIVFVLILIVPAMISAIFAFIPLIYLYYSLFVAAVLIT  151

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
            +       A+ +S     G+   IF   +++         +   +   
Sbjct  152  EGKQSLFSAMSESFRYTKGYKLNIFFTQMVVYFAVNIPMSIFEALFIF  199


>MBI2002721.1 zinc-ribbon domain-containing protein [candidate division NC10 
bacterium]
Length=52

 Score = 44.0 bits (100),  Expect = 0.003, Method: Composition-based stats.
 Identities = 11/37 (30%), Positives = 16/37 (43%), Gaps = 0/37 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
            T+ CP CG         +P +    +CP+C  T  F
Sbjct  4   ITITCPACGRSGAVDERAVPDRPMRLKCPQCQGTFTF  40


>MBI4854132.1 zinc-ribbon domain-containing protein [Acidobacteria bacterium]
Length=894

 Score = 49.8 bits (115),  Expect = 0.003, Method: Composition-based stats.
 Identities = 9/40 (23%), Positives = 14/40 (35%), Gaps = 0/40 (0%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
             + CP C  E     S++P       C  C   +   P +
Sbjct  151  KITCPKCQHEYRIDPSRIPKGGGKFTCRNCQARIEVRPNQ  190


>WP_199260036.1 zinc-ribbon domain-containing protein, partial [Paracoccus sp. 
wg1]
Length=243

 Score = 48.2 bits (111),  Expect = 0.003, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 13/37 (35%), Gaps = 0/37 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
            + CP CGA+     S +PA      C  C       
Sbjct  2   RLTCPRCGAQYEIAESAIPALGREVECSACSHVWFQP  38


>WP_124057337.1 DUF975 family protein [Vaginisenegalia massiliensis]
Length=275

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 21/169 (12%), Positives = 50/169 (30%), Gaps = 8/169 (5%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             +  L   L+ + +    L +   +           ++          +      +    
Sbjct  30   FLVGLVACLSLSLVLVPPLKQELIFGINL-DFLSDWVVFFLGILTFPLVFLGIKCIDQPE  88

Query  183  CKTDVGLFR--SMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                +      +             + +L  L V   ++LL+IPGL+  + +     +  
Sbjct  89   TNQTLDYKDGLTWIENTSDFIDLVAMSVLYFLFVTLWTILLVIPGLIKALSYSQALPLYF  148

Query  241  DDNI-----GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            D          LQA + S  L+ G+   +   F+  +   +   F+   
Sbjct  149  DAKKAKQPISYLQAFKISTQLMKGNKTQLLVLFLSFIGYGILFVFVWGI  197


>MBA3550358.1 hypothetical protein [Nannocystis sp.]
Length=251

 Score = 48.2 bits (111),  Expect = 0.003, Method: Composition-based stats.
 Identities = 23/188 (12%), Positives = 57/188 (30%), Gaps = 10/188 (5%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLIL---LILVVGGG  217
               +  + LGL+ +   +        + +                 + L   +   +   
Sbjct  62   QLGLMILPLGLAAILAMLLTLAQLAAMTIMMPALYRYVLGAYLGQPVDLRATVQDQIANA  121

Query  218  SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
              +++   +   V   F   +   +       ++++  L+S +   I G    + + +  
Sbjct  122  KDVIVNCFVPAIVLGVFAGPIYFVEGKKLGDVIKRNFELLSRNLVPILGAVFGVAIAAGV  181

Query  278  LSFLTARI-------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
            L FL + I         +          ++T +   Y   +Y DL+  + G       R+
Sbjct  182  LIFLFSWILGLLPAGGLLASLFANLVVAIVTAYFASYSVAMYFDLRRRFEGGDPEGEARE  241

Query  331  WLPLTAAI  338
             L L    
Sbjct  242  RLSLALPP  249


>WP_057624075.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Candidatus Berkiella cookevillensis]KRG18956.1 
Membrane domain of glycerophosphoryl diester phosphodiesterase 
[Candidatus Berkiella cookevillensis]
Length=322

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 15/97 (15%), Positives = 29/97 (30%), Gaps = 9/97 (9%)

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
              + F V       ++ + N     A+++S  L  G +W  F       +  +   F+ +
Sbjct  215  WFIYFSVRLALIIPLVVNQNKNPFTAVKESFRLTKGRFWKTFVVIFGAALPYMLAFFIFS  274

Query  284  RI---------PYVGEAANLAFSLLLTPFSFLYYYLI  311
             +               A L   L+L P         
Sbjct  275  TVCSLVFPEYAGIALGIAVLIVQLVLAPIIPATITAY  311


>WP_026699054.1 hypothetical protein [Bacillus chagannorensis]
Length=269

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 23/146 (16%), Positives = 52/146 (36%), Gaps = 5/146 (3%)

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
                      +  +     + L +       ++  +L         L +I  ++F   F 
Sbjct  118  FLQDGRRLFGRFLLVNVLVILLLVLGALLVLIVSSILPAAGIVLITLGVIAAIVFRYVFI  177

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-----PYV  288
            F +Y +A +++  ++AL +SRL+   +     G  V+ ++I++ LS     +      YV
Sbjct  178  FWEYTIAAEDMPVIEALGRSRLIRRRNDATTIGLLVVFILINVVLSITLNLLINVPVLYV  237

Query  289  GEAANLAFSLLLTPFSFLYYYLIYSD  314
                N      +      +Y  +   
Sbjct  238  MTILNAVIMTGVGLAFMSHYAELRRQ  263


>MBC6444239.1 zinc-ribbon domain-containing protein [Alphaproteobacteria bacterium 
GM202ARS2]
Length=214

 Score = 47.8 bits (110),  Expect = 0.003, Method: Composition-based stats.
 Identities = 11/54 (20%), Positives = 17/54 (31%), Gaps = 1/54 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIAT  54
           M  + CP C  + +   S +P      RC +C       P  S       +   
Sbjct  1   MI-ISCPSCSTDFSVADSMIPQGGRRCRCNQCGHIWDAYPHASSSPPQLHDNPH  53


>NIT04273.1 thioredoxin [Candidatus Saccharibacteria bacterium]
Length=65

 Score = 44.4 bits (101),  Expect = 0.003, Method: Composition-based stats.
 Identities = 10/29 (34%), Positives = 15/29 (52%), Gaps = 0/29 (0%)

Query  6   CPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           CP C A+   P  K+  + S  +CP+C  
Sbjct  1   CPKCKAKLKIPDEKIKPEGSKFKCPKCQT  29


>PPD77655.1 hypothetical protein GOBAR_DD25419 [Gossypium barbadense]
Length=790

 Score = 49.4 bits (114),  Expect = 0.003, Method: Composition-based stats.
 Identities = 40/365 (11%), Positives = 85/365 (23%), Gaps = 22/365 (6%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
             +L  S     +          L  +  + +R           L +   G  +      L
Sbjct  66   GILVTSMYVQFVSNGFLLGSTWLMMNYYVIVRSFCYNVFTAAFLRICGVGLVV----KFL  121

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
             + V +     +   + + G+ AL  S  L  G     F    +  V  + L F+     
Sbjct  122  EWMVMWNMSIVISTSEEVHGVDALGLSAYLCRGKERRGFWLTFVFFVSRIGLRFVCFNNG  181

Query  287  YVG---EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWML  343
              G           + L         ++Y       R  +      + L           
Sbjct  182  SYGKRWWMILGVSLICLGKVIKWVVCVMYFYHCKEGRLERVDVEVGEHLKRVGKSEYPRK  241

Query  344  IPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLL  403
                      +       L       +     +P     + ++ P  P R S        
Sbjct  242  ----------KAKKRKLDLDWNKLLSKDVADEEPPPPLVVIKAEPHPPPRKSDTMGGGDD  291

Query  404  SKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKV  463
              + +             +   +   +     L  K +        L ++   R      
Sbjct  292  QGKEEFWESLPDYKLEEKILRQQRNLECLGSKLPDKGKKISDQLRLLEEEKRRRTVSRAK  351

Query  464  LDDD-----ARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSIL  518
            ++ D      +          +   H     Q      F       L + T +  +++  
Sbjct  352  MNADECEKPGQSPSSDLVGSSNGFEHQSKSQQAFSQSAFGASFCKKLEENTDSRSLNTFE  411

Query  519  GKLEL  523
              L +
Sbjct  412  KSLSV  416


>TMA25139.1 hypothetical protein E6J78_18525 [Deltaproteobacteria bacterium]
Length=210

 Score = 47.8 bits (110),  Expect = 0.003, Method: Composition-based stats.
 Identities = 30/201 (15%), Positives = 70/201 (35%), Gaps = 3/201 (1%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
              + +  L      A +  A LL               A + A +A  LL  + +T ++ 
Sbjct  6    FHIWLSQLPAFAGVAALLHAPLLLVPFLPPLPRPALVVAFVAAELAIALLVKAALTKAVL  65

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
             +  +     F   +  LR   +  +L   ++      + LL++P L +    F     L
Sbjct  66   DW-QRQLPTEFSEYRDALRTGPAVLVLGTRILARAAVRAFLLVLPALNYLADNFAAVPAL  124

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY--VGEAANLAFS  297
              +     QAL +S  L  G    +F   +++ ++    +     +    +G+ + + F 
Sbjct  125  IVEGGSTGQALLRSEQLTRGVRARVFSICLVIWLVGALWTLCFGVLTGAHLGKMSWMIFY  184

Query  298  LLLTPFSFLYYYLIYSDLKAN  318
            + +      +  ++ +     
Sbjct  185  VCVRALERSFAAVLAAVAYHR  205


>MBI4173361.1 hypothetical protein [Candidatus Aenigmarchaeota archaeon]
Length=229

 Score = 47.8 bits (110),  Expect = 0.003, Method: Composition-based stats.
 Identities = 13/110 (12%), Positives = 40/110 (36%), Gaps = 0/110 (0%)

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
               + + +    + +     + L       L + L+ ++     + L+I G +  +    
Sbjct  119  LIYIHLALAGAALLVVPVFAILLAASSGSFLAIALVSVLGFVWLVALLIIGAVGFLRISL  178

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
              Y L  +    +Q+L  S     G+  ++    ++++ + + +S     
Sbjct  179  SHYYLVLEGKSAVQSLVASWRATKGNVASLLFLLLIVMAVGIAVSVAVGL  228


>KKP91379.1 hypothetical protein UR97_C0002G0045 [Candidatus Nomurabacteria 
bacterium GW2011_GWE2_36_115]KKP94450.1 hypothetical protein 
US00_C0001G0044 [Candidatus Nomurabacteria bacterium GW2011_GWF2_36_126]KKP96912.1 
hypothetical protein US04_C0001G0415 
[Candidatus Nomurabacteria bacterium GW2011_GWD2_36_14]KKP99484.1 
hypothetical protein US08_C0001G0166 [Candidatus Nomurabacteria 
bacterium GW2011_GWF2_36_19]KKQ05660.1 hypothetical 
protein US17_C0002G0044 [Candidatus Nomurabacteria bacterium 
GW2011_GWF1_36_47]KKQ09959.1 hypothetical protein US21_C0001G0067 
[Candidatus Nomurabacteria bacterium GW2011_GWB1_36_6]KKQ13219.1 
hypothetical protein US26_C0002G0044 [Candidatus 
Nomurabacteria bacterium GW2011_GWE1_36_71]KKQ21043.1 
hypothetical protein US34_C0002G0065 [Candidatus Nomurabacteria 
bacterium GW2011_GWC2_36_9]KKQ45045.1 hypothetical protein 
US64_C0003G0048 [Candidatus Nomurabacteria bacterium GW2011_GWC1_37_9]OGJ06024.1 
hypothetical protein A2387_00820 [Candidatus 
Nomurabacteria bacterium RIFOXYB1_FULL_36_10]OGJ11397.1 
hypothetical protein A2467_01265 [Candidatus Nomurabacteria 
bacterium RIFOXYC2_FULL_36_8]OGJ11438.1 hypothetical protein 
A2565_02710 [Candidatus Nomurabacteria bacterium RIFOXYD1_FULL_36_19]HAQ02508.1 
hypothetical protein [Candidatus 
Nomurabacteria bacterium]
Length=258

 Score = 48.2 bits (111),  Expect = 0.003, Method: Composition-based stats.
 Identities = 25/175 (14%), Positives = 61/175 (35%), Gaps = 0/175 (0%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
               ++ F      L L     LN  +           +    L  ++    + + +C   
Sbjct  50   FVSLVFFIINTFFLYLGIKANLNLLDSKGFHPFSREVLPTWPLFWNFFKTYLLLILCILP  109

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            + +     +    +    LL ++ +  +     +LIIPG+      F   Y+  D   G 
Sbjct  110  IIIIPMFIVIAISILPSLLLGVVFLPYLIPVIAILIIPGMYIASRLFPSIYMSIDKGQGA  169

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            + +++ +  +  G+ W IF +  ++ + ++   F       V     +   ++L 
Sbjct  170  VMSIKGAWEITKGYGWYIFWKTFVIGLFAVVGIFALFIGIVVTYPIAMIVIVMLY  224


>PPR28260.1 hypothetical protein CFH38_00110 [Alphaproteobacteria bacterium 
MarineAlpha10_Bin1]
Length=125

 Score = 45.9 bits (105),  Expect = 0.003, Method: Composition-based stats.
 Identities = 10/94 (11%), Positives = 19/94 (20%), Gaps = 1/94 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    CP C       +  L       +C +C      +P      +        P    
Sbjct  1   MIL-NCPECSTRFAIDAQALRPDGRRVQCGKCEHIWFEEPPAPSALEPLSVTPLEPEEQS  59

Query  61  QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFR  94
               P+      +   +    +            
Sbjct  60  TIPTPNLPAITAAVGFDALSHSLEAYCCAFPHPF  93


>HEA94367.1 hypothetical protein [Chloroflexi bacterium]HEI10026.1 hypothetical 
protein [Chloroflexi bacterium]HEN54883.1 hypothetical 
protein [Chloroflexi bacterium]
Length=329

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 23/227 (10%), Positives = 62/227 (27%), Gaps = 0/227 (0%)

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
                       + + A              +       LL  S    ++ I +       
Sbjct  101  CQGALIYLVDAVERGAPLSFSAGLEAGMRSMWRLFFIALLVFSPFLLTVLILLGGALASF  160

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQA  249
              ++           L L+ L+ +      L+I+   L  V     Q  +  +N+G  Q+
Sbjct  161  IAAITGPQELEAPRVLALLALLCLGIPAVCLVIVVAYLLSVLAILAQRAVVLENLGLWQS  220

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
            + +   L+   +W +    ++ + I   +  + A    +        + +  P   +   
Sbjct  221  IRRGWTLLRARFWELVLLSLIWIAIGFLVGIVVALPVALVALPFAMVTGMQEPGMGVIAL  280

Query  310  LIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQN  356
            L+   L           +   +      +    +      ++ +   
Sbjct  281  LLVVALGLVIYAMFISTLNAVFSSALWTLAYRRIAGWQPEIAGAPTP  327


>MBI4411734.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=276

 Score = 48.2 bits (111),  Expect = 0.003, Method: Composition-based stats.
 Identities = 29/135 (21%), Positives = 51/135 (38%), Gaps = 4/135 (3%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                +     +W   +  +      V +   + L +   G F L  +L          +L
Sbjct  100  IGDLFGGWRDAWKYMTALLVYTVISVVINFVISLVIFFFGRFVLGFMLGTASWWIFIFIL  159

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
              P L   + F F  Y +AD  +G  +AL+ S    + + WA+     +  ++ L +S  
Sbjct  160  TWPFLYQLIRFGFFPYFIADQGMGPFEALKASYAGTADYKWAV----TISWLMVLGISLA  215

Query  282  TARIPYVGEAANLAF  296
               IP VG  A+  F
Sbjct  216  NGLIPVVGSFASAFF  230


>WP_146678041.1 zinc ribbon domain-containing protein [Pirellula sp. SH-Sr6A]AMV34085.1 
hypothetical protein VN12_18285 [Pirellula sp. SH-Sr6A]
Length=294

 Score = 48.6 bits (112),  Expect = 0.003, Method: Composition-based stats.
 Identities = 26/290 (9%), Positives = 60/290 (21%), Gaps = 1/290 (0%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              V C  CG   + P           +C    +           +         P     
Sbjct  3    IRVTC-QCGQTLSVPDEMAGKSGRCPKCKGGLKVPASQGKPVAASTKGSPATANPAAPKP  61

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                S       ++      +    L  +         +        D   + C      
Sbjct  62   PSAGSTSKTSPGRSAAPAPSSNLAGLFDDVGLVTKTGKMCPSCDSPLDPRAVLCVHCGFN  121

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            L                              +       L         +  +   + I+
Sbjct  122  LAEGKKLEGFQAKGQKKFGNKHLNEAAEMMEREQATEKRLLGAGAPWWMMFSILAGIVIF  181

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            I    + +  +       +     +     L V  GS    +  +              +
Sbjct  182  IAGLAIKMDAATSGKSSSIELLRRIQSATYLTVMSGSFGAAMVAISIFASLAILITAFKE  241

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
                GL ++     ++   +  +F + ++  VI   +S +   I      
Sbjct  242  SAKQGLLSMFVPFYILYYMFSRLFSKHLVTTVIIYWVSSILGGILLGYAM  291


>WP_081994941.1 zinc-ribbon domain-containing protein [Paracoccus sp. PAMC 22219]
Length=205

 Score = 47.5 bits (109),  Expect = 0.003, Method: Composition-based stats.
 Identities = 11/40 (28%), Positives = 13/40 (33%), Gaps = 0/40 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
             + CP C AE   P   +P       CP C       P 
Sbjct  4   FKLICPGCRAEYAVPPDAIPQGGREVECPACGHVWQAHPP  43


>WP_175481698.1 zinc-ribbon domain-containing protein [Maribius pelagius]SEN34310.1 
MJ0042 family finger-like domain-containing protein [Maribius 
pelagius]
Length=364

 Score = 49.0 bits (113),  Expect = 0.003, Method: Composition-based stats.
 Identities = 9/37 (24%), Positives = 16/37 (43%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  + CP+C A+       +P++    +C  C  T  
Sbjct  1   MRLI-CPNCEAQYEIDPGLIPSEGRDVQCSNCGNTWF  36


>MAX17452.1 hypothetical protein [Nitrospina sp.]
Length=78

 Score = 44.8 bits (102),  Expect = 0.004, Method: Composition-based stats.
 Identities = 9/45 (20%), Positives = 20/45 (44%), Gaps = 0/45 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQ  47
            + CP C A  +    ++P + +  +C +C Q+ +  P   +   
Sbjct  2   KISCPDCQASYDIDLPEIPKEGTQVKCAKCQQSFLVMPESREENN  46


>MBI5583343.1 ankyrin repeat domain-containing protein [Deltaproteobacteria 
bacterium]
Length=397

 Score = 49.0 bits (113),  Expect = 0.004, Method: Composition-based stats.
 Identities = 26/246 (11%), Positives = 58/246 (24%), Gaps = 21/246 (9%)

Query  387  LPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFP  446
               +P +    +    +   R+   +     G                       +   P
Sbjct  19   AYAQPPKAVLLNPPQYVENPRRPAFKSMAGPGVRNGLKVVSGRSSAALEFNTPKVVVYLP  78

Query  447  NLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLR  506
             +  +          ++ D   + +   +                     FS    +  +
Sbjct  79   EIDNSAYALVEFGDPELFDSRGQGIPFERTGGTDER-------------TFSREIQLRKK  125

Query  507  QGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQI---GGKQLILQRLGSNAVTLR  563
             G+ A +   + G  ++  PL I +    +             G  +         +   
Sbjct  126  TGSGALEFSKVRGAGKIRYPLEIATRVFKKGKGPGGPGDPVFDGPYVSYPDPNIEEMFF-  184

Query  564  FLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDA-FSLRQMFDGNIESITVLVAGDSMTQ  622
                   L  V A ++    L    F    + D        F G I  + + V       
Sbjct  185  ---LNPALGPVRAYDASGRRLVRDAFNQTTTQDGRNRRTLAFYGEIAELQIDVVKKWAEL  241

Query  623  SYPFEL  628
             + +EL
Sbjct  242  EFTYEL  247


>NNE56980.1 DUF3426 domain-containing protein [Hellea sp.]
Length=264

 Score = 48.2 bits (111),  Expect = 0.004, Method: Composition-based stats.
 Identities = 10/66 (15%), Positives = 13/66 (20%), Gaps = 1/66 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    CP C    +T    +     + RC  C  T                 A       
Sbjct  3   MIL-TCPDCATRYSTTPEAIGPNGRTVRCTNCSATWFVSSEPDIMELQEQERAEDIIRET  61

Query  61  QRRIPS  66
                 
Sbjct  62  PPEERY  67


>KPJ79127.1 hypothetical protein AMJ54_00235 [Deltaproteobacteria bacterium 
SG8_13]
Length=421

 Score = 49.0 bits (113),  Expect = 0.004, Method: Composition-based stats.
 Identities = 12/39 (31%), Positives = 18/39 (46%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  V+CP C +      SK+PA+ +  RC +C       
Sbjct  1   MV-VKCPTCQSGYQIDESKIPARGAYTRCRKCQTRFKVQ  38


>NLY88175.1 hypothetical protein [Firmicutes bacterium]
Length=322

 Score = 48.6 bits (112),  Expect = 0.004, Method: Composition-based stats.
 Identities = 26/160 (16%), Positives = 53/160 (33%), Gaps = 23/160 (14%)

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            V     M   +  +    +L  +L +   G +LL  +  ++    F     VL  ++   
Sbjct  132  VSFSLMMLGFVVRLFLQLVLNNILHIGDVGFALLSNLVAVVLSFLFSLSTVVLFLEDKKT  191

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLV--------ISLTLSF---------------LTA  283
               + ++  L+ GH W + G ++L+++        + L   F                  
Sbjct  192  FATIRRAFTLMGGHRWRLAGTYLLVILLAYVIMLILYLLAVFPAVLFIFLGARYELIAFY  251

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
             I  +   A++    +L  +       IY DL     G  
Sbjct  252  IIGGLLGLASILLFNILLIYEHGPLTCIYYDLLIRKEGYD  291


>WP_020918113.1 zinc-ribbon domain-containing protein [Cystobacter fuscus]EPX61889.1 
MJ0042 family finger-like domain/tetratricopeptide repeat 
protein [Cystobacter fuscus DSM 2262]
Length=1350

 Score = 49.4 bits (114),  Expect = 0.004, Method: Composition-based stats.
 Identities = 9/35 (26%), Positives = 13/35 (37%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            V CP C    N    ++P   +  +C  C  T  
Sbjct  2   KVSCPSCQTNYNIDDKRIPPGGAKLKCARCQNTFP  36


>WP_183377250.1 hypothetical protein [Helcobacillus massiliensis]MBB3023953.1 
hypothetical protein [Helcobacillus massiliensis]
Length=305

 Score = 48.6 bits (112),  Expect = 0.004, Method: Composition-based stats.
 Identities = 23/269 (9%), Positives = 73/269 (27%), Gaps = 5/269 (2%)

Query  32   CCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPER  91
              Q       +           +           S        +        +     + 
Sbjct  20   WGQYPSAPQQQGAAPSWAAGGGSSAPSAPFSAPQSAPQPAAGPSTGPSGYGTAPMSSSDD  79

Query  92   EFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQ  151
                       ++     +W  F         I ++G V+    +  A  +      +  
Sbjct  80   LGAKISRTFSWMADAFGRNWVAFIVPSLVYSIIPVIGGVILLFGMGLAGAVSAGASSSES  139

Query  152  NQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVG---SFTLLLI  208
                    ++ ++   ++ +  ++      +        R  K  +          L ++
Sbjct  140  GGAVFGVTMIVSMLVFMVAMFAVSALWMSGMANVAAMSARGEKASVAQGFVGARLLLPIL  199

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
            L +++     ++ +I  +   V   F   ++A + +   +A  +S  LV  +   +    
Sbjct  200  LTLVLGMVSWIIPVIGPIAVAVLTTFVLPLVALEGLSTGEAFSRSASLVIANLGLVILVT  259

Query  269  VLLLVISLTLSFLTARIPYVGEAANLAFS  297
            +++ +    L++L   +P +         
Sbjct  260  LIVTIAMSILAWLI--LPALASVIAGVLL  286


>EUA73961.1 hypothetical protein I540_0458 [Mycobacteroides abscessus subsp. 
bolletii 1513]CQA05005.1 proline and glycine rich membrane 
protein [Mycobacteroides abscessus]
Length=87

 Score = 44.8 bits (102),  Expect = 0.004, Method: Composition-based stats.
 Identities = 14/83 (17%), Positives = 33/83 (40%), Gaps = 7/83 (8%)

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
             +   D  +G + AL+ S  LV  +     G+ +++ +I+L ++F    + +        
Sbjct  1    MFFAIDRGLGPVDALKASFQLVKDN----LGQALVVFLITLGVAFGGFALTF---ITCGL  53

Query  296  FSLLLTPFSFLYYYLIYSDLKAN  318
              ++  P +     LI+      
Sbjct  54   GGIIAYPAAGALTGLIHVYTYRR  76


>HCN04818.1 hypothetical protein [Bacteroidetes bacterium]
Length=293

 Score = 48.2 bits (111),  Expect = 0.004, Method: Composition-based stats.
 Identities = 23/149 (15%), Positives = 53/149 (36%), Gaps = 0/149 (0%)

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
            +   L+    +         K DV     M+   +      +   ++ L+V    + +II
Sbjct  99   LVCWLVMSLSLCSIRQYRDGKGDVNSTEVMRHFRQIFFQALVSFFVVGLIVAFSLVGVII  158

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
              ++  V+       +  +++   QA+ +S  LV   WW   G  +LL V+     +  +
Sbjct  159  GAVIALVFLALQPAAIVFEDLSVFQAIRRSFELVQNSWWFTLGVLLLLFVLQSVPVYAVS  218

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             +  +         +L    +F     ++
Sbjct  219  FVSGILIGLRAIGGILDDSVAFDVITAVF  247


>WP_009108026.1 zinc-ribbon domain-containing protein [Desulfovibrio sp. U5L]EIG53219.1 
Protein of unknown function (DUF3426) [Desulfovibrio 
sp. U5L]
Length=331

 Score = 48.6 bits (112),  Expect = 0.004, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 18/36 (50%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+CP+C ++   P  K+ A  +  RC +C    
Sbjct  1   MI-VQCPNCHSKFKLPDDKVVAAGTRLRCGKCRTVF  35


>MBI4666505.1 zinc-ribbon domain-containing protein [Nitrospinae bacterium]
Length=297

 Score = 48.2 bits (111),  Expect = 0.004, Method: Composition-based stats.
 Identities = 10/44 (23%), Positives = 17/44 (39%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  V+CP C A     +  +P +   A+C +C           +
Sbjct  1   MI-VKCPRCSARYKIDAQTIPDEGMYAKCAKCENVFFARKRSDE  43


>TMA29198.1 hypothetical protein E6J88_07330, partial [Deltaproteobacteria 
bacterium]
Length=358

 Score = 48.6 bits (112),  Expect = 0.004, Method: Composition-based stats.
 Identities = 10/38 (26%), Positives = 16/38 (42%), Gaps = 0/38 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
            VRC  C A      +++  +  + RC +C  T    P
Sbjct  59  EVRCDKCQARYRVDDARIGPQGLTMRCGKCQNTFKVMP  96


>XP_037917966.1 glutenin, high molecular weight subunit DX5-like [Hermetia illucens]CAD7086547.1 
unnamed protein product [Hermetia illucens]
Length=876

 Score = 49.4 bits (114),  Expect = 0.004, Method: Composition-based stats.
 Identities = 32/297 (11%), Positives = 58/297 (20%), Gaps = 22/297 (7%)

Query  320  RGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQ  379
                                       +                  G+D           
Sbjct  386  EPDTEVSNLPSNSQTPPPQGIPNTPQQVQSPQGEYYPSGPPGGAPQGQDQTTGYQNSQGP  445

Query  380  TPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLK  439
             P+     P + Q L +   +     +   T   G      +  +               
Sbjct  446  PPESFPDFPTQSQGLPAEGEQSAPQSENPQTEPQGEVPPDQSFQSPSQDQFQDISSQGQG  505

Query  440  LELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSG  499
            L       L+  Q+G        +             +F+      +   Q  E     G
Sbjct  506  LPPQGSQALNSQQQGQPP---SDLQIPSGPQQQSAGQAFDGLPPQTLNSEQGTEGQPPQG  562

Query  500  IRSIYLRQGTQAEQVHSILGK--------LELTLPLAIESLQLTRNDIGKTLQIGGKQLI  551
             ++       Q  Q     G         L+   P      ++  +   ++         
Sbjct  563  QQNEPYPSEEQVPQGQQASGTPAQGEQPALQSENPQTEPQGEVPPDQSFQSPSQD----Q  618

Query  552  LQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNI  608
             Q + S    L   G +         NS  +                S  Q FDG  
Sbjct  619  FQDISSQGQELPPQGSQ-------PLNSQQQEQPPSDLQIPSGPQQQSAGQAFDGLP  668


>MBI3014852.1 zinc-ribbon domain-containing protein [Candidatus Tectomicrobia 
bacterium]
Length=389

 Score = 48.6 bits (112),  Expect = 0.004, Method: Composition-based stats.
 Identities = 11/39 (28%), Positives = 16/39 (41%), Gaps = 0/39 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
            + CP C         K+P      RCP+C   ++ DP 
Sbjct  2   ELSCPQCATRYRASEEKIPPGGGKVRCPKCEAMIVIDPQ  40


>MBI1948748.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=1173

 Score = 49.4 bits (114),  Expect = 0.004, Method: Composition-based stats.
 Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP C A+ N    ++P +  S  CP C  T 
Sbjct  2   QISCPQCQAQYNVDEGRIPPQGVSINCPRCKHTF  35


>MBF0371418.1 zinc-ribbon domain-containing protein [Magnetococcales bacterium]
Length=73

 Score = 44.4 bits (101),  Expect = 0.004, Method: Composition-based stats.
 Identities = 9/44 (20%), Positives = 14/44 (32%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  V+C +C  +     + L  K    +C  C       P    
Sbjct  1   MI-VQCTNCDKQFEVDEAVLGPKGRKLKCSNCKTVFFQGPPAPD  43


>WP_072832764.1 hypothetical protein [Clostridium collagenovorans]SHI11592.1 
hypothetical protein SAMN02745196_02956 [Clostridium collagenovorans 
DSM 3089]
Length=270

 Score = 48.2 bits (111),  Expect = 0.004, Method: Composition-based stats.
 Identities = 26/231 (11%), Positives = 81/231 (35%), Gaps = 29/231 (13%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
               G+L +  + ++       + ++       +         ++LA V    +  +    
Sbjct  35   CIMGILLVLPMIVLGMLMVPLAGMITFTLISGSTNAIGSIILLVLAIVFIFGILGNIFMV  94

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG---------------------  215
            ++   I + +  +       ++        + +L+L++                      
Sbjct  95   AVVKLIHEKEGYMGYGPLDAIKFAMGKIKTITVLVLLIVAISISATIGVGLIAAILTLIS  154

Query  216  -----GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
                    ++ ++  ++  ++  F    +A   +G + AL+ S  +V G +  +F + ++
Sbjct  155  KTVAGIFLVVAVLASIMLFIYVTFSYQAIAIHELGAIDALKYSINIVRGRFGNVFCKLLV  214

Query  271  LLVISLTLSFLTARIPYVGEAANLAFSLLLTP---FSFLYYYLIYSDLKAN  318
            + VI   + ++   I     AA+   S++      +  +Y+ L++ D    
Sbjct  215  VAVIGGIIDYILGLIFGDAGAASFIASIITAIISQYQVIYHTLMFKDYDQE  265


>WP_101357826.1 hypothetical protein [Raineya orbicola]PKQ70336.1 hypothetical 
protein Rain11_0564 [Raineya orbicola]
Length=318

 Score = 48.6 bits (112),  Expect = 0.004, Method: Composition-based stats.
 Identities = 23/213 (11%), Positives = 56/213 (26%), Gaps = 24/213 (11%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
                 +         +  +    +   F                 Q     +      + 
Sbjct  27   NFLHFFATILITHLPIFALIYWFVNSLFTTNLIVNFFSVVVESPAQFFTTVFIFGFIGLL  86

Query  166  YILLGLSWMT--GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
                 +S       +          +   ++   R+  S  +   +  L+     ++   
Sbjct  87   AYTYSVSIFLNYYLLSENDRSKRPSIIEVLQATFRNYYSVFVTYAVYGLLFLVLMIITTF  146

Query  224  PGL-------LFCVWFFFCQYVLAD---------------DNIGGLQALEKSRLLVSGHW  261
                      +  + F    YV +                +   G  AL+KS  LV G+W
Sbjct  147  LNFALVKLSGVLSILFNLFFYVFSIFISNSLALVGAVTIKEQATGTTALKKSWNLVKGYW  206

Query  262  WAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
            W  FG  V++ +++  + ++   +  +    N 
Sbjct  207  WQTFGIRVVVAILAYLIFYVLMYLMGLFFIGNS  239


>MBI4821319.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=750

 Score = 49.4 bits (114),  Expect = 0.004, Method: Composition-based stats.
 Identities = 9/33 (27%), Positives = 14/33 (42%), Gaps = 1/33 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECC  33
           M  V+C +C  +   P  K+    +  RC  C 
Sbjct  1   MI-VQCTNCRTKFRLPDEKIGPSGTKVRCSRCG  32


>WP_014812726.1 hypothetical protein [Desulfomonile tiedjei]AFM27622.1 hypothetical 
protein Desti_5010 [Desulfomonile tiedjei DSM 6799]
Length=455

 Score = 49.0 bits (113),  Expect = 0.004, Method: Composition-based stats.
 Identities = 35/286 (12%), Positives = 80/286 (28%), Gaps = 52/286 (18%)

Query  96   SGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNW  155
                +         +           L I    I L F     + +      L       
Sbjct  161  QTFQICRNRFWKLLAIAAIPYCIMIALIILAGIIALIFGLTDMSFIDDFTPALLIAGIFL  220

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG  215
                L+  +A        +  ++ +      + +  S       +  F L  +L  +++ 
Sbjct  221  IPTALVLFIALFYSSQGALIFAVSVSYLGKQISVRESYNFVFARLAKFILTSLLFTIMIL  280

Query  216  GG----------------------------SLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
                                           L L+I      +       ++  +N+  +
Sbjct  281  LSLALAVTIGIVLFFVFQAFTSSGWWSAFSWLPLMIIPFYVILKLLLFDKIVIVENVAYM  340

Query  248  QALEKSRLLV---------SGHWWAIFGRFVLLLVISLTLSFLT--------ARIPY---  287
             AL +S  L+          G+   +F   +L ++I+ T+S++         A +P    
Sbjct  341  GALSRSWNLLSGKAAGSWPRGYALRLFLLTLLFMLIAATISWVFQTPAALLTAFLPLPEF  400

Query  288  ----VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
                + +      SL+ T F+ +   + Y D++    G     +  
Sbjct  401  AKTVLTQLLGTIGSLIATVFAAVCMVVFYYDVRNRKEGFDLQMLAE  446


>WP_197444114.1 RDD family protein [Maioricimonas rarisocia]QDU36828.1 RDD family 
protein [Maioricimonas rarisocia]
Length=543

 Score = 49.0 bits (113),  Expect = 0.004, Method: Composition-based stats.
 Identities = 35/364 (10%), Positives = 86/364 (24%), Gaps = 20/364 (5%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            V+CP CG +   P  K        +  E   +                     H  +   
Sbjct  25   VKCPKCGTKLKVPGGK--------QKVEAPASSGDSAEFLAGMNLDSLSLEDRHQKVCPY  76

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
              S+  E      +C    ++  +  +   + +  G            + +         
Sbjct  77   CASEMDEEDVICPSCGMNTQTGRMDAKIAKKKARRGPDPNEFYRKAWKDSWKFVFEQKGL  136

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
                G+      + +AL +        Q     +   +  +  + +   W+  ++ I   
Sbjct  137  AIRTGMYWTIYSVLNALSVFFLVKYATQIPLLVFWGSMTFITMMGIPGWWLFLALKIVDK  196

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
                   ++ ++      S  L L ++I  V      + +      +             
Sbjct  197  SVHREKIQADRIHFDFFQSVALGLRMVIWPVVATFPFVFLAPFALGLMLPVALVH-----  251

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
               + A    R  +      +F + +   +  + L F+      V          +    
Sbjct  252  ---MTAKYTYRAWIFWEIGRVFLKNLGPSLYVVILGFVVNLP--VAALGFGIAYFVGGNA  306

Query  304  SFLYYY--LIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQ  361
                    L    +          P    ++   A      L   ++   L         
Sbjct  307  FQSEPVNDLTGRIVTWVMELAGENPNPEGFIFWIAHSTLTFLAAIIVTAPLMFLAGFPAV  366

Query  362  LLSA  365
             +  
Sbjct  367  FIMR  370


>MAH84786.1 hypothetical protein [Magnetovibrio sp.]OUT49876.1 hypothetical 
protein CBB68_10700 [Rhodospirillaceae bacterium TMED8]
Length=366

 Score = 48.6 bits (112),  Expect = 0.004, Method: Composition-based stats.
 Identities = 12/84 (14%), Positives = 19/84 (23%), Gaps = 2/84 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP+C    N   S +  +  + RC  C  +    P  S                 
Sbjct  1   MI-ITCPNCATRYNIQPSLVG-EGRNTRCFNCGHSWFQGPVVSPPPPPMRPPPAPQEQSQ  58

Query  61  QRRIPSDRLEIQSKTVNCRRCNRS  84
               P                 + 
Sbjct  59  VPMTPDMASAEPYSNPPQLAHVQM  82


>GFM37166.1 hypothetical protein DSM19430T_18500 [Desulfovibrio psychrotolerans]
Length=322

 Score = 48.6 bits (112),  Expect = 0.004, Method: Composition-based stats.
 Identities = 40/314 (13%), Positives = 78/314 (25%), Gaps = 9/314 (3%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ-  61
             +RCP C   +    + +PA+   A CPEC +   F   E         ++         
Sbjct  2    QIRCPRCEYSKELDDAAIPAEMVYATCPECGERFRFRQPEQAGVPADPALSGGAVREFFL  61

Query  62   ----RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRR  117
                 +  +     +   V   R  +               G  S         EL    
Sbjct  62   DAEAGQQAAPEKPYEIPGVQAGREEQGGRAYGNAPAGHGNYGAESGGNPGTAQPELPRIP  121

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
                  I      +       +      +      +           + + +  + M   
Sbjct  122  WAWRSQIGWASAFVHTVREVLSAPPVFFSRSGNDWKRAGAMTFYVISSALGVLFAQMWAW  181

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
                     +        G    G    L +  +++V G     +  G+L          
Sbjct  182  AMDTFLGDVLSEGALAAAGGGFPGWSAALGVTAVVLVLGPLFFFVSAGILHAGLSVVKGA  241

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
                    G  A   +  L +  ++ +      L VI    + +       G       +
Sbjct  242  PRGF----GATANVVAFSLSANMFYLVPFIGQYLAVIFGIYALVVGMKYAHGVNVWRVAA  297

Query  298  LLLTPFSFLYYYLI  311
             LL PF F+  + +
Sbjct  298  GLLMPFLFIMAFYV  311


>WP_078788646.1 zinc-ribbon domain-containing protein [Geobacter thiogenes]SJZ38055.1 
MJ0042 family finger-like domain-containing protein 
[Geobacter thiogenes]
Length=295

 Score = 48.2 bits (111),  Expect = 0.004, Method: Composition-based stats.
 Identities = 39/279 (14%), Positives = 72/279 (26%), Gaps = 14/279 (5%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            T+ CPHCGA      SK P   +S  CP C Q+    P ES                   
Sbjct  2    TITCPHCGASGQLDDSKRPVGATSINCPRCKQSFPLPPLESAAVAPPVIPVPPALAAEPA  61

Query  63   RIPSDRLEI---------QSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWEL  113
             +                 +     R    +          +  +     S   A     
Sbjct  62   PLRPCPACGGIIEGSGGLCNACEAARNRQSAAGNGAITLPPSVAAAEDRSSGNCAVCKGR  121

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F +      G  L+        +    +    T        W           IL  +++
Sbjct  122  FAQSQMVRFGDKLVCASCKPTYVQMLAMGMGNTGDLRYAGFWIRFGAKMIDGIILWVVNF  181

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
             T     ++  ++     ++   + +V     + I   +    G+       +   +   
Sbjct  182  ATTMATTFLIASNNSPQMAIVAAIMNVCIQVGVGIAYNIYFLSGNHQATPGKMACGL---  238

Query  234  FCQYVLADDNIGGLQALEKSR-LLVSGHWWAIFGRFVLL  271
                    D I   +A+ +    ++SG   AI       
Sbjct  239  -KVVTADGDKISAGRAVGRYFAEMLSGMIMAIGYIMAAF  276


>WP_108102137.1 zinc-ribbon domain-containing protein [Geobacter sp. DSM 2909]PTV89671.1 
putative Zn finger-like uncharacterized protein 
[Geobacter sp. DSM 2909]
Length=407

 Score = 48.6 bits (112),  Expect = 0.004, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  ++C  C        S++       RC  C  T 
Sbjct  1   MI-IQCEKCRTRFKLDDSRVTESGVRVRCSRCGHTF  35


>MBA2542066.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=280

 Score = 48.2 bits (111),  Expect = 0.004, Method: Composition-based stats.
 Identities = 24/240 (10%), Positives = 62/240 (26%), Gaps = 26/240 (11%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
                 +++ +       +  +Y L I + +     A+L+      +         I+   
Sbjct  11   WTFTKEAFVMAKHGKLLMPSLYQLLISIVYFVGVVAVLIAIDPHWSDATWAVVSGIITFG  70

Query  164  -VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL----------  212
                           + +++          +    ++  +  LL  +  +          
Sbjct  71   SFLIFYFFCGVTVNMIDVHLKGGTPSFRDGVADARKNFLAIVLLATISAVIESFARFARN  130

Query  213  -VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF------  265
                 G +L  I   ++    F     +  ++ G  QA+ + R L  GH   I       
Sbjct  131  ENSLVGRILASIIESIWTTLSFLLLPTIIIEDAGFSQAMGRVRDLHKGHLMLIAVGDVGV  190

Query  266  -GRFVLLLVISLTLSFLTARIPY-------VGEAANLAFSLLLTPFSFLYYYLIYSDLKA  317
                 L+ ++   L F      +                  +   F     ++  +    
Sbjct  191  RAITFLVGLLWFGLIFAIVFFSFSTFGNTTALIITFGVGGTMFAAFVAFSTFVRMAYYTC  250


>WP_176424889.1 zinc-ribbon domain-containing protein, partial [Myxococcus sp. 
AM009]NVJ02583.1 zinc-ribbon domain-containing protein [Myxococcus 
sp. AM009]
Length=388

 Score = 48.6 bits (112),  Expect = 0.004, Method: Composition-based stats.
 Identities = 9/35 (26%), Positives = 13/35 (37%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            V CP C    N    ++P   +  +C  C  T  
Sbjct  2   KVSCPSCQTNYNIDDKRIPPGGAKLKCARCQTTFP  36


>OIP57164.1 signal peptidase I [Candidatus Levybacteria bacterium CG2_30_37_29]PIR79562.1 
signal peptidase I [Candidatus Levybacteria 
bacterium CG10_big_fil_rev_8_21_14_0_10_36_30]PIZ96330.1 signal 
peptidase I [Candidatus Levybacteria bacterium CG_4_10_14_0_2_um_filter_36_16]
Length=503

 Score = 49.0 bits (113),  Expect = 0.004, Method: Composition-based stats.
 Identities = 62/399 (16%), Positives = 124/399 (31%), Gaps = 26/399 (7%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +L + L+   +    +    L+             +            +  S +   +  
Sbjct  80   ILALTLILGYIFIQNLTILSLITSIVNYREGIGIIESFKRSFKKTIPYIWTSALVFLVTF  139

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLI-LVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
             +    +     + + L  +   +   I+   L+    ++L I    +  VWFF   YVL
Sbjct  140  GLILIVLLPTMLIFIFLSKLALLSPGDIIGKALITFISTVLFIALVGILTVWFFNAAYVL  199

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA-------  292
            A + IGGL AL KSR  V G ++++  R VLLL++        + +P +           
Sbjct  200  ASEGIGGLNALLKSREYVRGRFFSVLLRIVLLLIVITFFETGISILPKLISLLHVPYLDI  259

Query  293  --NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLV  350
                A S+ L P  F+Y ++IY +LKA        P K+    L  +I    LI   +L+
Sbjct  260  IITAAISIALAPLGFIYSFIIYDNLKAFKGDFVFQPSKKGK-ILLLSIIFTPLIFAFILL  318

Query  351  SLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTT  410
                 +L +   +        ++   P Q   +  +       +            +  T
Sbjct  319  YYIFPSLISSYKMPRSIGNYPKISISPTQNKPITPTYQSGTFLIEGQSMYPNYQNGQSFT  378

Query  411  SEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARD  470
                          +            +   +   P       G     I+  +  + + 
Sbjct  379  INTTTYNTTNPSRGELIIFFSPITRKPILKRVIGIP-------GDKISLINNSISLNGQQ  431

Query  471  LYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGT  509
            L        + + +     +T         + I +    
Sbjct  432  L--------NESLYLPPETKTYVGTFLENSKEITVPPDA  462


>MBI4745440.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=63

 Score = 44.0 bits (100),  Expect = 0.004, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 11/36 (31%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M    CP C        SK+       RC +C    
Sbjct  3   MIA-ACPKCKTRFKIDESKMAGAGVKLRCSKCRTVF  37


>WP_002619262.1 zinc-ribbon domain-containing protein, partial [Stigmatella aurantiaca]EAU62350.1 
adventurous gliding motility protein X, 
partial [Stigmatella aurantiaca DW4/3-1]
Length=101

 Score = 45.1 bits (103),  Expect = 0.004, Method: Composition-based stats.
 Identities = 9/32 (28%), Positives = 12/32 (38%), Gaps = 0/32 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
              C  C A+      K+ AK    RC +C  
Sbjct  2   RFVCDSCRAQYMISDDKVGAKGVKVRCKKCGH  33


>WP_084454530.1 hypothetical protein [Mycobacterium interjectum]
Length=266

 Score = 47.8 bits (110),  Expect = 0.004, Method: Composition-based stats.
 Identities = 18/134 (13%), Positives = 43/134 (32%), Gaps = 22/134 (16%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            L+    +L++I   +       F   ++  + +   +A+ +S  LV   +W + G   L 
Sbjct  107  LLGLPLALVVIALLVYLYTVVLFAPVLIVLERLPVFEAVARSFALVRNGFWRVLGIRTLT  166

Query  272  LVISLTLSFLTARIPYVG----------------------EAANLAFSLLLTPFSFLYYY  309
             +++  +    A    +G                              ++  PF+     
Sbjct  167  FIVASFIGNAVAAPFAIGGQVLLAAMTPSTGGLLLSTAIAAVGTAIGQIITAPFNAGVIV  226

Query  310  LIYSDLKANYRGPQ  323
            L+Y+D +       
Sbjct  227  LLYTDRRMRAEAFD  240


>WP_102237581.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Brevibacterium paucivorans]PMD05922.1 hypothetical 
protein CJ199_00475 [Brevibacterium paucivorans]
Length=319

 Score = 48.2 bits (111),  Expect = 0.004, Method: Composition-based stats.
 Identities = 18/139 (13%), Positives = 50/139 (36%), Gaps = 1/139 (1%)

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
                    +  +  G+   +      + + T+  I+ +L    G L+  +    F + F 
Sbjct  104  WALIGMTVLMVSGFGVSAGVFFAPSIILALTVNEIMGVLAFFVGGLIWAVLIAFFLIKFA  163

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG-RFVLLLVISLTLSFLTARIPYVGEAA  292
            F   ++  + +    A+++S  L +  +W I G   +   + S  +  +   I  +    
Sbjct  164  FVGPLITYERLPLKDAVKRSWKLTNKGFWRILGELALGYYLSSHVVQIMITPIMILFPIV  223

Query  293  NLAFSLLLTPFSFLYYYLI  311
             +  ++     S  +  ++
Sbjct  224  MIILTIATGFQSEAFAAVV  242


>MYK17757.1 hypothetical protein [Candidatus Poribacteria bacterium]
Length=172

 Score = 46.7 bits (107),  Expect = 0.004, Method: Composition-based stats.
 Identities = 20/144 (14%), Positives = 42/144 (29%), Gaps = 3/144 (2%)

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
            ++  G   L+  +    F   + F    +  +       L +SR L+ G WW   G  + 
Sbjct  1    MVPGGFVLLIGTLVIGWFGTLWSFYIPTILVEGKSVRAGLRRSRNLIRGTWWRTGGMVLS  60

Query  271  LLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
            + ++S T+SF+                L    F       ++              +   
Sbjct  61   IFLLSFTISFILRASF---GFLLNLGELADETFIHTIEMALWDLPVTRRGLSFSKALIYV  117

Query  331  WLPLTAAIFGWMLIPGLLLVSLSR  354
                       + + G  L+   +
Sbjct  118  INLGADTFTMPIWVIGGTLLYFDQ  141


>PZQ46111.1 hypothetical protein DI551_05745 [Micavibrio aeruginosavorus]
Length=292

 Score = 48.2 bits (111),  Expect = 0.004, Method: Composition-based stats.
 Identities = 8/40 (20%), Positives = 11/40 (28%), Gaps = 1/40 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M    CP C          +       RC  C    + +P
Sbjct  1   MIL-TCPACTMRYLVSEGAVGPNGRRVRCANCGHQWVQEP  39


>OYT56359.1 hypothetical protein B6U68_03535 [Candidatus Aenigmarchaeota 
archaeon ex4484_14]
Length=117

 Score = 45.5 bits (104),  Expect = 0.004, Method: Composition-based stats.
 Identities = 15/71 (21%), Positives = 33/71 (46%), Gaps = 1/71 (1%)

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
              +   F    +  +N+G  +A++KS  +   +W  +    +++ VIS+ ++FL   IP 
Sbjct  1    MTLALIFGFQAVVLNNMGFSKAIKKSWNIFRKNWLGVLVAVIVMSVISIVIAFLF-AIPM  59

Query  288  VGEAANLAFSL  298
            +    +   S 
Sbjct  60   LFVLFSAIASF  70


>EFE80924.1 integral membrane protein [Streptomyces albidoflavus]
Length=218

 Score = 47.5 bits (109),  Expect = 0.004, Method: Composition-based stats.
 Identities = 27/217 (12%), Positives = 53/217 (24%), Gaps = 38/217 (18%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
                L     S  I      +     +   +  +        L+++      +  +   L
Sbjct  4    FPAVLGIQLLSGLIVAIPVALMALFMVLTMVALLSRGDAGPWLVLMPFLFLGVAALAVWL  63

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-  285
               +           +  G + AL +S  LV  +WW  FG  +L  ++++ LS     + 
Sbjct  64   --GIRLALATPAAVFEGQGPVGALRRSVRLVRDNWWRTFGLLLLAALMAMGLSLALQILT  121

Query  286  -----------------------------------PYVGEAANLAFSLLLTPFSFLYYYL  310
                                                 +G        ++    + L   L
Sbjct  122  GAFQSPADSLVVDNTDTGWFGSAEVRALLADILVGSVIGLLVGSVVQIVGMTLTHLTAAL  181

Query  311  IYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGL  347
            +Y DL+    G     I                  G 
Sbjct  182  VYVDLRIRKEGLADALIGEVGQGAPGPQAPPATGQGG  218


>OQB43884.1 hypothetical protein BWY03_00525 [Parcubacteria group bacterium 
ADurb.Bin159]
Length=289

 Score = 48.2 bits (111),  Expect = 0.004, Method: Composition-based stats.
 Identities = 19/271 (7%), Positives = 61/271 (23%), Gaps = 15/271 (6%)

Query  88   QPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATW  147
                     G     +       +     +    +    +      +             
Sbjct  14   WTNPLLWIFGFFASYLLTNEIILFLNVPWQISVGVNPSPVFFQFIISFKNLLFNENIVIS  73

Query  148  LNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLL  207
               +     +   +            +  +        +  +  +            LL+
Sbjct  74   DFLKIIGSLFLFWIIPFILATWAEIVIFKNAKRKNKNINFSIKDTFSKFWPVFLINFLLI  133

Query  208  ILLILVVGGGSLLLIIPGLLFCVWFF---------------FCQYVLADDNIGGLQALEK  252
            ++    +   +LL  +                         F    +  +N   + A++ 
Sbjct  134  LVNDGSILLINLLTSVLSGFIFWLVILLLLLIELILFLITKFVLCFIILENKKIISAIKD  193

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
              L    +W  +F   ++L V+++  + +   I   G +     S L++ +    Y +++
Sbjct  194  GILFFKKNWQTVFYLLLILFVLNILFTIIINLIAVGGFSPFSIISTLISEWGSYGYQILF  253

Query  313  SDLKANYRGPQHPPIKRQWLPLTAAIFGWML  343
                              +           L
Sbjct  254  YTGLIIVGSIVLILWSFFFSFQIIIWPVLFL  284


>MBF0127708.1 zinc-ribbon domain-containing protein [Magnetococcales bacterium]
Length=58

 Score = 43.6 bits (99),  Expect = 0.004, Method: Composition-based stats.
 Identities = 9/44 (20%), Positives = 17/44 (39%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  V+C  C ++ +     L       +C +C       P +S+
Sbjct  1   MI-VQCESCRSQFDVDDEILRPSGRKLKCSQCQAVFFQPPPKSR  43


>MBI4391176.1 zinc-ribbon domain-containing protein [candidate division NC10 
bacterium]
Length=221

 Score = 47.5 bits (109),  Expect = 0.004, Method: Composition-based stats.
 Identities = 9/30 (30%), Positives = 15/30 (50%), Gaps = 0/30 (0%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
            +CP C A     ++ +P   +  RCP C +
Sbjct  174  QCPKCQASFRIKAAAIPPGGARIRCPRCGE  203


>MBC3843528.1 hypothetical protein [Streptacidiphilus sp. 4-A2]
Length=336

 Score = 48.2 bits (111),  Expect = 0.004, Method: Composition-based stats.
 Identities = 13/81 (16%), Positives = 31/81 (38%), Gaps = 0/81 (0%)

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            L   +  L   +       V+  ++ G   A  ++  L SG+WW   G  +L+ ++    
Sbjct  72   LAAAVWWLFLIIRLAPLVPVVVLESQGPKDAFTRAWRLNSGNWWRTLGITLLVSLVGSFA  131

Query  279  SFLTARIPYVGEAANLAFSLL  299
            + +      +  + ++   L 
Sbjct  132  AQVVTTPISLLTSTSVLSGLP  152


>MBC7564201.1 hypothetical protein [Gemmatimonadaceae bacterium]
Length=296

 Score = 48.2 bits (111),  Expect = 0.004, Method: Composition-based stats.
 Identities = 14/88 (16%), Positives = 34/88 (39%), Gaps = 7/88 (8%)

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV-------GEAA  292
              + +G + AL +S+ L  G++  +   + L+L+I   +  +   +  +        +  
Sbjct  196  MAEGLGPIAALRRSQALSRGNYLRLARTYGLVLLIVFVVYAVLLMLATLFPTQQQALQTL  255

Query  293  NLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                 + + P       L Y+DL+    
Sbjct  256  VSVLLIPVVPIIGSIMLLTYADLRVRRE  283


>HBZ69447.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=77

 Score = 44.4 bits (101),  Expect = 0.005, Method: Composition-based stats.
 Identities = 13/78 (17%), Positives = 16/78 (21%), Gaps = 1/78 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    CP C       + +L       RC  C       P                    
Sbjct  1   MIA-ACPKCSTRYRVENERLTPDGVRLRCTRCQAVFRVRPPGXXAGCALPVRWRPSSSRR  59

Query  61  QRRIPSDRLEIQSKTVNC  78
            RR P       +    C
Sbjct  60  SRRSPPLCPNPPTARDLC  77


>RMG16541.1 hypothetical protein D6729_10685, partial [Deltaproteobacteria 
bacterium]
Length=81

 Score = 44.4 bits (101),  Expect = 0.005, Method: Composition-based stats.
 Identities = 12/37 (32%), Positives = 19/37 (51%), Gaps = 0/37 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
            V CP C AE +  +S++P    + +CP+C  T    
Sbjct  2   RVTCPGCAAEYDIDASRIPPTGLNLKCPKCSTTFPVQ  38


>WP_066022326.1 MULTISPECIES: DUF975 family protein [Clostridium]PJI09469.1 DUF975 
domain-containing protein [Clostridium sp. CT7]
Length=257

 Score = 47.8 bits (110),  Expect = 0.005, Method: Composition-based stats.
 Identities = 18/173 (10%), Positives = 54/173 (31%), Gaps = 12/173 (7%)

Query  143  KPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGS  202
               +        +   +L+     +   ++ +   +   I             G   +  
Sbjct  84   NIFSGFKNFRSAFLLQLLIIIFTLLWSFIAMIPFGVISLIVLGTHINEIPQGDGFYSIIY  143

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD-DNIGGLQALEKSRLLVSGHW  261
               ++   +  +     L++   ++    +    Y+L+D  +IG  +A++KS+ ++ G+ 
Sbjct  144  TVFIVFGSMASIVVEFYLILTSAIIALYRYSMSYYILSDCQSIGAYEAIKKSKKMMKGYK  203

Query  262  WAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
            W +F   +  +               +         L + P+        Y +
Sbjct  204  WKLFYLNLSFIGWIA-----------LSIITYGIGFLWIIPYIETAKANFYEN  245


>MBA3430563.1 hypothetical protein [Actinobacteria bacterium]
Length=273

 Score = 47.8 bits (110),  Expect = 0.005, Method: Composition-based stats.
 Identities = 26/233 (11%), Positives = 64/233 (27%), Gaps = 20/233 (9%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAI--LLATVAYIL  168
            W L+      L+ IY    V           L  A   +        ++   +       
Sbjct  28   WRLYRGHFGALVAIYGTVFVAVGLLRTLGYTLFDAAGFSATATLAVVSLALTVLVAIGGS  87

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT----LLLILLILVVGGGSLLLIIP  224
            L ++  +      +    V    +++              +  +L +L +     +    
Sbjct  88   LCVAATSVIAADGVTGRGVTPGDAVRELRPKWRDLASAGLVTSMLSVLAIFLPFGVFGSI  147

Query  225  GLLFCVWFFFCQYVLA-DDNIGGLQALEKSRLLVSGHWWAIFGRFV------------LL  271
             ++  ++       +   +     +A   ++ L+SG    +    +            LL
Sbjct  148  VVMPMLFGPPVLIHVIGLEGRNFGEAWNHAKTLLSGQMGRLLIYLLNVALGLGLLQLVLL  207

Query  272  LVISLTLSFLTARIPYVG-EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
             V     S LT  +  +         + +  P      ++ Y D++A      
Sbjct  208  SVTFPVASLLTGGLGLIAQSLVQALIAAVTLPLLGTMSFVCYLDVRARKEEFS  260


>HHJ03931.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=674

 Score = 49.0 bits (113),  Expect = 0.005, Method: Composition-based stats.
 Identities = 10/44 (23%), Positives = 17/44 (39%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  ++C  C      P  K+  + +  RC +C    I  P   +
Sbjct  1   MI-IKCEKCSTAFRLPDEKIKPEGTKVRCSKCKNVFIVYPPRKE  43


>MBM04113.1 hypothetical protein [Chloroflexi bacterium]
Length=272

 Score = 47.8 bits (110),  Expect = 0.005, Method: Composition-based stats.
 Identities = 27/236 (11%), Positives = 76/236 (32%), Gaps = 12/236 (5%)

Query  81   CNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSAL  140
                  +               +  +L  +++++    +  + I L+ + +     +  +
Sbjct  12   NQYEENISNFNYDSKDLGNFEWVKYILISTFKVYLNNIFTFIFISLIPVGIFNILRYVFI  71

Query  141  LLKPATWLNPQNQNWQWAILLAT---VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGL  197
                    N  N    +   +         +L  + ++  +        + +F S    L
Sbjct  72   PNINELNQNELNGLQLFFFGVLLITGFFVSILATAALSIKINALTFGKKLNVFDSYISVL  131

Query  198  RHVGSFTLLLILLILVVGGGSLLLIIPG-----LLFCVWFFFCQYVLADDNIGGLQALEK  252
              + +  +  ++  ++    +LL I         L  V F F   ++  +++G +++   
Sbjct  132  PKIRTLFIANLVFFILFFAAALLSITIFGLPLLFLLMVNFSFFNQMILFEDMGPIESFPG  191

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR----IPYVGEAANLAFSLLLTPFS  304
            S  LV      IF   + +L++ L +  L       I  +         + + P +
Sbjct  192  SYELVKTFRIRIFFVILFILILLLFMYSLVWIIEPKINLINNLIFSILEITIFPIT  247


>MBE5763511.1 hypothetical protein [Clostridiales bacterium]
Length=317

 Score = 48.2 bits (111),  Expect = 0.005, Method: Composition-based stats.
 Identities = 21/223 (9%), Positives = 62/223 (28%), Gaps = 6/223 (3%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
               G  +  I    I+     I   +     + +     +       A         S++
Sbjct  92   NWNGAVIGAIVTFVILAILYGIVYFMSYYTISDILNNFMSSNSKYGFAANFIANAKKSFI  151

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
                +         +   + +G+  +           + +    LL I           F
Sbjct  152  FSIWYTLYVIGVYVVGFGVAIGIGLLIGRWSAS----IGLFVMYLLAIGTLATRRAVTPF  207

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
                +   ++    A  K+  L+ G +W  +G + +L ++++ L   ++ + +    A +
Sbjct  208  WMPAMVAKDLSCKDAFYKNFELLKGRFWKTYGAYFMLYLLAVVLFLGSSILTF--GVAMI  265

Query  295  AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAA  337
                 +  +  +   + +  +           +         A
Sbjct  266  FAFCAVWLYFQIRDMVAFYHINGMKYYIDEQTVIDPKKIYRDA  308


>MBI2798498.1 hypothetical protein [Candidatus Saccharibacteria bacterium]
Length=271

 Score = 47.8 bits (110),  Expect = 0.005, Method: Composition-based stats.
 Identities = 29/216 (13%), Positives = 67/216 (31%), Gaps = 18/216 (8%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                  L +     +L    + +  L      L      +  A +       +   S   
Sbjct  35   HIMSWTLLMVPAAAILVLLIVMAIFLPGGRLPLVILAAIYGLAFVAGLFVLSVYLASAYV  94

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL----------ILVVGGGSLLLIIPG  225
                    +  + L +   +    +    ++  L+           ++   G+++ ++  
Sbjct  95   YLSLSIAKRQSLSLAQLKTVAWAKMWYIFIVNFLVGLINQVVTAVQIIPLIGAVISLVLS  154

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLV--SGHWWAIFGRFVLLLVISLTLSFLTA  283
            +L   +     Y+L D N   ++A++ S  LV   GH        ++  VI +    L  
Sbjct  155  VLISAFMAPATYILVDQNQNPIEAMKDSYNLVRAKGHMSVFVKMVLVFTVILMAAIMLFV  214

Query  284  RIPYVGEA------ANLAFSLLLTPFSFLYYYLIYS  313
             I  +           L  S+ +T  + + + LI  
Sbjct  215  LIFVLPAVMLLANKLILLGSIWITISAVVLFVLIIY  250


>MBE7415897.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=297

 Score = 48.2 bits (111),  Expect = 0.005, Method: Composition-based stats.
 Identities = 9/37 (24%), Positives = 16/37 (43%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  ++C  CG +     S++  K    RC +C    +
Sbjct  1   MI-IQCDRCGTKFRLDDSRITGKGVRVRCTKCQNVFM  36


>PIN92876.1 hypothetical protein COU54_05125 [Candidatus Pacearchaeota archaeon 
CG10_big_fil_rev_8_21_14_0_10_31_24]
Length=316

 Score = 48.2 bits (111),  Expect = 0.005, Method: Composition-based stats.
 Identities = 14/82 (17%), Positives = 36/82 (44%), Gaps = 1/82 (1%)

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
              F  ++      G + +L  S  LV+ + W + G  +LL++I+  +S +   + ++G  
Sbjct  208  LMFSVFIFVYGEKGVIDSLGASWKLVAKNRWKVIGYSLLLILINWVISMILFFVSFIGAG  267

Query  292  -ANLAFSLLLTPFSFLYYYLIY  312
                  ++      ++   +I+
Sbjct  268  SMTAVGNISGAIMIYVALNIIF  289


>MXZ63771.1 hypothetical protein [Chloroflexi bacterium]
Length=144

 Score = 46.3 bits (106),  Expect = 0.005, Method: Composition-based stats.
 Identities = 23/105 (22%), Positives = 45/105 (43%), Gaps = 4/105 (4%)

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLL---VSGH  260
             +L  L       GS+  +I G+ + +  FF   VL  + +G ++A+++S  L     G+
Sbjct  1    MILQALRERGGIAGSIASMIGGVAWSLATFFVIPVLVTEGVGPIEAIKRSAGLLRQTWGN  60

Query  261  WWAI-FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
                 FG  ++ L+  L      A + +V     +A  + L   +
Sbjct  61   QVTANFGFMIVGLLAVLVAIVPAALLFFVHPLLGIAVGVPLVAVA  105


>MBE7453632.1 zinc-ribbon domain-containing protein [Kofleriaceae bacterium]
Length=130

 Score = 45.9 bits (105),  Expect = 0.005, Method: Composition-based stats.
 Identities = 8/42 (19%), Positives = 13/42 (31%), Gaps = 0/42 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           VRC  C  E     ++L     + +C  C            +
Sbjct  3   VRCEKCNTEYELDEARLKPGGVTVKCTTCGHMFKIRKRSPTQ  44


>WP_181754756.1 hypothetical protein [Paenactinomyces guangxiensis]MBA4496385.1 
hypothetical protein [Paenactinomyces guangxiensis]MBH8593502.1 
hypothetical protein [Paenactinomyces guangxiensis]
Length=296

 Score = 48.2 bits (111),  Expect = 0.005, Method: Composition-based stats.
 Identities = 27/216 (13%), Positives = 61/216 (28%), Gaps = 0/216 (0%)

Query  79   RRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFS  138
             R +               +       L              L+ + +   +        
Sbjct  23   YRRHFWSLWTFCFLISFPFNLWIEWWYLQETHRFASIPADRELIILLIDTFIWFAILPPF  82

Query  139  ALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLR  198
                        +    +  + +    + +L   W+   ++  +    V L         
Sbjct  83   IQSACVYLSRLKRTGWKELFVSIRNNLWKVLAAHWLICLLWGVLFTLIVVLIGLPLYMSA  142

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
                 T    ++ +      L+ +IPG  F V       V+ ++N    +A+ +S  L  
Sbjct  143  QENGSTNPDEIIWMAFSICMLVFLIPGSYFYVRLSLVTPVIVEENPTIWRAITRSWELTK  202

Query  259  GHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
            G + +  G  V L  I++   F+   I  +  A + 
Sbjct  203  GVFGSTLGILVCLGAITIPFLFVQIGIKELSTAVSF  238


>OEU53617.1 hypothetical protein BA868_07285 [Desulfobacterales bacterium 
C00003106]
Length=478

 Score = 48.6 bits (112),  Expect = 0.005, Method: Composition-based stats.
 Identities = 7/36 (19%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + C +C ++      ++    S  RC  C    
Sbjct  1   MI-ITCENCQSKFAVDDERISETGSKVRCSNCKHVF  35


>MBE7465897.1 hypothetical protein [Planctomycetes bacterium]
Length=449

 Score = 48.6 bits (112),  Expect = 0.005, Method: Composition-based stats.
 Identities = 28/307 (9%), Positives = 67/307 (22%), Gaps = 22/307 (7%)

Query  6    CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
            CP CG +    + +              Q       +                 L   I 
Sbjct  138  CPACGQQIKVEARQCRFCGHVVDQALQQQLGAAQAQDQYPAWRGARGGVEFGATLDVGIA  197

Query  66   SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL----  121
              R                  +         G  +                    L    
Sbjct  198  VLRRFWGMGGAMIFLFGIIVSMIAMIALIPLGLIVGMAGGAGGGGGAAAGILIAVLGGLV  257

Query  122  -LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               +  L   +  A ++   +      +    +     +         +    +   +  
Sbjct  258  AFAVIWLLSCVFIAGLWKVSITMADNVMGKPVEPRFEDLFSGFSCIGSVLGVNLVIVLLG  317

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
            ++      L  +       +    L ++   + +                       ++ 
Sbjct  318  FLVGLAGALLSTYLGTPGKIAEIVLTIVNYWITM----------------RLLISVPLIM  361

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA-RIPYVGEAANLAFSLL  299
            D + G ++AL +S  L SG+   +F   V+  +IS     L    I +           +
Sbjct  362  DRHRGAMEALGESWRLTSGNALIMFCAVVVGTIISWLGLILLGVGIFFTTVVMLGIIGSM  421

Query  300  LTPFSFL  306
                ++ 
Sbjct  422  YRQLTWN  428


>NIQ97137.1 hypothetical protein [Desulfuromonadales bacterium]NIS43135.1 
hypothetical protein [Desulfuromonadales bacterium]
Length=74

 Score = 44.0 bits (100),  Expect = 0.005, Method: Composition-based stats.
 Identities = 16/74 (22%), Positives = 24/74 (32%), Gaps = 0/74 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M TV CPHCG  +  P  K+P       CP C ++  F    +   +  D          
Sbjct  1   MLTVSCPHCGFAKQVPPDKIPTGVVKVTCPSCGESFPFGRNAADIPEPPDETGEVTPAEQ  60

Query  61  QRRIPSDRLEIQSK  74
                +   +    
Sbjct  61  ALAATAALPKAGFW  74


>XP_012858037.1 PREDICTED: uncharacterized protein LOC105977276 [Erythranthe 
guttata]
Length=640

 Score = 49.0 bits (113),  Expect = 0.005, Method: Composition-based stats.
 Identities = 26/250 (10%), Positives = 65/250 (26%), Gaps = 15/250 (6%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
            +            +    I  L      + + +++++     +    +     +L     
Sbjct  1    MDNMFSSNSEWVTFFRFKIGYLVFFAIVSLLTTSVVVHTIACIYIAKETTLNKVLSLVPK  60

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
                 +   T           + L   + + L +           +  +    ++  +  
Sbjct  61   VWKRLMITFTWHFVFIFAYNFLFLLTVIFVLLDNGDIGIG----RMAFLIVLFIVYFMGL  116

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            L   + +     V   ++  GL A+ KSR L+ G        F +L +    +  +    
Sbjct  117  LYITMIWHLASVVSVLEDSYGLSAMMKSRGLIKGKMEISSAVFFVLGLSFFGIQHMFKIF  176

Query  286  PYVG-----------EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPL  334
              +G               L    +   F  +   +IY   K+ +            L  
Sbjct  177  VVLGHGDGIGKRIVYGIICLVLLSICILFQLIMETIIYFVCKSYHHENIDKSSLADHLGG  236

Query  335  TAAIFGWMLI  344
                F  +  
Sbjct  237  YLEEFIPLQF  246


>NIS18204.1 hypothetical protein [candidate division Zixibacteria bacterium]
Length=41

 Score = 43.2 bits (98),  Expect = 0.005, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 16/36 (44%), Gaps = 0/36 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + C  CG +     +K+  +K+  +C  C   +
Sbjct  1   MMDISCDACGKKYRVDETKMKGEKAKVKCKACDNIM  36


>SCJ14231.1 Protein of uncharacterised function (DUF975) [uncultured Ruminococcus 
sp.]
Length=407

 Score = 48.6 bits (112),  Expect = 0.005, Method: Composition-based stats.
 Identities = 35/272 (13%), Positives = 68/272 (25%), Gaps = 6/272 (2%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
              Y +         ++   L   T      + +        +A +              +
Sbjct  134  LAYNIFFANPLTVGYNNFFLNARTGNAGVGELFSQFKHGHYMATVKNMFFLKLRLFLWSL  193

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                      +             L+L IL+V     + +IP ++    +F   Y+ A++
Sbjct  194  LVLIPIGIGVLIFAKSDSRDIMGSLVLSILIVIPLYFVALIPQIIKKYEYFLVPYITAEN  253

Query  243  -NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-----YVGEAANLAF  296
             NI   +A E S   + G     FG  +  +   +      +         +G AA  A 
Sbjct  254  PNITPNRAFEISSQTMKGEKMHCFGLQMSFIGWLILGGLAGSIFTAFLGVLLGSAATAAG  313

Query  297  SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQN  356
              L+ P+ +      Y  +K                    A         +      R  
Sbjct  314  MALIYPYLYATLAEFYCCMKEKAIATGISSRDELDGLYGRAANAQPFGQPMGYSDHQRYE  373

Query  357  LSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLP  388
                   S   D      ++     D      
Sbjct  374  PYQPTAPSQPADNGLFGESKRDDNDDNYNGPE  405


>PZN29980.1 hypothetical protein DIU80_08415, partial [Chloroflexi bacterium]
Length=247

 Score = 47.8 bits (110),  Expect = 0.005, Method: Composition-based stats.
 Identities = 23/131 (18%), Positives = 46/131 (35%), Gaps = 3/131 (2%)

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
            + LGL +V    +L++L   +    +++L +      ++  F    +     G L A+  
Sbjct  107  IGLGLPYVFLSVMLMLLSPPLGLLAAMVLQLVSFWAWIYIGFANEAIVIGEQGPLGAIRA  166

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTA---RIPYVGEAANLAFSLLLTPFSFLYYY  309
            S  LV  ++W+  G   L   I      +          G  A+   S  +         
Sbjct  167  SFNLVRHNFWSTLGFLALSAFIIPLGMGIVWQSISGTTFGLVASAIGSAYIGCGLAAARM  226

Query  310  LIYSDLKANYR  320
            + Y +    +R
Sbjct  227  IFYRERIRRWR  237


>WP_146007727.1 hypothetical protein [Brachybacterium sp. UMB0905]
Length=255

 Score = 47.8 bits (110),  Expect = 0.005, Method: Composition-based stats.
 Identities = 35/203 (17%), Positives = 63/203 (31%), Gaps = 13/203 (6%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                L  +     V   A + + +                    L  +    L       
Sbjct  59   HMVLLFLVIGGSFVALMAVVIAMVESSGGGEPGAVEVVVILLGSLVMIVLSSLVSMLWMS  118

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
                       G   +++ GL   G      +++ L+V  GS+LL +PG++  V   F  
Sbjct  119  GAARVSAVLADGHRPTIRQGLAGPGRVIATSLVVTLLVVLGSILLYLPGIIAAV-LTFYA  177

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
               A        AL++S  LV  +     G  +L  +I      + +          L  
Sbjct  178  IPAALRGASVGAALKESFTLVKQN----LGITLLGYLIYSVAMSVASM--------TLIG  225

Query  297  SLLLTPFSFLYYYLIYSDLKANY  319
            +L++ PF  L    +Y  L+   
Sbjct  226  ALVVVPFGMLLMLGLYERLQGRE  248


>RZC64493.1 hypothetical protein C5167_008185 [Papaver somniferum]
Length=412

 Score = 48.6 bits (112),  Expect = 0.005, Method: Composition-based stats.
 Identities = 31/256 (12%), Positives = 63/256 (25%), Gaps = 11/256 (4%)

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG  215
               ++          L  +    FI      V L   +    R  G        +++ + 
Sbjct  22   FKKVISVFSKVWGRLLVTILLFTFIMGIYIFVTLGSLVLFVSREFGPEGEGAYKVLVFLV  81

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
              S+   I  +   + +     +   +    ++AL +S  L+ G  W     FV L  I 
Sbjct  82   CISIPSFIAYVYITIVWNIAMVISVLEKDYWVKALVRSMKLIKGKIWVSSAIFVALETIF  141

Query  276  LTLSFLTARI-----------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
              L    + +                      +  +  FS +   +IY   KA +     
Sbjct  142  TGLMIAFSLLVLDGKILNLVGNIFVGIVCYLLAAFMLHFSLVIQTVIYFVCKAKHNEDIS  201

Query  325  PPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLN  384
                     L       + +            L  E+           LG         +
Sbjct  202  NVAAHLDSHLPNPHPFVLFVVNFYGSLYYFLFLYLERYQLQEICTILYLGLLYWFVATFD  261

Query  385  RSLPEEPQRLSSADYK  400
                +    + +    
Sbjct  262  GPDYKAFSIIIAFTTP  277


>GEY15929.1 Ty3/gypsy retrotransposon protein [Tanacetum cinerariifolium]
Length=942

 Score = 49.0 bits (113),  Expect = 0.005, Method: Composition-based stats.
 Identities = 25/219 (11%), Positives = 58/219 (26%), Gaps = 9/219 (4%)

Query  102  SISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILL  161
            +       S   F      +  + +  + L      +       +  N         +  
Sbjct  100  NHPLNFPPSHRNFPPIHHLVYLLLIYTLSLCATSSITYNAHHIFSLSNTLTTTPLPTLTT  159

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLG-LRHVGSFTLLLILLILVVGGGSLL  220
             T +   L  + +     + +      +   +    L+ + S   L  ++          
Sbjct  160  VTNSLFPLAFTAVVSHALMLLALLTFLMLAGLAFMSLQTLWSLFGLRFMVEFDSVYFMAF  219

Query  221  LIIPGLLFCVWFFFCQYVLAD-------DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
             +I G++  V   +              ++  G QAL +S  LV G         V   V
Sbjct  220  SVILGVVLFVVMMWLYVNWLLVNVVVVTESKWGFQALVRSWYLVKGMRCVALKLIVFYGV  279

Query  274  ISLTLSFLTARIPYVGEAANL-AFSLLLTPFSFLYYYLI  311
            +      + +   +          +L    F   +  L+
Sbjct  280  LEGLFVMVYSGSLFGYGVGMGRWATLFNMMFGSYFLMLL  318


>OPX36131.1 hypothetical protein B1H12_07605 [Desulfobacteraceae bacterium 
4484_190.2]
Length=688

 Score = 49.0 bits (113),  Expect = 0.005, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 15/34 (44%), Gaps = 1/34 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M  + C  CG +     +K+P K +  +C  C  
Sbjct  1   MIVI-CEECGKKYRIDPAKIPEKGARFKCKSCSN  33


>HFK85759.1 hypothetical protein [Chloroflexi bacterium]
Length=309

 Score = 48.2 bits (111),  Expect = 0.005, Method: Composition-based stats.
 Identities = 29/241 (12%), Positives = 66/241 (27%), Gaps = 2/241 (1%)

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
            R                     S          +    +   +      +E F      L
Sbjct  9    RAWQIIWKHKVLWIFGILAGCGSAGGNNTGYNFSRRDNVPFSNTTFEQFFERFADWQIAL  68

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA-YILLGLSWMTGSMFI  180
              I ++ ++L    +   L       L       +                 +      +
Sbjct  69   FIIVIVLLILLITLLVIFLSTIGKIGLIQGTWQVEQGKEKLNFGELFSNSSRYFWRVFLL  128

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +    V     + + + +VG+  L L +L + +     LL   G L  +        + 
Sbjct  129  NLIVGLVVAVAVIAIAVGYVGAAVLTLGILGICLLPVICLLAPLGWLLSLVLEQAIIAIV  188

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
             ++ G     ++   L   +  +     ++LLVI+L +S +   +P +          +L
Sbjct  189  VEDKGISDGFQRGWQLFRKNIGSYLLMGLILLVITLVVSAVLG-LPILLIVGPAVAGAIL  247

Query  301  T  301
             
Sbjct  248  G  248


>WP_025022160.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Lactobacillus hayakitensis]KRM19709.1 glycerophosphoryl 
diester phosphodiesterase [Lactobacillus hayakitensis 
DSM 18933 = JCM 14209]
Length=342

 Score = 48.2 bits (111),  Expect = 0.005, Method: Composition-based stats.
 Identities = 24/235 (10%), Positives = 62/235 (26%), Gaps = 0/235 (0%)

Query  59   GLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRG  118
                   +   +     +         C           + L +    + ++        
Sbjct  9    HFIEHYTNVFFKNWGSYLVFFFGIGFSCQSILFPLFNHLTNLITSHHGIRNTLANQPISI  68

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
                  + L +          ++       +       W           +    +    
Sbjct  69   LLFFLEFCLILASLLLFFIFIVVSVTNISRDTFELRLIWQETWHNFFKQGVLSYLLFVGY  128

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
             + +      LF S  L    +  F L           G LL+ +  L F +   +   +
Sbjct  129  LVILYPLVRLLFLSSFLSNFVLPQFFLDDFYTKTSYAFGILLVYLVLLYFGIRLLYALPL  188

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
            +   +   +QA+++S  L       IF R +++ +    +++    I Y+ +   
Sbjct  189  IILKSYRPMQAIKRSWNLTKQAVLEIFQRVLVIFIFFAIIAYFLTGISYLIQIIF  243


>KRT62392.1 Uncharacterized protein XU10_C0029G0008 [Chloroflexi bacterium 
CSP1-4]
Length=298

 Score = 47.8 bits (110),  Expect = 0.005, Method: Composition-based stats.
 Identities = 17/99 (17%), Positives = 39/99 (39%), Gaps = 6/99 (6%)

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
                 L + +    ++ +++      + +     V+  D   G+ AL +S  LV+G  W 
Sbjct  160  PAAGGLPVFLGLVAAVGVVVLLTFLGLRWALWPQVVLLDGGAGITALGRSFRLVAGSTWR  219

Query  264  IFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            + G  +   + +  L+ L      +G+   +   L+  P
Sbjct  220  VLGYALAFGLATGILTQL------LGQLGGIVVGLVTGP  252


>MBI9020351.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Verrucomicrobia bacterium]
Length=230

 Score = 47.5 bits (109),  Expect = 0.005, Method: Composition-based stats.
 Identities = 29/139 (21%), Positives = 51/139 (37%), Gaps = 2/139 (1%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
                 +  +     L       SM +          R  +     + + T + +LL LV 
Sbjct  79   VWIYTIKLSKLLFALLPFIALLSMILLAWSNAAVKGRRNERSPGRMLTITGVHLLLTLVK  138

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDN-IGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            G      I+PG    +   F   ++ ++N  G + A+ +S  L  G++ A+     L   
Sbjct  139  GLAFAFFILPGFYLYIRLLFVTLIMLEENETGMMDAIRESWTLTRGNFRALLTLVCLNGT  198

Query  274  ISLTLS-FLTARIPYVGEA  291
            + L +S  L   IP  G A
Sbjct  199  LQLVVSPTLIGLIPATGFA  217


>HDY19326.1 hypothetical protein [Gemmata sp.]HEJ50530.1 hypothetical protein 
[Gemmata sp.]
Length=229

 Score = 47.5 bits (109),  Expect = 0.005, Method: Composition-based stats.
 Identities = 20/231 (9%), Positives = 50/231 (22%), Gaps = 7/231 (3%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              + CP CGA  N   +   A   + +C +C  ++      +      D          Q
Sbjct  3    ILIACPSCGARLNVSDN---AAGKTVKCLKCDASMDIPSPPAPPPAQADPPPAAATSSKQ  59

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
            +     R   + K             Q       +        +           +    
Sbjct  60   QISAKLRQYEEEKNQEEYEAAVRTPHQHRGRSEETDPEYELDEEYHEVPARTRRPQRQS-  118

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                 +        + S                    +        ++            
Sbjct  119  ---NWMARTGMIIGLISIGFGCIPCGWLLLGYFHSILVGGLLSILGIIFSGVGLAQARRV  175

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
              K       ++   +  + +  +L +   L +    L +  P +   + +
Sbjct  176  NTKPQPSKKMAITGLVTSILALVILPVSPFLFLIVLILAVRFPPVGNWIPW  226


>HCC98734.1 hypothetical protein [Rhodobacteraceae bacterium]
Length=262

 Score = 47.8 bits (110),  Expect = 0.005, Method: Composition-based stats.
 Identities = 9/40 (23%), Positives = 15/40 (38%), Gaps = 0/40 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
            + CP C A+   P   +P      +C  C +T      +
Sbjct  2   LLACPICQAQYEVPEDAIPEAGCEVQCSACGETWFQPHPQ  41


>BAS27211.1 hypothetical protein LIP_1360 [Limnochorda pilosa]
Length=265

 Score = 47.8 bits (110),  Expect = 0.005, Method: Composition-based stats.
 Identities = 15/101 (15%), Positives = 30/101 (30%), Gaps = 11/101 (11%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            + V    +  +   L     +     ++ D N+   +AL  S   + G    + G   +L
Sbjct  150  VGVLVFFVAGVWLMLYLGFGYMITIPLVLDQNLSPWRALTTSARALRGRRLTVLGVLAVL  209

Query  272  LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             +           I  +G        L+  P        +Y
Sbjct  210  FL-----------INVLGALPFGLGLLVTAPLGPTAIVAVY  239


>ALS89603.1 transposase zinc-ribbon domain protein, partial [uncultured bacterium]
Length=146

 Score = 45.9 bits (105),  Expect = 0.005, Method: Composition-based stats.
 Identities = 10/34 (29%), Positives = 14/34 (41%), Gaps = 1/34 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M  + CP C         K P +  + RCP+C  
Sbjct  1   MI-IVCPKCATRLQIDDEKSPNRPFNVRCPKCSA  33


>OGG19237.1 hypothetical protein A2721_00160 [Candidatus Gottesmanbacteria 
bacterium RIFCSPHIGHO2_01_FULL_47_48]
Length=243

 Score = 47.5 bits (109),  Expect = 0.005, Method: Composition-based stats.
 Identities = 23/151 (15%), Positives = 46/151 (30%), Gaps = 12/151 (8%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL-ILVVGGGSLLL  221
                +      +       +    V L     + L  + +F         ++    +L  
Sbjct  88   LFFALKDLKLVLKYVATSLVSGFAVFLAVIPMMILYFLPAFIWGGRAGGEVISLVLALAY  147

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            +   +   V   +  YVL D    G +AL  S  +  G            +++ +    +
Sbjct  148  LPVLIYVAVRLSYWVYVLVDQRKWGSEALRISWNVTKGK-----------VLLIIAFGVV  196

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             A I   G    L   ++  P +F+    IY
Sbjct  197  LALINVAGAILLLLGLVVTLPMTFIAIARIY  227


>MBD3679054.1 zinc-ribbon domain-containing protein [Rhodobacteraceae bacterium]
Length=588

 Score = 48.6 bits (112),  Expect = 0.005, Method: Composition-based stats.
 Identities = 9/41 (22%), Positives = 15/41 (37%), Gaps = 0/41 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
            + CP+C A  +     +P K    +C  C +T        
Sbjct  2   RLECPNCSAHYDVDERVIPVKGRDVQCSNCGKTWFQKHPSQ  42


>HAB91770.1 hypothetical protein [Pseudomonas sp.]
Length=214

 Score = 47.1 bits (108),  Expect = 0.005, Method: Composition-based stats.
 Identities = 26/194 (13%), Positives = 68/194 (35%), Gaps = 1/194 (1%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             +  L + L +  + +   L        Q                L+G+  M     I +
Sbjct  15   LLGSLAVTLLWPAMMAGFFLAFKHAKQKQAVTANDLFEPFKAPASLIGVGGMYLLASIVL  74

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                  +       +  + +  + +  +++ +   +L+ I   L   + F F   ++ + 
Sbjct  75   FLVLALVAFLSLGSITAIMNGQIDMGRMVVGLVILTLIAIPASLALAMAFIFAPVLVHEH  134

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             +  ++A+++S      +       F++L      +S L   IP++G    +A +++  P
Sbjct  135  QVPVIEAIKRSFFGSLRNILPFIVFFMILTAALFVIS-LFVAIPFLGWLVGIAVAIIYLP  193

Query  303  FSFLYYYLIYSDLK  316
                  +  Y D+ 
Sbjct  194  LFCGALFCAYRDIF  207


>NOY64066.1 hypothetical protein [Nitrospirae bacterium]
Length=74

 Score = 44.0 bits (100),  Expect = 0.005, Method: Composition-based stats.
 Identities = 10/39 (26%), Positives = 16/39 (41%), Gaps = 0/39 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  V CP C  +      K+    +  +CP+C   L+  
Sbjct  1   MVIVICPKCRVKLKIADEKVSPGGTRFKCPKCTTILMVR  39


>WP_125933312.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Kiloniella majae]
Length=136

 Score = 45.9 bits (105),  Expect = 0.005, Method: Composition-based stats.
 Identities = 20/102 (20%), Positives = 47/102 (46%), Gaps = 1/102 (1%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
                +++  P  +F + FF    +   +    ++AL+++  L  G+++AI G F+L  V+
Sbjct  6    VVLDIVIQAPTYIFSILFFILMPLTVIEKTSFIKALKRTLELSKGNYFAILGLFLLNFVV  65

Query  275  SLTLSFLTAR-IPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
            +     +      +  E  +L   LL +P  F+  ++++   
Sbjct  66   AAIFGAVIWACFSFFAEFPDLFDMLLGSPDVFVQTFVLFVQW  107


>MBE5763510.1 hypothetical protein [Clostridiales bacterium]
Length=288

 Score = 47.8 bits (110),  Expect = 0.005, Method: Composition-based stats.
 Identities = 32/295 (11%), Positives = 77/295 (26%), Gaps = 17/295 (6%)

Query  45   RTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSIS  104
              +                +    +     T+     +    +    +        +   
Sbjct  1    MRKYFIQTVEYAKKNWVSVLLHVLIPAILYTIFINPTSSFDYVMTNIKELDVQHAYQIFV  60

Query  105  QLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATV  164
             +   S             I    +++  + I +            +     +       
Sbjct  61   TINDGSRWNSWEYVLFYFLIAAATLIVFSSFIGNVQNKMRYGRTIYEGFGGVFKRTNENF  120

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
               L     +  +M +Y     V ++  +K+         L+ ++    +          
Sbjct  121  FATLRAGIALVVAMEVYALIMSVVIYFVIKVSANAAVRIILVSLIGGGFIIA--------  172

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA-  283
                  W       +   N G  +++++S  +V      IFG FVL +VIS     + + 
Sbjct  173  MFYGAAWLACALPNMTMRNEGLFKSIKRSMSMVKDKQIKIFGDFVLPIVISFIPLLICSA  232

Query  284  --------RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
                     +  V       F L    +  +  Y+I+ D+    R   +   K +
Sbjct  233  FDIAFDHVILTIVKYVCIFIFYLFSFAYYIILMYVIFFDVNEIEREDLNLQNKWR  287


>NCO17101.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=26

 Score = 42.8 bits (97),  Expect = 0.005, Method: Composition-based stats.
 Identities = 8/27 (30%), Positives = 11/27 (41%), Gaps = 1/27 (4%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSA  27
           M  V CP+C A+       +PA     
Sbjct  1   MRLV-CPNCDAQYEVGDDAIPAGGRDV  26


>RKH55340.1 hypothetical protein D7W81_36500, partial [Corallococcus aberystwythensis]
Length=365

 Score = 48.2 bits (111),  Expect = 0.005, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+C  C      P  K+  K    RC +C  T 
Sbjct  1   MI-VQCEQCQTRFKIPDEKVTEKGVKVRCTKCQNTF  35


>MBD5085539.1 DUF975 family protein [Clostridiales bacterium]
Length=310

 Score = 47.8 bits (110),  Expect = 0.005, Method: Composition-based stats.
 Identities = 28/188 (15%), Positives = 59/188 (31%), Gaps = 9/188 (5%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
              G   L + LL   +  A I +       T  +             T A       W +
Sbjct  79   YFGPQRLALALLVGFVLAALITTLWTDFMRTGYSNFCLGMARGEQPLTNALFSHFPQWGS  138

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLL-------LILLILVVGGGSLLLIIPGLLF  228
                 ++      L+  +   +     F           +L ++VV    L+  +  +  
Sbjct  139  VLFTKFLSGVFCTLWELLFGMVGLGFLFLTALLFGEMEGLLALMVVTLSYLVYSLGCMWV  198

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVS--GHWWAIFGRFVLLLVISLTLSFLTARIP  286
             + +    +++AD  + G+ A+ +SR LV   G+   +F   +  +   L  + +     
Sbjct  199  TLRYAMVDFLIADQGLTGMDAIRESRRLVRDNGNTGRLFILELSFIGWHLVEAAIFLVAI  258

Query  287  YVGEAANL  294
              G     
Sbjct  259  LAGVVMFG  266


>MBC7978701.1 zinc-ribbon domain-containing protein [Myxococcales bacterium]
Length=219

 Score = 47.1 bits (108),  Expect = 0.006, Method: Composition-based stats.
 Identities = 9/59 (15%), Positives = 14/59 (24%), Gaps = 0/59 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
           VRC  C  E     ++L     + +C  C               T            + 
Sbjct  3   VRCEKCQTEYELDEARLKPGGVTVKCTTCGHMFKIRTRTITNVGTPVPRDQKETGRHRP  61


>MAR57047.1 hypothetical protein [Rickettsiales bacterium]
Length=279

 Score = 47.8 bits (110),  Expect = 0.006, Method: Composition-based stats.
 Identities = 11/45 (24%), Positives = 17/45 (38%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M   +CP C A     +S +P      +C +C  T    P   + 
Sbjct  1   MIL-QCPECSARFLVDNSLIPVDGREVKCAKCAHTWHAMPEAPEP  44


>PHS29425.1 hypothetical protein COA85_00785 [Robiginitomaculum sp.]
Length=295

 Score = 47.8 bits (110),  Expect = 0.006, Method: Composition-based stats.
 Identities = 8/37 (22%), Positives = 11/37 (30%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  + CP C    +  +  L       RC  C     
Sbjct  1   MI-ITCPDCSTHYSVTAKALGQSGREVRCASCGHKWF  36


>TSC72604.1 hypothetical protein G01um101438_321 [Parcubacteria group bacterium 
Gr01-1014_38]
Length=265

 Score = 47.5 bits (109),  Expect = 0.006, Method: Composition-based stats.
 Identities = 26/150 (17%), Positives = 41/150 (27%), Gaps = 0/150 (0%)

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
                  T  + ++     +          R      L    L L     +LLL++P  + 
Sbjct  115  WIAFVWTWLLSVFAVALSLLPGMLFFWWARIGLQPVLEGSGLSLFALIVALLLVLPAFIV  174

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
              W+ F     A  +  G  AL  S  LV+G    +FG      +  L  S L       
Sbjct  175  ASWYAFSLIPAARGDAWGSDALRISHRLVAGVTGQVFGLLFAWFLFELLFSILLNAFFPG  234

Query  289  GEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
                        T      Y ++       
Sbjct  235  LPLFTGFVYYYTTTILGSAYLVVIYQALRR  264


>WP_115603384.1 zinc-ribbon domain-containing protein [Lujinxingia sediminis]RDV39939.1 
hypothetical protein DV096_05080 [Bradymonadaceae 
bacterium TMQ3]RVU48015.1 hypothetical protein EA187_00855 
[Lujinxingia sediminis]TXC77314.1 hypothetical protein FRC91_00860 
[Bradymonadales bacterium TMQ1]
Length=463

 Score = 48.2 bits (111),  Expect = 0.006, Method: Composition-based stats.
 Identities = 11/39 (28%), Positives = 14/39 (36%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  +RCP C    N P  ++  K    RC  C       
Sbjct  1   MI-IRCPECSTGFNLPDERVSEKGVKLRCSRCSHVFRVR  38


>NNG01870.1 hypothetical protein [Desulfobacteraceae bacterium]
Length=99

 Score = 44.8 bits (102),  Expect = 0.006, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + C  C    N   S + A  S  RC  C    
Sbjct  1   MI-ITCNECTTSYNLDDSLIKADGSKVRCTNCQTVF  35


>WP_068084195.1 zinc-ribbon domain-containing protein [Pseudovibrio stylochi]
Length=286

 Score = 47.8 bits (110),  Expect = 0.006, Method: Composition-based stats.
 Identities = 9/63 (14%), Positives = 17/63 (27%), Gaps = 0/63 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + C  C       + ++ A     +C  C +     P      +T D      H     
Sbjct  2   KITCESCQTSYRIGAEQIGAGGRKVKCARCGKIWHAVPVADGVGETPDERDQSEHYEDVS  61

Query  63  RIP  65
           +  
Sbjct  62  QPY  64


>TDX43691.1 hypothetical protein C7959_1617, partial [Orenia marismortui]
Length=306

 Score = 47.8 bits (110),  Expect = 0.006, Method: Composition-based stats.
 Identities = 32/217 (15%), Positives = 71/217 (33%), Gaps = 18/217 (8%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +  + +   V +   IF  L L    W N         I+   +    L + ++      
Sbjct  18   MFMVNISFFVKSMVGIFITLALTLFLWNNNPIPIVDDFIIGFLILIFSLLIYYVVRYSLD  77

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLL---------------LILLILVVGGGSLLLIIPG  225
               K    +  S       +G    L                 L  + +   +L++ + G
Sbjct  78   IYNKKKEKIKFSDIFCNETIGQIFSLLVVILAGGIFAYFPKFFLNKINLLISNLIVAVIG  137

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            +       F  + + D+  G L  +E+S +L   +        ++LL ++L    +   +
Sbjct  138  IYIFSRLIFTLHFMVDEKAGALDGIERSHILTKENILESTILLLILLTLNLPPMVVLFTL  197

Query  286  ---PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
                 +    ++A+  +  P+S     +IY +L    
Sbjct  198  AKTNLLANLLSIAYLSISIPWSIFISVVIYCNLVKEK  234


>QDT38887.1 hypothetical protein Pan189_32860 [Planctomycetes bacterium Pan189]
Length=403

 Score = 48.2 bits (111),  Expect = 0.006, Method: Composition-based stats.
 Identities = 50/323 (15%), Positives = 84/323 (26%), Gaps = 22/323 (7%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
               RC  C     TP  K      SA CPEC   +                 +      +
Sbjct  3    IEFRCSGCDKLLRTPEDK---AGKSANCPECGHRVTVPAVAPPPEVAFSPADSNASGPGE  59

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                S     Q+        + ++ L  ER+    G     +   ++      CR     
Sbjct  60   DDFDSSSDFDQNVDGGDDGYDPNWDLTRERQSTPDGDQTCPMCGTVSGPESSRCRACGED  119

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS---------  172
            L I  L        I     ++        + +   A  L                    
Sbjct  120  LSISNLNARPGRQSIDIGEAMRDGMEWLKPDLSIMIAATLIFGMLFWFIYMVAYFSSLIL  179

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
                +  +           ++   +  +G  +L+ ILL       +L             
Sbjct  180  AGLVAAMVQGGGGGRPADETLFGMIVFLGGGSLMTILLGPPYTFLTLGFTRLTFNVLRGN  239

Query  233  FFCQYVLADDNIGGLQALEKSRLL------VSGHWWAIFGRFVL----LLVISLTLSFLT  282
                  L     G L A+  +          SG   A+F    L    +    + L  L+
Sbjct  240  TADLTDLFRGVQGFLPAIIINLFYGVLVLVTSGPAMALFMFGALAEDEVAGSGVGLILLS  299

Query  283  ARIPYVGEAANLAFSLLLTPFSF  305
              I  +G    LAF + L P+ +
Sbjct  300  GTIFLIGSLLQLAFCVYLWPYYW  322


>HEW91442.1 hypothetical protein [Thermotogaceae bacterium]
Length=201

 Score = 46.7 bits (107),  Expect = 0.006, Method: Composition-based stats.
 Identities = 34/188 (18%), Positives = 70/188 (37%), Gaps = 7/188 (4%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               I  +  ++A       +L   +  +          + +     +L    W+   +  
Sbjct  19   HYPIIGIPPLIASFVSLLMILAVSSGGIAMFGFRMFI-LSVVEWGVVLAMQGWLVAMLDE  77

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +    V L  S    +   GS  +  +++ ++   G    IIPG+L  V        + 
Sbjct  78   ILVNEKVDLKNSWFKVMEVFGSLLITGLIVSILTSVGMTFFIIPGILIMVSLSVAIPAIV  137

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
               +G   +L +S   +  +    F   +L++VI + LSF    IP++G       + L 
Sbjct  138  KKKLGVTDSLRESINFI--YSQGNFWIILLIIVIGVLLSF----IPFIGAVLGSFLTSLW  191

Query  301  TPFSFLYY  308
             P+++L Y
Sbjct  192  IPYAYLKY  199


>WP_091571326.1 hypothetical protein [Melghirimyces thermohalophilus]SDC76755.1 
hypothetical protein SAMN04488112_1162 [Melghirimyces thermohalophilus]
Length=194

 Score = 46.7 bits (107),  Expect = 0.006, Method: Composition-based stats.
 Identities = 29/163 (18%), Positives = 60/163 (37%), Gaps = 9/163 (6%)

Query  157  WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG  216
                L  +    L        +   +   +  L  +    + ++    L+  +  LVV  
Sbjct  22   MISFLLYLLLFFLLSIPFASMVKQELNDGETTLPITFSNIVDYIIPVLLIGGIYGLVVIV  81

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
            G  LLI+PGL+  V  F   +    +     QAL+K++ +    ++ +    +L+ + ++
Sbjct  82   GLNLLIVPGLICLVLLFLFPFAYVIEGCSWKQALKKAKSIGGERFFQLLSSVLLIFLTNV  141

Query  277  TLSFLTAR---------IPYVGEAANLAFSLLLTPFSFLYYYL  310
                L +          I  + +    +F L +      YYYL
Sbjct  142  IGKLLGSVAYTGLGDARIVLITQLGFYSFILPVHLIYISYYYL  184


>HCN23773.1 hypothetical protein [Candidatus Marinimicrobia bacterium]
Length=147

 Score = 45.9 bits (105),  Expect = 0.006, Method: Composition-based stats.
 Identities = 31/122 (25%), Positives = 51/122 (42%), Gaps = 0/122 (0%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
            +   +++  A  +          I +  +  I   +   T S+        V +    K 
Sbjct  24   MLDQIVVIIANSVIFLIACAVLGITIIGIIAIPAVIGGFTESLIRAARGNKVEIGDFFKA  83

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
            G    G+     IL +L VG G + LIIPG+   V +FF  Y++ D  +G  +A EKS  
Sbjct  84   GFNKFGTLLGAGILFMLGVGIGLICLIIPGIYLMVRWFFVSYLIVDKGVGVSEAFEKSGE  143

Query  256  LV  257
            +V
Sbjct  144  MV  145


>MBD3675420.1 hypothetical protein [Planctomycetaceae bacterium]
Length=356

 Score = 48.2 bits (111),  Expect = 0.006, Method: Composition-based stats.
 Identities = 17/136 (13%), Positives = 23/136 (17%), Gaps = 0/136 (0%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M    CP C A           K +   C +          +S     T  I + P    
Sbjct  1    MIEFNCPSCQALIRVGEKASGKKGTCPSCRKKIVVPEKSVTQSPPAIDTPPIPSNPPEQS  60

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
            Q   P         T               +                   W      G  
Sbjct  61   QSDNPPTFDPANVPTDPPAASAEPVIQIKPQTSPTRRRRKGRRRSSKGGLWVPVLCAGIL  120

Query  121  LLGIYLLGIVLAFAPI  136
               I      +     
Sbjct  121  FGFIGWYFFQINAGLS  136


>PIR22608.1 hypothetical protein COV44_07220 [Deltaproteobacteria bacterium 
CG11_big_fil_rev_8_21_14_0_20_45_16]
Length=609

 Score = 48.6 bits (112),  Expect = 0.006, Method: Composition-based stats.
 Identities = 13/45 (29%), Positives = 18/45 (40%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M    CP C A+     SK+ A  +  RCP C  T +      + 
Sbjct  1   MIL-SCPSCSAKYQLADSKIKASGTKVRCPRCSHTFLVYQKGQEP  44


>MAZ95616.1 hypothetical protein [Planctomycetaceae bacterium]
Length=601

 Score = 48.6 bits (112),  Expect = 0.006, Method: Composition-based stats.
 Identities = 43/322 (13%), Positives = 89/322 (28%), Gaps = 18/322 (6%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             ++ C  CG   N  S  +       RCP+C  +        +  + +    T     L 
Sbjct  257  FSINCLLCGTRLNVSSDDI---GGEVRCPDC-HSHTTIREPRKEKRPSSKPLTAEKDQLV  312

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
               P +     S     +        + E E +       +             +    +
Sbjct  313  PGPPIEPARSVSNEGKKQAQAFMEKARAEVESKQEQFTETANRGWWMTILSSMVQIDIAI  372

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +    I           LL     +            +  +A     L  +   +   
Sbjct  373  RILMHSIIGFLAVLTLHYGLLMEEGIMAMLGLFLFATSFIFFLALGASFLVSVVTIIETR  432

Query  182  ICK-----TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV-----W  231
                        L     + +      +L + ++ L++       + P ++  V      
Sbjct  433  ANGDHEIENWPSLILMEWISMVPYMGISLFIAVIPLIIWIPLTEGLDPAIVLSVGSMLTL  492

Query  232  FFFCQYVLADDNIGGLQALEKSRLL----VSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
              F   +L+  + G   A+  SR++    +    W  F  FVLLL  ++  S        
Sbjct  493  LLFAPVILSTLSAGTPAAIVSSRVMASIALRPSEWFKFAGFVLLLSPAMAGSMALLGGSG  552

Query  288  VGEAANLAFSLLLTPFSFLYYY  309
                   + S ++T F +L+  
Sbjct  553  PWPILAGSASFMVTSFFYLHVV  574


>TMA22728.1 hypothetical protein E6J85_03870 [Deltaproteobacteria bacterium]TMB25934.1 
hypothetical protein E6J62_20935 [Deltaproteobacteria 
bacterium]TMB34577.1 hypothetical protein E6J61_03140 
[Deltaproteobacteria bacterium]
Length=438

 Score = 48.2 bits (111),  Expect = 0.006, Method: Composition-based stats.
 Identities = 10/64 (16%), Positives = 18/64 (28%), Gaps = 1/64 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V+CP+C +       K+  +    RC  C        +  +     +     P    
Sbjct  1   MV-VQCPNCRSRFRVADEKVSERGVRVRCSACKTVFAVRKSGIETNPAGNEKNRAPATAG  59

Query  61  QRRI  64
               
Sbjct  60  APDP  63


>OJV14073.1 hypothetical protein BGO27_01125 [Alphaproteobacteria bacterium 
33-17]
Length=225

 Score = 47.1 bits (108),  Expect = 0.006, Method: Composition-based stats.
 Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M  V CP C +  +  ++ LP +    RC +C      +  +   
Sbjct  1   MI-VTCPSCFSRFSLDNNILPPRGRKVRCSKCKNEWHQEHPDFSF  44


>HBQ14507.1 hypothetical protein [Myxococcales bacterium]
Length=513

 Score = 48.2 bits (111),  Expect = 0.006, Method: Composition-based stats.
 Identities = 14/38 (37%), Positives = 16/38 (42%), Gaps = 0/38 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
            V CP C AE      +LPA     RCP+C       P
Sbjct  2   QVACPSCSAEYPVDERRLPASGLKMRCPKCGARFHVHP  39


>NIS62866.1 hypothetical protein [Proteobacteria bacterium]
Length=44

 Score = 42.8 bits (97),  Expect = 0.006, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 16/36 (44%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V C  C  + N  +SK+ A+    RC +C    
Sbjct  1   MI-VECEVCQTKYNLENSKITAQGVKVRCVKCQNIF  35


>KRK27772.1 glycerophosphodiester phosphodiesterase [Lactobacillus acidophilus 
DSM 20079 = JCM 1132 = NBRC 13951 = CIP 76.13]
Length=454

 Score = 48.2 bits (111),  Expect = 0.006, Method: Composition-based stats.
 Identities = 18/124 (15%), Positives = 48/124 (39%), Gaps = 9/124 (7%)

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
             +  F +  +    ++    ++  +   +    F     ++        +A++KS  L+ 
Sbjct  18   KIPQFIIDYMTRSGILLAILIIFYLVIFVLAFRFLLTLPIMVIYKTTTREAMKKSAFLMK  77

Query  259  GHWWAIFGRFVLLLVISLTLS-------FLTARI--PYVGEAANLAFSLLLTPFSFLYYY  309
             +W  + G FV L ++++ +        +L   I   + G+ A +  ++ LT F  +   
Sbjct  78   KNWRKVIGFFVPLGILAIVIMAINVGGAYLLQLIWDTFPGKFALIMATINLTLFQIISEV  137

Query  310  LIYS  313
             +  
Sbjct  138  FLIW  141


>WP_013628776.1 RDD family protein [Rubinisphaera brasiliensis]ADY60052.1 RDD 
domain containing protein [Rubinisphaera brasiliensis DSM 5305]
Length=610

 Score = 48.6 bits (112),  Expect = 0.006, Method: Composition-based stats.
 Identities = 43/309 (14%), Positives = 88/309 (28%), Gaps = 17/309 (6%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIA-TCPHCGLQR  62
            V+CP CG +   PS K  A   +       Q      A        D          L +
Sbjct  25   VKCPACGKKMRVPSGKATAGAKTRSSRSRRQKSDDGQASDAFLNNLDLDRLVDDEVQLCQ  84

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
            +  ++    Q++  NC         + +R  + +  G+   S   +   E F   G    
Sbjct  85   KCATEIPPEQTQCPNCGFDPSHLTAEGQRRQKMAAKGIDPASFYESVWKESFAYAGRNFG  144

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             +     +L+F  + S L      W+        W +       + +G  ++  +  I +
Sbjct  145  QVMKTAFILSFCLMLSGLCGYFLIWVATGPPFAFWTLFTTVAVMMPIGWLFVQHTEIIAL  204

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                    + ++           +  L  +++ G    ++  GL   +      Y +   
Sbjct  205  TLKRKDEIKKIRFDFAQ-CGMNGIKTLAWILIFGLPFWILFGGLGVLLNTMEVPYGMPIG  263

Query  243  NIGGL--------QALEKSRL------LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
                         QA+    +            W   G       + + + F+   IP +
Sbjct  264  MGMAAFFVLLFTPQAMSHMTMPVETPAWFFPKIWPTLGVSAAPGAMWVLVFFVAN-IPAI  322

Query  289  GEAANLAFS  297
               A     
Sbjct  323  ATIAGTVAL  331


>GEU36801.1 activating signal cointegrator 1 complex subunit like [Tanacetum 
cinerariifolium]
Length=989

 Score = 48.6 bits (112),  Expect = 0.006, Method: Composition-based stats.
 Identities = 28/236 (12%), Positives = 64/236 (27%), Gaps = 51/236 (22%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              I    +                T    +   +Q+  L+    + LL  + +  ++   
Sbjct  44   FAILAHSLFTHPLITKIQDPYGSHTSQWEKLLIFQFCYLIFLFIFSLLSTAAIVFTVASL  103

Query  182  ICKTDVGLF-------RSMKLGLRHVGSFTLLLILLILVVGGGSLLL-------------  221
                 V  +          K          L +++  +V  G  +LL             
Sbjct  104  YTSKPVSFYSTLSAIPMCFKRLFITFLWVALTMVVYNVVFLGFVVLLIVAIDTRNLVLFF  163

Query  222  ----------IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA----IFGR  267
                      ++  +     +     +   +N+ G  A++K+  ++ G        +FG 
Sbjct  164  FSLVVVFVLFLVVHVYITALWHLASVISVLENVYGFAAMKKAFEILKGKAKMGSVLVFGY  223

Query  268  FVLLLVISLTLSFLT-----------------ARIPYVGEAANLAFSLLLTPFSFL  306
            FV+   I+     +                    +  V    NL   L+ + F + 
Sbjct  224  FVICGAINGLFGSIVIHGGEYYYGVFSRIVVGGFLVGVLVIVNLVGLLVQSVFYYH  279


>NOZ34417.1 hypothetical protein [Chlorobi bacterium]
Length=189

 Score = 46.7 bits (107),  Expect = 0.006, Method: Composition-based stats.
 Identities = 28/142 (20%), Positives = 59/142 (42%), Gaps = 0/142 (0%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             + +  L + L    IF+A +LK  T  +           L      ++ ++    S+++
Sbjct  48   FVAVTFLMVSLLANDIFNAAVLKTNTPTSTIIYAVILIFSLMFGMLAVVIVTHSYISLYV  107

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
               K +       +L  +++       IL+ ++V  G LL  IPG+   V   F   ++ 
Sbjct  108  SKGKGNFTKDDVGELLKKNLWKVFGAGILVYIMVVIGFLLFYIPGIYLAVATIFIFLIII  167

Query  241  DDNIGGLQALEKSRLLVSGHWW  262
             +N    +++ +S  ++ G WW
Sbjct  168  YENKSATESISRSFEIIKGKWW  189


>NNL65406.1 histidine kinase [Myxococcales bacterium]
Length=36

 Score = 42.8 bits (97),  Expect = 0.006, Method: Composition-based stats.
 Identities = 8/32 (25%), Positives = 11/32 (34%), Gaps = 1/32 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPEC  32
           M    CP C  +      +L  +    RC  C
Sbjct  1   MIA-ACPECRTQFRVDPQRLSGEGVRLRCTRC  31


>NLZ01476.1 hypothetical protein [Pirellulaceae bacterium]
Length=653

 Score = 48.6 bits (112),  Expect = 0.006, Method: Composition-based stats.
 Identities = 31/330 (9%), Positives = 71/330 (22%), Gaps = 22/330 (7%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              V CP CG        ++       RCP+C   +              ++ +    G+ 
Sbjct  304  IGVNCPVCGTRVQATERQV---GGQVRCPDCDSPITVPAPPKDALPQPFDLDSAGEYGVS  360

Query  62   RRIPSDRLEIQ------SKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFC  115
                 + L         +   +         +               +  L         
Sbjct  361  APAHVEPLRPALLRSASADPGDDPDYAFLRDIDDPHLRWQLQDRSDELGFLGHPEASRRW  420

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                       L    A   +   +  + A              +L  + ++ L    + 
Sbjct  421  LAYSMSGAAACLLPAFALILLGMPVGDETAAVTWFGGCVLLGGAVLVLLGWLALFSVNLL  480

Query  176  GSMFIYICKTDVGLFRSMKLGL-----RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
              +        V                       L+ L+ + +   +     P  +  +
Sbjct  481  AIVEDTSAGNHVVENWPDGDWGGSIGESFYALNACLVSLIPIAILLQTYPPARPLAIHLL  540

Query  231  --WFFFCQYVLADDNIGGLQALEKSRLLVS----GHWWAIFGRFVLLLVISLTLSFLTAR  284
               F+    +     +    AL      V         A    +     I L    +   
Sbjct  541  TAGFWLAYPLALLSMLETGSALTPVSAAVWSSLARRPQAWLWFYFRSAGIVLVA--VAMY  598

Query  285  IPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
            + +           LL P +  +  + +  
Sbjct  599  LAFWRHLVGPLALPLLAPVATTFAMVYFRW  628


>HAD27950.1 hypothetical protein [Rhodobacteraceae bacterium]
Length=173

 Score = 46.3 bits (106),  Expect = 0.006, Method: Composition-based stats.
 Identities = 9/38 (24%), Positives = 13/38 (34%), Gaps = 0/38 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
            + CP C A+     S +P      +C  C      D 
Sbjct  2   RIVCPRCVAKYEIDESIIPEIGREVQCENCENIWFQDF  39


>HER25631.1 hypothetical protein [Rhodospirillales bacterium]
Length=252

 Score = 47.5 bits (109),  Expect = 0.007, Method: Composition-based stats.
 Identities = 9/74 (12%), Positives = 15/74 (20%), Gaps = 1/74 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP+C    +   + + A   +  C  C                T      P    
Sbjct  10  MI-ITCPNCATHYSLDPASVGAVGKNVHCTNCGTGWYHTIVAQDPVAQTPAAPVAPPPAP  68

Query  61  QRRIPSDRLEIQSK  74
                         
Sbjct  69  VPAPVPAFDPAPMM  82


>MBC6497248.1 zinc-ribbon domain-containing protein [Alphaproteobacteria bacterium 
GM7ARS4]
Length=151

 Score = 45.9 bits (105),  Expect = 0.007, Method: Composition-based stats.
 Identities = 25/146 (17%), Positives = 43/146 (29%), Gaps = 1/146 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  V CP CG + +  S  +P K   ARC  C   +   PA +            P    
Sbjct  1    MI-VCCPSCGTQYSVLSYLIPKKGRLARCSHCGYVMTIYPAVTASPDPDAQQGAKPLPES  59

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
               +P        K       NR       +  +A     +     +A +  + C     
Sbjct  60   ASLVPLGMKRDGDKEDTDHARNRPAQGPLSQYRQAQKRQRKPAIIPVAMALLIACAFFIA  119

Query  121  LLGIYLLGIVLAFAPIFSALLLKPAT  146
             +   L+      A + + ++     
Sbjct  120  SVYYALIYPCYEQATLTNHVISCSFL  145


>MBI5286697.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=146

 Score = 45.9 bits (105),  Expect = 0.007, Method: Composition-based stats.
 Identities = 12/82 (15%), Positives = 22/82 (27%), Gaps = 1/82 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V+C  C A+     SK+  K    RC +C    +  P      +            +
Sbjct  1   MI-VQCDKCKAKFRLNDSKVTGKGVKVRCSKCRNLFMVTPPPPSVEEVPPRKEAPFGVSV  59

Query  61  QRRIPSDRLEIQSKTVNCRRCN  82
                +   + +          
Sbjct  60  APPEETTPKKEEPFEFPFGSME  81


>MSR80878.1 hypothetical protein [Gemmataceae bacterium]
Length=340

 Score = 47.8 bits (110),  Expect = 0.007, Method: Composition-based stats.
 Identities = 44/295 (15%), Positives = 81/295 (27%), Gaps = 13/295 (4%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            MP + CP C +        L       +CP C QTL    A S  T              
Sbjct  1    MPQIACPTCKSRMTVRDEDL---GKMVQCPGCKQTLQTQAAVSTPTADVSKAVPQLKQTT  57

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
            + +    +    +K     +           E   S     +  +   D+ ++  ++   
Sbjct  58   KIQAALPKASNLTKPSVQIKKPVVKKEPEPEEEEPSDLEAYNEVEQEEDTGKISRKKRIW  117

Query  121  LLG-IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
            L       G+ L    +   L+      L   +  +     L  VA  +  L+ +   + 
Sbjct  118  LGWDAGYKGLSLTSVGMVVYLVGAVLGGLGWTSYFFTKQDTLFYVALGISILTGLVFLVL  177

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                ++ + +    K+    +     L+   +  +  G    I    +  V F       
Sbjct  178  DVWGRSLLTMIPPEKVPNGPMLMILSLVGFCLCYLMAGLSAAIPLAAIGAVLFL------  231

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
                 G   AL    L+  G        F L   IS     +   I        +
Sbjct  232  ---GAGYWTALFGYWLMCGGLNNMSLTSFFLTHAISTIALNIIGAIFLAVLFLIM  283


>WP_078790124.1 zinc-ribbon domain-containing protein [Geobacter thiogenes]SJZ85259.1 
zinc-ribbon domain-containing protein [Geobacter thiogenes]
Length=654

 Score = 48.6 bits (112),  Expect = 0.007, Method: Composition-based stats.
 Identities = 10/67 (15%), Positives = 15/67 (22%), Gaps = 0/67 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP C      P  K+P    +A C  C              +       C    +  
Sbjct  2   KITCPGCQWSAEVPDEKIPVGGVTATCRNCQVKFQVLRESVSTAEPEFFCPQCGTGQMVS  61

Query  63  RIPSDRL  69
                  
Sbjct  62  DTCIHCH  68


>WP_028640253.1 hypothetical protein [Novosphingobium acidiphilum]
Length=232

 Score = 47.1 bits (108),  Expect = 0.007, Method: Composition-based stats.
 Identities = 20/91 (22%), Positives = 38/91 (42%), Gaps = 0/91 (0%)

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
            +  +  + S+ +L  L  L +  G   LI+PG++  + +           +G + +L  S
Sbjct  90   RWSIMSLLSYAVLSTLAGLAMLAGFAALIVPGVVLAIRWSIAGNFAIARGLGPVDSLRAS  149

Query  254  RLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
                 G    IFG   L+ +IS+   F  + 
Sbjct  150  WRATRGSAGGIFGALALVGMISVIGEFALSG  180


>MBC7329443.1 hypothetical protein [bacterium]
Length=164

 Score = 45.9 bits (105),  Expect = 0.007, Method: Composition-based stats.
 Identities = 20/158 (13%), Positives = 53/158 (34%), Gaps = 1/158 (1%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
            +       +              +  ++IL        L L  ++     +    ++   
Sbjct  1    MPILFFLFSTTSLYLLDETILRYSPLFSILSFIFYLTSLLLFPLSFLSLRHSLSLNLPFK  60

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
            +S+++  + +  F LL+ +L + +  GS +L     +    +F        +    + +L
Sbjct  61   QSLQISFKKLPRFLLLVFIL-ICILLGSFILGFIPFILAFSWFILSPYPLLEGYEIVDSL  119

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
              S+ LV G +W        + +  L +      + + 
Sbjct  120  LLSKYLVRGRFWRTLLNLSAITLPMLAIGIGIPLLFFF  157


>HEC14613.1 hypothetical protein [Rhodospirillales bacterium]
Length=79

 Score = 44.0 bits (100),  Expect = 0.007, Method: Composition-based stats.
 Identities = 10/41 (24%), Positives = 14/41 (34%), Gaps = 1/41 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
           M  + CP+C    +     L       RC  C  T   +P 
Sbjct  35  MIII-CPNCSTRYSIDPGSLGNMGKPVRCSNCQHTWHQEPN  74


>PIE16743.1 hypothetical protein CSA66_07135 [Proteobacteria bacterium]
Length=174

 Score = 46.3 bits (106),  Expect = 0.007, Method: Composition-based stats.
 Identities = 26/158 (16%), Positives = 57/158 (36%), Gaps = 0/158 (0%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            + +F       L + LL     F  +   L  + +   +     W  AI    +  + + 
Sbjct  14   FSVFAGNALAFLLLTLLFYSPVFVLVAVGLHGERSFLDDGTIAIWLLAIFGLGILMVNIA  73

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
               +  S+   +      L +S+  G+  V +   + ++   ++  G    I+PGL+   
Sbjct  74   APAVMASVVDQLRGRRTSLRKSLMRGVARVPAVIGVAVVTGFLLALGVYTYILPGLMVVT  133

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
              +        +  G   ++++S  L +G  W I G  
Sbjct  134  ILWVAVPAAVIEGAGVGASIKRSDRLTNGVRWQILGVV  171


>PYX29213.1 hypothetical protein DMG77_13235 [Acidobacteria bacterium]
Length=364

 Score = 47.8 bits (110),  Expect = 0.007, Method: Composition-based stats.
 Identities = 19/245 (8%), Positives = 68/245 (28%), Gaps = 2/245 (1%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            L + L+ + ++    F           + +    +  +         L  + +  +  + 
Sbjct  91   LVLGLVLMYVSSVMRFILFDSVLVKECHIRQGWSRRQVPGVRFFVWHLLYTLVMIAGIVV  150

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +    +G    +            L++  IL+     + ++   +++ +   F    +A 
Sbjct  151  LVGIPLGFAFVVGWLKEPRQHLAPLILGGILLFLLLVIFVVTASVVYVLTKDFVVPQMAL  210

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA--FSLL  299
            +NI   +   +   ++        G   + +V+++  +        +     +     L 
Sbjct  211  ENIDAFEGWRRLWRMLKAEKGGYAGYVAMKIVLAIAAAITVGVATLILALIFVIPTALLA  270

Query  300  LTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSA  359
            +          +  ++           I         A+    +I      ++       
Sbjct  271  IVAVLTGKTAGMTWNVYTITVAVVVGCILLAIFLYLVALISVPVIVFFPAYAIYFFAARY  330

Query  360  EQLLS  364
              L +
Sbjct  331  PALSA  335


>WP_181402181.1 zinc-ribbon domain-containing protein, partial [Nostoc sp. 3335mG]PXA92098.1 
thioredoxin, partial [Nostoc sp. 3335mG]
Length=167

 Score = 46.3 bits (106),  Expect = 0.007, Method: Composition-based stats.
 Identities = 9/63 (14%), Positives = 14/63 (22%), Gaps = 1/63 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    C  C        S +     + RC  C  +    PA                  +
Sbjct  1   MIL-ECTQCHMRYLVADSAIGPAGRTVRCAGCRHSWFQPPAMLDMGNGAVPPPPADPVRV  59

Query  61  QRR  63
            + 
Sbjct  60  PQP  62


>MBF0496839.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=30

 Score = 42.4 bits (96),  Expect = 0.007, Method: Composition-based stats.
 Identities = 8/26 (31%), Positives = 14/26 (54%), Gaps = 0/26 (0%)

Query  5   RCPHCGAERNTPSSKLPAKKSSARCP  30
            CP C ++ + P  K+P   ++  CP
Sbjct  4   ECPSCKSKGSIPDDKIPPGGANVNCP  29


>WP_191401559.1 DUF975 family protein [Candidatus Gallispira edinburgensis]
Length=268

 Score = 47.5 bits (109),  Expect = 0.007, Method: Composition-based stats.
 Identities = 24/176 (14%), Positives = 61/176 (35%), Gaps = 6/176 (3%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
               LL      + +    +          +     + ++      T  +     + +   
Sbjct  67   NPFLLMGINGLVYIMQIVLSLLGAFLSIGFRRYTLKVYRGHESRYTDVFSGFTKTGVKAF  126

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV-----GGGSLLLIIPGLLFCVWF  232
            +  ++    VG      + +    SF +  ++  + V        ++L +IP  +    F
Sbjct  127  LSEFLSGIMVGAVTCAAVLVILSVSFIVFALVDDIFVAVVAGIVLTILALIPIYMLYYSF  186

Query  233  FFCQYVLADDNIGGL-QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
             F  Y++ DD+  G+ +A+ KS  ++ G+ W +F   +  +   +  +F       
Sbjct  187  IFSIYIIHDDDNSGIAEAMFKSYRMMKGNKWGLFKVDLSFMGWFILCTFTLGFAGI  242


>NCX85299.1 hypothetical protein [Rhodobacteraceae bacterium]
Length=209

 Score = 46.7 bits (107),  Expect = 0.007, Method: Composition-based stats.
 Identities = 10/38 (26%), Positives = 13/38 (34%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  V CP C A    P + +P      +C  C      
Sbjct  1   MRLV-CPSCKANYEVPRTAVPIGGREVQCASCGHKWFQ  37


>HHG89683.1 hypothetical protein [Devosia sp.]
Length=264

 Score = 47.5 bits (109),  Expect = 0.007, Method: Composition-based stats.
 Identities = 8/55 (15%), Positives = 13/55 (24%), Gaps = 0/55 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPH  57
            + CP+C A     S  +       +C  C +     P                 
Sbjct  2   LIICPNCQARYEVASQTIGNAGRKVQCANCHKNWKAVPEPENPMPDKMFSDEEER  56


>PIR32800.1 hypothetical protein COV36_03655 [Alphaproteobacteria bacterium 
CG11_big_fil_rev_8_21_14_0_20_44_7]
Length=230

 Score = 47.1 bits (108),  Expect = 0.007, Method: Composition-based stats.
 Identities = 11/42 (26%), Positives = 18/42 (43%), Gaps = 0/42 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           + CP C  +     +KL  K    RC +C      +PA  ++
Sbjct  3   LECPECKTKFLIDPAKLGDKGRKVRCGKCSHIWHQEPAAPEQ  44


>PIY08491.1 hypothetical protein COZ18_12015 [Flexibacter sp. CG_4_10_14_3_um_filter_32_15]
Length=284

 Score = 47.5 bits (109),  Expect = 0.007, Method: Composition-based stats.
 Identities = 38/222 (17%), Positives = 71/222 (32%), Gaps = 43/222 (19%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             +  L     F       L      +     +  + +       ++  L     S F  +
Sbjct  46   CLAQLISYTLFCFFADLTLSFIIPQIVSFLGDLGYLVGTIVNLTLVSALLAGFYSFFEKV  105

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG--------------------------  216
             K D   F ++  G +H+G   L  IL++++V                            
Sbjct  106  YKKDKFDFHNLFDGFQHIGQLALHQILVVIMVILPFLSLFFLAQEFGINQTVDIFDIETY  165

Query  217  ------GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
                    ++  IP LL    + F   ++    +    A+E SR LV G++  +FG  + 
Sbjct  166  EKFTILFYVIFTIPSLLVFTLYIFVPILIVVTRMNFWSAMEVSRKLVLGNFIGVFGFVIG  225

Query  271  LLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
              +           I  VG       +L   PF+F   +++Y
Sbjct  226  FTL-----------INIVGLLFLGIGTLFTIPFTFAATFVLY  256


>NVL90362.1 zinc-ribbon domain-containing protein [Desulfobacterales bacterium]
Length=147

 Score = 45.5 bits (104),  Expect = 0.007, Method: Composition-based stats.
 Identities = 10/39 (26%), Positives = 16/39 (41%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  V C +C A+ N   + +    S  +C +C    I  
Sbjct  1   MI-VTCENCDAKFNLDENLIKESGSKVKCSKCDHAFIVH  38


>REJ62356.1 hypothetical protein DWQ28_11840, partial [Proteobacteria bacterium]
Length=105

 Score = 44.8 bits (102),  Expect = 0.007, Method: Composition-based stats.
 Identities = 15/74 (20%), Positives = 32/74 (43%), Gaps = 0/74 (0%)

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            PG+   + + F  Y++ +  +G  +ALE SR  ++  WW  FG  ++ +++ +  S    
Sbjct  4    PGIYLSIAYAFAPYLITEKGMGVWEALETSRKAITKFWWRYFGLMLVGMLMIIIGSIPVL  63

Query  284  RIPYVGEAANLAFS  297
                         +
Sbjct  64   VGLLWALPILAIAT  77


>OYT41231.1 hypothetical protein B6U86_02785 [Candidatus Altiarchaeales archaeon 
ex4484_43]
Length=358

 Score = 47.8 bits (110),  Expect = 0.007, Method: Composition-based stats.
 Identities = 19/108 (18%), Positives = 44/108 (41%), Gaps = 4/108 (4%)

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
            ++  ++    +L +    L  + FFF +  +  +N G + ++++S  LV  + W +    
Sbjct  26   VIFFLLIFSFILWLFLWFLISLPFFFVETSIVIENRGIMDSIKRSADLVIKNIWQV----  81

Query  269  VLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
            +L +V+ + +      +    E      SL + P S L      +   
Sbjct  82   LLFIVVLIVIWSAYLLLMISLEIPLSLLSLGIWPLSALIILFFMTPWM  129


>KAF7359109.1 hypothetical protein [Mycena sanguinolenta]
Length=557

 Score = 48.2 bits (111),  Expect = 0.007, Method: Composition-based stats.
 Identities = 36/394 (9%), Positives = 93/394 (24%), Gaps = 24/394 (6%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                      ++ T   F    +     F  + +           +          +   
Sbjct  22   WVNLVFFTVQTFQTIHYFRSGARPRDSFFIKLAVITNLFADLAGTVACCATTFLYTATYW  81

Query  222  IIPGLLFCVWFFFCQYVLADD---NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
                 +  +++     V A      +     + +   +   H    F   +LL  I+   
Sbjct  82   GDEEAIKKLYWPLTVLVFAVGVALAVSQFFMIIRYWQMTKHHVVFSFLFMILLGAITGIF  141

Query  279  -SFLTARIPY------------VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
             S +   +              +G  A+   S+L++P  F     I    +  ++     
Sbjct  142  GSGVLLALSLDDETVLRVIFMSLGLVASAVGSVLVSPLLFWQ---IRKRYETRWKSALGT  198

Query  326  PIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNR  385
             ++         I     +P   L         A  L            ++  +  +   
Sbjct  199  LVETGTFTAVVTIVICCTVPFGRLRETMMWIPFAFVLGPVYSCTLLFALSRRPEPANPLN  258

Query  386  SLPEEPQRLSS--ADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELS  443
             + +           ++ +                   L   +     ++       ++S
Sbjct  259  GIIDTYTAHPPKSKAHRPVGMMVLNMPPTVMEDADAFRLTKKKIELPGRDVDSDSSSDVS  318

Query  444  DFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSI  503
               N  L       +  + +       L   +              +T         +S 
Sbjct  319  RNLNEELEDMEEPGLPRESLSSRPLHPLLRPRAPLRPTPNPHPVSQKTKSKAAEKAQKSK  378

Query  504  YLRQGTQAEQVHSILGKLELTLPLAIESLQLTRN  537
              ++ ++ E V       E T P     L+ + +
Sbjct  379  SSQKDSRNEGVDPHW---EYTPPSKSVRLEESAD  409


>WP_020875731.1 zinc-ribbon domain-containing protein [Desulfococcus multivorans]AOY60393.1 
zinc finger/thioredoxin domain protein [Desulfococcus 
multivorans]AQV02492.1 hypothetical protein B2D07_18105 
[Desulfococcus multivorans]EPR43063.1 MJ0042 family finger-like 
protein [Desulfococcus multivorans DSM 2059]SJZ60505.1 
MJ0042 family finger-like domain-containing protein [Desulfococcus 
multivorans DSM 2059]
Length=228

 Score = 47.1 bits (108),  Expect = 0.007, Method: Composition-based stats.
 Identities = 21/198 (11%), Positives = 46/198 (23%), Gaps = 0/198 (0%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             V CP C A      +++P    S +C  C      +   S R     + A         
Sbjct  2    KVNCPECLAAHFFEDAEVPEGGMSVKCKICGTPFRVEKVRSPRPGHEPDEAEMTCPRCFI  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                    +       +   +    +  ++   + +           + +     G    
Sbjct  62   TQKKSDTCMCCGMTLSQSHEQPSMPESRQDRHTAAARTEDPVVWNGPTPQGPDAIGELFR  121

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
            G++ L       P    +L                   L +V       + +   +   +
Sbjct  122  GLFDLSFDHLITPGMIKVLYALLLIFGGAVTLILVNYFLFSVGNYGAAAAAVMIYLLAVV  181

Query  183  CKTDVGLFRSMKLGLRHV  200
                   F  +   L   
Sbjct  182  VVRIQAEFLLVFFMLGKH  199


>NBQ17330.1 hypothetical protein [bacterium]
Length=309

 Score = 47.5 bits (109),  Expect = 0.007, Method: Composition-based stats.
 Identities = 26/218 (12%), Positives = 63/218 (29%), Gaps = 14/218 (6%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
               +  +L       +  I L+ I       F  +L+      +P+ +N          A
Sbjct  85   FFNNMHKLDYTFFSLITFIILVAISYYIYFSFLKILINLPNNQHPRLKNLFMYDCSIFRA  144

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
            +    +  +     I +    +G    +     ++ S  +   ++I ++   +  ++   
Sbjct  145  FAASLVHILLIGATIGLITGTLGALYLLLKSFFNINSLIVSYKIMIPLLILLACSVMSTI  204

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF--------------VLL  271
                  F +    + D N     A   S  L  G  + I   +                 
Sbjct  205  FYIATRFSYAVPFVLDKNNTIKDAFINSWYLTQGLVFKISIVYALTLLLSLVTTLLITTF  264

Query  272  LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
             +  L +S  +  +  +    +   S  L   + +  Y
Sbjct  265  CIALLKISTTSPLLITLYALISGVISTPLIGLTLIKTY  302


>HIB69216.1 hypothetical protein [Phycisphaerales bacterium]
Length=235

 Score = 47.1 bits (108),  Expect = 0.007, Method: Composition-based stats.
 Identities = 19/123 (15%), Positives = 42/123 (34%), Gaps = 0/123 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                 +   ++ +  +      K                     L++ +       +L L
Sbjct  61   VLSRLLTPPIAALMVAYLARKEKRLPDGMSLGGALRSCFFPSIGLVLAVATFGFVATLCL  120

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            + PG+ F +       VL  + + G QA+++S  L   H W +   +     ++LT+  +
Sbjct  121  VFPGIAFLLATSVVLPVLVVEGVSGPQAVKRSWELTREHRWTLLAFWSGFFGLALTICSI  180

Query  282  TAR  284
             A 
Sbjct  181  VAL  183


>WP_041977924.1 zinc-ribbon domain-containing protein [Pyrinomonas methylaliphatogenes]CDM66573.1 
response regulator with CheY-like receiver, 
AAA-type ATPase, and DNA-binding domains [Pyrinomonas methylaliphatogenes]
Length=251

 Score = 47.1 bits (108),  Expect = 0.007, Method: Composition-based stats.
 Identities = 17/66 (26%), Positives = 27/66 (41%), Gaps = 1/66 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V C HC A      +KLP++  + RCP+C Q +   P    R    ++++  P    
Sbjct  1   MI-VTCSHCAARLQLDEAKLPSRPFTVRCPKCQQNVNVQPPVLSRVGVQESVSAGPPVRP  59

Query  61  QRRIPS  66
                 
Sbjct  60  IPAPAF  65


>HGS20096.1 tetratricopeptide repeat protein [Deltaproteobacteria bacterium]
Length=1192

 Score = 48.6 bits (112),  Expect = 0.007, Method: Composition-based stats.
 Identities = 12/35 (34%), Positives = 14/35 (40%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            V CP C  E      ++P K    RC  C  TL 
Sbjct  2   KVNCPGCQKEYTIDDHRIPPKGIRMRCTACQTTLH  36


>WP_144995878.1 hypothetical protein [Polystyrenella longa]QDU80618.1 hypothetical 
protein Pla110_23490 [Polystyrenella longa]
Length=370

 Score = 47.8 bits (110),  Expect = 0.007, Method: Composition-based stats.
 Identities = 25/278 (9%), Positives = 60/278 (22%), Gaps = 23/278 (8%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD--PAESQRTQTTDNIATCPHCG  59
             +  CP CG +                C             + + ++         P   
Sbjct  3    ISFACPECGRQFTVRDELAGKAGHCNSCQHRFHIPDPTPVMSGASKSGHYRLGKASPSSK  62

Query  60   LQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGW  119
               +     +          + +     + ER +           +     W    +   
Sbjct  63   ANGQPGGYHVPSAVDLAAVSQHSLDRVQRNERRWEEEEESGPYQVKTPRTPWREQHQINK  122

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
                    G +  F       L      LN              +  +      +   ++
Sbjct  123  QRKKSKAPGAMKEFYWDRLRGLRHLLDNLNDFGY----------LVSVPFIAMVIISILW  172

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                   +G    + + +         + L+ L V      L+   L     F F     
Sbjct  173  QSQNIGLIGASGIIAVNVIRFY-----INLIHLAVLPFRQSLMQGILFLIPPFTFYYLYQ  227

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
                    + ++K    + G  + I    ++   +   
Sbjct  228  N------WKPMKKGAKKLFGPMFQIIAVILVFTFLPFL  259


>RNC80015.1 hypothetical protein ED557_12860 [Balneola sp.]
Length=228

 Score = 47.1 bits (108),  Expect = 0.007, Method: Composition-based stats.
 Identities = 32/219 (15%), Positives = 67/219 (31%), Gaps = 1/219 (0%)

Query  101  RSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAIL  160
               +         +       L I  L  +L    +    L +    ++P        + 
Sbjct  1    MKQNSTYDIIRVFYLFNRHRTLFISYLLPLLVLQFLVVHFLQEILWTISPFFSLNGIIVG  60

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
            +    ++   L        +            +        SFT++      +   G  L
Sbjct  61   IVNFLFVFALLLLTLLLAEVAFYSLPNDRNEIIDKFKNVFKSFTIVYFGYHALTIMGFFL  120

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
             IIPGLL  V+F F   ++  + +   +A+  S       ++ I    ++L +  + + +
Sbjct  121  FIIPGLLVSVYFIFTPVIVVFEKLSIKEAMYLSYTEAKIKFFRILLATLILELPFIIIGY  180

Query  281  LTARI-PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
                      E     FS   +    L  Y+ Y ++K  
Sbjct  181  FLDLPDNIFLEMIVHIFSSFSSILLMLLVYIYYLEIKKE  219


>OGQ79628.1 hypothetical protein A2289_10340 [Deltaproteobacteria bacterium 
RIFOXYA12_FULL_58_15]OGR13153.1 hypothetical protein A2341_08630 
[Deltaproteobacteria bacterium RIFOXYB12_FULL_58_9]
Length=544

 Score = 48.2 bits (111),  Expect = 0.007, Method: Composition-based stats.
 Identities = 9/45 (20%), Positives = 19/45 (42%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M  V+C +C  +     SK+  + +  RC +C  + +     +  
Sbjct  1   MI-VQCANCDTKFRLDESKIGDRGAKVRCSKCQTSFVVQRPAAPP  44


>OGL89228.1 hypothetical protein A3I45_01350 [Candidatus Uhrbacteria bacterium 
RIFCSPLOWO2_02_FULL_53_10]
Length=363

 Score = 47.8 bits (110),  Expect = 0.007, Method: Composition-based stats.
 Identities = 33/229 (14%), Positives = 65/229 (28%), Gaps = 3/229 (1%)

Query  71   IQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIV  130
             +    +     R              +              L     +G+    +  I+
Sbjct  14   WEHYRKHFANNIRITAWLGVFIILHIIAASFYPIGAQELDRSLTGSEWFGIALFLINTII  73

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
            +        +                    LAT A+ L         +          + 
Sbjct  74   VLPIVSIWMVNALIRRIDVDVRGKSMTMHKLATEAWQLFFPQLWVRILTALAFGIAFAIP  133

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
              +        S  L   + +L++  G LL ++P ++  ++  F  +    +   GL AL
Sbjct  134  LMLLSVSSQFLSQVLPFGVSMLLMFIGLLLFLVP-IVLMIYLAFVYFAFVLNKARGLNAL  192

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA--RIPYVGEAANLAFS  297
            + S   V GH+W I  R VL  ++   +        +  +G     A S
Sbjct  193  KASVACVRGHFWPITWRLVLPKLLYFGILLAVQYVLLMLLGVFVGAAAS  241


>KAF0092883.1 hypothetical protein FD128_2729, partial [Hyphomonadaceae bacterium]
Length=237

 Score = 47.1 bits (108),  Expect = 0.007, Method: Composition-based stats.
 Identities = 9/43 (21%), Positives = 16/43 (37%), Gaps = 0/43 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
            + CP C A+     + +P    + RC  C       P + + 
Sbjct  2   LLNCPACSAKYQIAKTAIPEAGRNVRCAACGHFWYQLPFDDEN  44


>OIP39291.1 hypothetical protein AUK47_10290 [Deltaproteobacteria bacterium 
CG2_30_63_29]
Length=476

 Score = 48.2 bits (111),  Expect = 0.007, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V C  C         KLP +    RCP C    
Sbjct  1   MI-VTCERCEQRYKIREEKLPPQGGKIRCPTCRHVF  35


>WP_199388304.1 zinc-ribbon domain-containing protein [Geomonas sp. Red421]MBJ6749788.1 
zinc-ribbon domain-containing protein [Geomonas sp. 
Red421]
Length=686

 Score = 48.2 bits (111),  Expect = 0.007, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M   +C  C  +     SKL       RC +C    
Sbjct  1   MIL-QCDQCNTKFRLDDSKLKPGGVKVRCSKCRHVF  35


>WP_027749292.1 hypothetical protein [Streptomyces sp. CNH287]
Length=418

 Score = 47.8 bits (110),  Expect = 0.007, Method: Composition-based stats.
 Identities = 25/197 (13%), Positives = 50/197 (25%), Gaps = 29/197 (15%)

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
             +L    +++     L  +     F V           +    + ++ +S  LV G WW 
Sbjct  222  HMLGGWSLVLSLPVLLGCLGLAAFFYVRLGLAPSAAVLEEQRPVASMRRSARLVRGSWWR  281

Query  264  IFGRFVLLLVISLTLSFLTARIPYVGEAANL-----------------------------  294
            + G  +L+ V+      +   I  +    ++                             
Sbjct  282  VLGITLLVSVLCAIADQILQYILMLSSTVSIELLLTSTDGSDWSLGSVVGLVAVVGAVLA  341

Query  295  AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSR  354
              ++L  PF +L   L+Y DL+  + G          LP                     
Sbjct  342  FGAVLTMPFPYLVASLLYVDLRIRHEGFDLALGAAAGLPPHRPAPPEAYSGPGAYSGPGA  401

Query  355  QNLSAEQLLSAGKDIQQ  371
             +               
Sbjct  402  YSGPGAASAPGSYARPY  418


>HIC85926.1 hypothetical protein [Desulfobacterales bacterium]
Length=200

 Score = 46.7 bits (107),  Expect = 0.007, Method: Composition-based stats.
 Identities = 11/45 (24%), Positives = 19/45 (42%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M  ++C  C  + N   S L  + S  RC  C    +  P +++ 
Sbjct  1   MI-IQCEECKTKFNLDESLLKEEGSKVRCTVCQHIFVAYPPKTEP  44


>NJL60346.1 hypothetical protein [Desulfobacteraceae bacterium]
Length=68

 Score = 43.6 bits (99),  Expect = 0.007, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+C  C    N     +    S+ RC  C    
Sbjct  1   MI-VKCEKCQTAYNLKDDMIRPGGSNLRCSNCKHVF  35


>NQV33576.1 zinc-ribbon domain-containing protein [Phycisphaeraceae bacterium]
Length=513

 Score = 48.2 bits (111),  Expect = 0.007, Method: Composition-based stats.
 Identities = 9/31 (29%), Positives = 16/31 (52%), Gaps = 0/31 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           ++CP CG        ++PA  +S +C +C  
Sbjct  3   IKCPGCGQTYRVDPQRIPAAGASVKCMKCDH  33


>HIJ64731.1 hypothetical protein [Candidatus Hydrogenedentes bacterium]HIJ73967.1 
hypothetical protein [Candidatus Hydrogenedentes bacterium]
Length=407

 Score = 47.8 bits (110),  Expect = 0.008, Method: Composition-based stats.
 Identities = 34/344 (10%), Positives = 76/344 (22%), Gaps = 48/344 (14%)

Query  1    MPTV-RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCG  59
            M  V RC  CG +       L        C         +       ++    AT     
Sbjct  1    MLRVFRC-ECGQKMKVTPEMLGKAGRCVHCGRVVTPTAGNAPPMPPRRSDAAPATEERPQ  59

Query  60   LQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGW  119
                  S     ++  +                   +     +  Q              
Sbjct  60   PSPVPDSGSDLSEAPDLEELLVPEEDFEDEYEGAEEASRHPPAPRQPPPPPRTRRPEPPQ  119

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAIL-----------------LA  162
               G+      +  A  +  +++  A              +                  A
Sbjct  120  ESFGLDTFIRAVDIALDWRKIVVSMALGGICVVGILICIFIGAQDNDLIPVAIVLGVVWA  179

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSF-TLLLILLILVVGGGSLLL  221
                 +               +       + +   R         L L++  +  G+L+ 
Sbjct  180  VAWSGIFAGGVGRLVEIELTERRRAPAREAWRFIGRRFVGLAFGTLGLVLAAILLGALVN  239

Query  222  IIPGLLFCVWF---------------------------FFCQYVLADDNIGGLQALEKSR  254
             I  L+  + +                           +    ++A ++ G + A+++  
Sbjct  240  GIVFLIGMIPWLGPILAGILTLPLFVFNLLLFCVLWNVWIVPCIMAVEDCGAVSAVDRLV  299

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV-GEAANLAFS  297
             LV      +    V+   I   ++ L+  I Y          S
Sbjct  300  QLVVRRAGRLIAYEVIANSIIALVTTLSMVIGYACLGVTAGITS  343


>MBI2796380.1 zinc-ribbon domain-containing protein [Gemmatimonadetes bacterium]
Length=490

 Score = 48.2 bits (111),  Expect = 0.008, Method: Composition-based stats.
 Identities = 10/31 (32%), Positives = 13/31 (42%), Gaps = 0/31 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           V CP C +      S++P     ARC  C  
Sbjct  3   VTCPECRSIFRVDPSRVPHGGVRARCSVCGG  33


>WP_097115055.1 zinc-ribbon domain-containing protein [Alysiella filiformis]QMT30722.1 
zinc-ribbon domain-containing protein [Alysiella filiformis]SOD70946.1 
MJ0042 family finger-like domain-containing 
protein [Alysiella filiformis DSM 16848]
Length=237

 Score = 47.1 bits (108),  Expect = 0.008, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 10/36 (28%), Gaps = 0/36 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V CP C    +     L      A C  C    
Sbjct  4   MIKVTCPSCKTALSLSEQHLRETDGRADCHHCGHVF  39


>RZO67559.1 hypothetical protein EVA70_04065 [Parvularculaceae bacterium]
Length=295

 Score = 47.5 bits (109),  Expect = 0.008, Method: Composition-based stats.
 Identities = 22/141 (16%), Positives = 43/141 (30%), Gaps = 4/141 (3%)

Query  158  AILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG  217
               +   + I + +   +  +   +              L          + +  ++   
Sbjct  113  FFAILITSLISILIWIFSALIAQGLTGISFQELVFGFEKLMDNPEDYAGSVGIFKLMFLT  172

Query  218  SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
              +  +P +   V         A +N      L KS  +  GH W+IFG  +L  +    
Sbjct  173  LAIGALPSIYVGVKLAAFPAACAVENRLV---LLKSYAMTKGHAWSIFGAVILFSLAFFA  229

Query  278  LSFLTAR-IPYVGEAANLAFS  297
            LS +       +G  AN   S
Sbjct  230  LSIVLELSTSIIGLIANYFVS  250


>HCK07909.1 hypothetical protein [Rhodobacter sp.]
Length=78

 Score = 43.6 bits (99),  Expect = 0.008, Method: Composition-based stats.
 Identities = 8/43 (19%), Positives = 14/43 (33%), Gaps = 0/43 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
            + CP+C A        +       +C  C  T +    E  +
Sbjct  2   QIVCPNCEAHYEVGYDSIGDAGRQVQCSNCGHTWLAMRQEEDQ  44


>WP_139209220.1 hypothetical protein [Aquisalimonas asiatica]
Length=205

 Score = 46.7 bits (107),  Expect = 0.008, Method: Composition-based stats.
 Identities = 20/134 (15%), Positives = 43/134 (32%), Gaps = 6/134 (4%)

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                   V L   +   L  +  F L+ ++ +  V    ++ I+ GL   +  +    V+
Sbjct  36   HAGLFGRVILLFVLFQFLSVLAVFPLVFLMEVGGVWLAGVVAIVSGLYLSIRCWMVPVVM  95

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
              ++   + A+ +S  L SG      G   LL +  +  +     +P +     L     
Sbjct  96   GVEDRYSMDAIGRSWKLTSG------GPAFLLALGVVLFALAATAVPGMLWKLGLDAVFY  149

Query  300  LTPFSFLYYYLIYS  313
                        + 
Sbjct  150  PGYLMDNGTAFFWW  163


>NLW83980.1 hypothetical protein [Phycisphaerae bacterium]
Length=71

 Score = 43.6 bits (99),  Expect = 0.008, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 10/34 (29%), Gaps = 3/34 (9%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M T +C  CG E               RC  C  
Sbjct  1   MITFKCEKCGHEYRVNDDL---SGRKVRCKSCST  31


>KAF9624612.1 hypothetical protein IFM89_012034, partial [Coptis chinensis]
Length=453

 Score = 47.8 bits (110),  Expect = 0.008, Method: Composition-based stats.
 Identities = 22/117 (19%), Positives = 45/117 (38%), Gaps = 4/117 (3%)

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
            L    +++ G GS+LLI+  + + V +         D+  G++A E S     G+    F
Sbjct  325  LGTFFMILYGAGSILLILKWMDWSVVWNMSILFSVLDDKHGVEAFEASGYFSRGNKKRGF  384

Query  266  GRFVLLLVISLTLSFLTARIP----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
               ++  V  + L      +     ++G  A    S +     ++   + + D K  
Sbjct  385  QLMLIFFVWRIVLRLSCVYVGGHEKWIGLVATSFLSCVGNVMKWVVCVVYFYDCKKR  441


>TVQ85144.1 hypothetical protein EA357_01125 [Micavibrio sp.]
Length=249

 Score = 47.1 bits (108),  Expect = 0.008, Method: Composition-based stats.
 Identities = 10/38 (26%), Positives = 13/38 (34%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  V+CP C  +    +  L  K    RC  C      
Sbjct  1   MI-VQCPECNTKYRVSALLLGIKGREVRCGRCRNHWHQ  37


>TMQ67476.1 hypothetical protein E6K78_04575, partial [Candidatus Eisenbacteria 
bacterium]
Length=105

 Score = 44.4 bits (101),  Expect = 0.008, Method: Composition-based stats.
 Identities = 10/31 (32%), Positives = 14/31 (45%), Gaps = 0/31 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECC  33
           TV+CP+C      P   L  + +  RC  C 
Sbjct  2   TVQCPYCETAYALPERLLGKRGARVRCRVCK  32


>NJC89222.1 hypothetical protein [Desulfuromonas sp.]
Length=425

 Score = 47.8 bits (110),  Expect = 0.008, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + C  C A+      KL    +  RC +C    
Sbjct  1   MI-ITCEECQAKFRMADEKLKPGGTKVRCSKCKHVF  35


>RME02878.1 hypothetical protein D6805_08625 [Planctomycetes bacterium]
Length=412

 Score = 47.8 bits (110),  Expect = 0.008, Method: Composition-based stats.
 Identities = 13/96 (14%), Positives = 30/96 (31%), Gaps = 1/96 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V CP C  + +   +K+P K+  ++C  C   +     E + T +   +        
Sbjct  1   MI-VVCPQCQQQFDIDEAKIPPKRVKSKCLNCGNWIYLREEEGELTPSPLQLGLQALQEK  59

Query  61  QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRAS  96
           + +      +         +   +   Q   +    
Sbjct  60  KWQQAQKYFQQTLHQQPQLKKQIAQEYQQIAQEYQQ  95


>HAU30617.1 hypothetical protein [Candidatus Dependentiae bacterium]
Length=210

 Score = 46.7 bits (107),  Expect = 0.008, Method: Composition-based stats.
 Identities = 25/184 (14%), Positives = 57/184 (31%), Gaps = 0/184 (0%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
                       ++     +W+ P        +L A            +        +   
Sbjct  18   AWSPFAGHWSPSVAGSLMSWMTPSGIILAGVLLAAFSYLFAAVWVAGSAIALEVYDQGTS  77

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
             + R             +LL++  +++  G   LI+PG+   +   F  +++ D N    
Sbjct  78   TVRRFWAYFFTSALRAWMLLVVQSVLISVGMFCLIVPGIYLALALQFSPFIMVDTNCSWR  137

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLY  307
            QA+  S  +  G    +    ++LL+++  L        +V     L  + +     F  
Sbjct  138  QAIVTSWRMTRGCMGRLLVTNLVLLLLACALWSSIWLFMFVYPLVLLVRTYIYRALCFKQ  197

Query  308  YYLI  311
              + 
Sbjct  198  MTIA  201


>NQV35227.1 hypothetical protein [Phycisphaeraceae bacterium]
Length=198

 Score = 46.3 bits (106),  Expect = 0.008, Method: Composition-based stats.
 Identities = 24/154 (16%), Positives = 59/154 (38%), Gaps = 7/154 (5%)

Query  479  EHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRND  538
            E  A H     + +   +    ++ +L +   + +   + G + L +   IE + L + D
Sbjct  31   EVYAHHRESRVRLNGASMVDNQQATFLCRDIGSRRTDQMSGSVLLNVFGQIERVTLRQQD  90

Query  539  IGKT--LQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGD  596
            +     L++ G   +   L  N ++   LG   D++ + A +     L +      ++G 
Sbjct  91   LDNMCCLKMPGNTYLDLTLDKNRLSYMVLG--GDVVQLAAYDDQGRRLMQDPAWRFQAG-  147

Query  597  AFSLRQMFDGNIESITVLVAGDSMTQSYPFELTR  630
                   F G  + + + ++  +      F++T 
Sbjct  148  --MKSVYFWGVPDKVVLDISTQTQVSRLNFKVTE  179


>WP_172677062.1 zinc-ribbon domain-containing protein, partial [Desulfovibrio 
brasiliensis]
Length=66

 Score = 43.2 bits (98),  Expect = 0.008, Method: Composition-based stats.
 Identities = 16/63 (25%), Positives = 22/63 (35%), Gaps = 1/63 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP CG ER     K+PA+   A CP+C     F     +     D          
Sbjct  1   MI-ITCPECGFERQVNPDKIPARSQMATCPKCKTKFKFRDLPEEFDFVPDPHPGAVAPKP  59

Query  61  QRR  63
           +  
Sbjct  60  EDA  62


>NLT60994.1 hypothetical protein [Candidatus Hydrogenedentes bacterium]
Length=476

 Score = 47.8 bits (110),  Expect = 0.008, Method: Composition-based stats.
 Identities = 9/29 (31%), Positives = 11/29 (38%), Gaps = 0/29 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARC  29
           M    CP CG + N P S +        C
Sbjct  1   MIAFSCPKCGHQFNVPDSAVGQHGWCRGC  29


>PKN92012.1 hypothetical protein CVU44_17110 [Chloroflexi bacterium HGW-Chloroflexi-6]
Length=261

 Score = 47.1 bits (108),  Expect = 0.008, Method: Composition-based stats.
 Identities = 27/192 (14%), Positives = 64/192 (33%), Gaps = 31/192 (16%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
                  +    +T ++          +F+S       +  F  +L+L+ L++    + L+
Sbjct  3    FFLVSGVATLTLTRAILSAYVGEKTSVFQSFSNAWPVLMRFMSVLVLIGLLMVVFYIWLL  62

Query  223  IPGLLF-----------CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            IP + +                F   ++  +       + +S  LV   +W + G   +L
Sbjct  63   IPCVGWFSGLGILLTLSAAVLPFTVPIVILERHSATGTISRSWDLVRRRFWWVLGLMGVL  122

Query  272  LVISLTLSFLTARI--------------------PYVGEAANLAFSLLLTPFSFLYYYLI  311
             ++SL L+  +  +                      +     +  S+++ P ++    L 
Sbjct  123  TLLSLVLTGPSILLSSLSQSATQLGSLGNQTLTNTIIQSVVGVLASIIILPLNYAVVVLA  182

Query  312  YSDLKANYRGPQ  323
            Y DL+    G  
Sbjct  183  YFDLRIRTEGID  194


>WP_198373755.1 zinc-ribbon domain-containing protein, partial [Roseomonas sp. 
OP-27]
Length=70

 Score = 43.2 bits (98),  Expect = 0.009, Method: Composition-based stats.
 Identities = 12/34 (35%), Positives = 13/34 (38%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP CGAE   P   L     S RC  C    
Sbjct  2   RIACPSCGAEYEVPDRLLAGAARSLRCSRCGADF  35


>RME62854.1 hypothetical protein D6778_10480 [Nitrospirae bacterium]
Length=302

 Score = 47.5 bits (109),  Expect = 0.009, Method: Composition-based stats.
 Identities = 22/189 (12%), Positives = 65/189 (34%), Gaps = 9/189 (5%)

Query  146  TWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTL  205
              L+   +              +L L  +   +   +    +G   +    +   G   +
Sbjct  114  FSLSGFFKEANKQFWPVMGYLAVLFLVGVGFVLAWVVFFVVLGFAGTAFKSVPQGGMAVV  173

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
            + +L + +    +++++    +F     +   V+   +     AL+++ + +  +  A +
Sbjct  174  VSVLGVFLFIAVAVVMLGSVFIFASLGAYGATVVVFRDASVWGALKRAWVFLWANQRAFW  233

Query  266  GRFVLLLVISLTL------SFLTARIPYVGEAANL---AFSLLLTPFSFLYYYLIYSDLK  316
            G  + L ++ +          +   IP +G+   L    FS+ +  F+  +   +     
Sbjct  234  GFVLSLCILWVVAIGLALGGLIVQVIPGLGKILALPYDVFSMSVEVFATYWSTAVAMSFY  293

Query  317  ANYRGPQHP  325
              +R    P
Sbjct  294  EAHRRVYSP  302


>MBA3501508.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=310

 Score = 47.5 bits (109),  Expect = 0.009, Method: Composition-based stats.
 Identities = 9/42 (21%), Positives = 12/42 (29%), Gaps = 0/42 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           VRC  C  E     S+L     + +C  C             
Sbjct  3   VRCEKCQTEYELDESRLKPGGVTVKCTNCGHMFKIRKRTPTN  44


>NLJ30869.1 DUF2628 domain-containing protein [Clostridiales bacterium]
Length=352

 Score = 47.5 bits (109),  Expect = 0.009, Method: Composition-based stats.
 Identities = 30/316 (9%), Positives = 62/316 (20%), Gaps = 17/316 (5%)

Query  1    MPTVR---CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPH  57
            M       CP CG         +        CPEC      +           +    P 
Sbjct  1    MIDYTGLKCPVCGKTFTADDDIV-------VCPECGAPYHRECYTKAGKCVFADKHGTPD  53

Query  58   CGLQRRIPSDRLEIQSKTVNCRR-CNRSFCLQPEREFRASGSGLRSISQLLADSWELFCR  116
                     D  E   +   C    + +           + +            +     
Sbjct  54   AWKPPSPQPDHQENGKRCPRCGSLNSAAALFCEHCGQPLTVNQQDFHGFPQNSGYPYGNY  113

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                   I            F    +             +    +       L  S  T 
Sbjct  114  PPQNGRPIPPNQPGGFPQGGFPGQPIPFLFDPMGGVNPEERIEDVPAGEIAKLVQSNTTY  173

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG--GSLLLIIPGLLFCVWFFF  234
             +  ++  +     R           + L   +  L        ++L +  +   V +  
Sbjct  174  YLPAFMNLSRFHRNRFNFGAFLFSCGWLLYRKMYKLGGILTAAMVMLYLASIYVSVHYSD  233

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
                    + G       S  L       +    +   V  + L  + + I  V     +
Sbjct  234  PILQSLMLSAGVS---SDSASLTYDQTMQVINLLMQKPVWQILLFSVPSIISLVNFVIMI  290

Query  295  AFSLLLT-PFSFLYYY  309
               +     +      
Sbjct  291  VVGINANRWYLRHCMT  306


>WP_025291210.1 hypothetical protein [Sphingomonas sanxanigenens]AHE52921.1 hypothetical 
protein NX02_05930 [Sphingomonas sanxanigenens DSM 
19645 = NX02]
Length=230

 Score = 46.7 bits (107),  Expect = 0.009, Method: Composition-based stats.
 Identities = 28/184 (15%), Positives = 55/184 (30%), Gaps = 2/184 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY-ILLGLSWMTGSMF  179
            + G ++L            +    A           +           L           
Sbjct  15   IGGFFMLLPTALLYAFLPPMPQPAAGDDAFVMMVAYYRQHQPEFIIHTLWITFGELAMTV  74

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
            +        +  ++   LR      L  ++++L + GG+L+ I+P L F      C  +L
Sbjct  75   LLAAPDRPTVGEALVRALRLYPWHLLQRLVVVLAMIGGALMFILPALYFSGRLALCTPLL  134

Query  240  AD-DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
            A  +    L  + +S+ L  G  W I    +L+ + +   S     I         A  +
Sbjct  135  AVGEGRNPLALMRRSQQLTRGAGWHIAAFVILVWLGTTLFSSAVGTISAAMFKPFGAGGV  194

Query  299  LLTP  302
                
Sbjct  195  AHVA  198


>RKU28095.1 hypothetical protein C6499_10485 [Candidatus Poribacteria bacterium]
Length=1019

 Score = 48.2 bits (111),  Expect = 0.009, Method: Composition-based stats.
 Identities = 27/424 (6%), Positives = 81/424 (19%), Gaps = 31/424 (7%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
              +   +     +  +       L    +       L       L       + ++ + +
Sbjct  359  IYINHGIQPLLRWVGISGVSMAALALFTRLIVATAPLNVAWAAPLIFGIGIYAHWLLVVR  418

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC-VWFFFCQYVLADDN  243
                  R +   L  +      L          +L+     +    + FF          
Sbjct  419  KSFSWQRFLMPILPIILPIKSFLQAQDYRKLAKALVHYWIRIAKWQIAFFVKVTKFCF--  476

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT--ARIPYVGEAANLAFSLLLT  301
                    +  +       + F    +   + + +  +         G       +  + 
Sbjct  477  --------RWVVHTIRFSRSPFLLIFVAFGVVIFVGIVLPIVLPMLFGSMVTHIVAEWIL  528

Query  302  PFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQ  361
            P     +  + S L    +           +          L                + 
Sbjct  529  PVVLAPFLALASILTGLDKLFHLGFNIPIVVGWAFWGLVIGLAIQGYRAMEIYDQKRMKI  588

Query  362  LLSAGKD--------IQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEG  413
             ++            ++     +P           +     S    +   +         
Sbjct  589  WVAVAPILLLCVIGTMRYVSALKPPNLDKTTPIAAQTSGLESETITRTSPAPGATKQPAT  648

Query  414  GLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYD  473
                G +                       +   +   +       +        +   +
Sbjct  649  ESDGGSMLSEEPAPLPPTTPSKPPAGTPKQETTAIPTLEPEPTTSSVPSEPAPIKQTEIE  708

Query  474  RQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQ  533
             +  F  P      I Q +            L   T ++       +    +P  +E   
Sbjct  709  PEKKFTPPRKPVPAIEQEEPA---------PLPPTTPSKPPAGTPKQETQIIPT-LEPKP  758

Query  534  LTRN  537
            +T  
Sbjct  759  ITPL  762


>QLH40229.1 zinc-ribbon domain-containing protein [Defluviicoccus sp.]
Length=180

 Score = 45.9 bits (105),  Expect = 0.009, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 12/38 (32%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  + CPHCG      +     +  +  C  C      
Sbjct  1   MI-IACPHCGTSYAVEADVFGPEARTVECSACGGRWQQ  37


>NDU92133.1 hypothetical protein [Ferrovum sp.]
Length=328

 Score = 47.5 bits (109),  Expect = 0.009, Method: Composition-based stats.
 Identities = 23/182 (13%), Positives = 47/182 (26%), Gaps = 2/182 (1%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F R    +    L         +    L +   +             +     ++  L  
Sbjct  71   FMRYAMLIAVAGLFFRAQMMTGLTQRALGQRTGYAGVYLSFGASFWRVFGAYVVIFLLLT  130

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
                    +         ++  G    G+   L I ++       + L +      +   
Sbjct  131  AVQIAATLLLVLPAVALVAVGHGTAGTGASPDLGIAMVTFFVLWRIALFVVVTYLFIRLS  190

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
            F    +          L +   LV G+   IF   + L V  + +  +   +   GE   
Sbjct  191  FVVTAVIVAEQRFD--LIRPWRLVEGNVLRIFALGLALFVPFVLVFIIVYVLVLGGEIWT  248

Query  294  LA  295
            LA
Sbjct  249  LA  250


>HBV58079.1 hypothetical protein [Candidatus Magasanikbacteria bacterium]
Length=265

 Score = 47.1 bits (108),  Expect = 0.009, Method: Composition-based stats.
 Identities = 23/201 (11%), Positives = 67/201 (33%), Gaps = 7/201 (3%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
                A S+  F       +    + ++         ++   +T +   ++  +   +   
Sbjct  56   QWSWAFSFGNFDYSMGFWVLALGILLLGLAVLFVFTVINSFSTLIFATDKFRKDKKVDLA  115

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
              ++ +   +      ++  K  + +F S+ +    +       + +I +     L  I+
Sbjct  116  KVWLEMKKKFALVLGLVFFFKILIMIFGSLSVAPLWLVMNGTAGVGMIALYPLIFLFSIL  175

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW-------WAIFGRFVLLLVISL  276
              L+      +    +  ++    QA+++S  L  GHW         IF   +L  +I++
Sbjct  176  AILVCSFLLIYAAAFVILEDYTFGQAIKESWRLFIGHWLVNLEMAIIIFFINLLAGLITI  235

Query  277  TLSFLTARIPYVGEAANLAFS  297
              + +      +    +L   
Sbjct  236  VAAAIIGIPALIVFLFSLFIQ  256


>RMG14255.1 gliding motility protein, partial [Deltaproteobacteria bacterium]
Length=90

 Score = 44.0 bits (100),  Expect = 0.009, Method: Composition-based stats.
 Identities = 11/41 (27%), Positives = 15/41 (37%), Gaps = 0/41 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
              C +C A+      KL       RC +C   +I  P E 
Sbjct  2   KFACDNCSAQYMIADEKLGPNGVKVRCKKCAHEIIVRPPER  42


>SCI66204.1 Protein of uncharacterised function (DUF975) [uncultured Clostridium 
sp.]
Length=240

 Score = 46.7 bits (107),  Expect = 0.009, Method: Composition-based stats.
 Identities = 24/153 (16%), Positives = 43/153 (28%), Gaps = 6/153 (4%)

Query  139  ALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLR  198
              +           +N    +L A           +   +   I    VG+  ++  G  
Sbjct  64   CQVGMIFDGFEHFGRNLGSMLLQALFILCAYLSGMLALMLVGIIIGIIVGVSTAVMGGAT  123

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
                  LL + L + V    L++    +L    F   +       I    AL+ S  +  
Sbjct  124  LANLLLLLFVPLAIAVIVLMLVVY--YILSMTRFILAESHQ----IKAFDALKLSARITK  177

Query  259  GHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
            GH   IF   +  L   L        +  +   
Sbjct  178  GHRADIFVFDLSFLGWMLLGVLTLGILNILYVI  210


>HHO52398.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=244

 Score = 46.7 bits (107),  Expect = 0.009, Method: Composition-based stats.
 Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 0/45 (0%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRT  46
                CP C  +    +SK+P   + A CP C    +   + ++  
Sbjct  65   VETSCPSCRVQYRVEASKVPEGGARAVCPRCGFDFMIRRSPARIW  109


>NBX74576.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=290

 Score = 47.1 bits (108),  Expect = 0.009, Method: Composition-based stats.
 Identities = 16/179 (9%), Positives = 38/179 (21%), Gaps = 2/179 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF-DPAESQRTQTTDNIATCPHCG  59
            M  + CP C A          A     RC +C                            
Sbjct  34   MELI-CPSCQARYMVKDDLFQAGPKVVRCQKCNHRWRQGADGVVDDGLPKAGSPAASSDA  92

Query  60   LQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGW  119
                     +   +   +      +          A  S           +W        
Sbjct  93   AAALATPSSIAPSAGAPHGASAPSAHATNQPHGGNAPASTPPREVHPDHANWFERHDFHM  152

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
              L      + + +  +    ++   T      Q  + + ++  V  + + +  +   +
Sbjct  153  PRLAAVRDFLHMHYPHLDWHKIVYNITHPVGLKQALRASWVMVGVLVVSILVVALMQRV  211


>QFR39297.1 hypothetical protein A9Q91_03610 [Candidatus Gracilibacteria 
bacterium 28_42_T64]
Length=324

 Score = 47.5 bits (109),  Expect = 0.009, Method: Composition-based stats.
 Identities = 23/196 (12%), Positives = 57/196 (29%), Gaps = 12/196 (6%)

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI  167
             +   +       ++ I  L I + F       + +     + Q    +  +      + 
Sbjct  69   NNPIVVIMITFGMIIFIAYLLINITFILGLIKSIQQ--ASKSEQVTPKENILYGFKNLFN  126

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
                 W   +    I      +   +     +  + +LLL +   ++    +L     ++
Sbjct  127  SFKTYWYIFAYVALIPALIFIVGGILFNLGFYYDNSSLLLKIGGALMVIAGILFFFFAIV  186

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
              +   F  Y   +      +    +  L   +WW + G F L+ +I          I  
Sbjct  187  RGIKATFPLYNAVEKGDFSKENFSGALKLTKNNWWRVLGNFALVGLI----------IGL  236

Query  288  VGEAANLAFSLLLTPF  303
            V    +    ++  P 
Sbjct  237  VTGMISGVIGIITPPM  252


>MBA4076763.1 hypothetical protein [Cyanobacteria bacterium PR.023]
Length=256

 Score = 47.1 bits (108),  Expect = 0.009, Method: Composition-based stats.
 Identities = 21/122 (17%), Positives = 45/122 (37%), Gaps = 12/122 (10%)

Query  201  GSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGH  260
                    ++ +    G     IPG+ F +     Q ++  +  G  +A+ KSR L++G+
Sbjct  99   FKMIGAAFVVNIGTMIGMCFFFIPGVWFAMIHALTQEIVVLEGCGVFEAMGKSRQLMAGN  158

Query  261  WWAIFGRFVLLLVISLTLSFLTARIP-----YVGEAAN-------LAFSLLLTPFSFLYY  308
             W I     LL +    +  +   I      + G           +  ++++  F+  + 
Sbjct  159  IWRITTYCFLLPMAVGLVGLIIMGIGCAIFYFAGGIVMRGDVGREILINMIIAVFTVFFM  218

Query  309  YL  310
             L
Sbjct  219  AL  220


>MBE6990307.1 DUF975 family protein [Ruminococcaceae bacterium]
Length=235

 Score = 46.7 bits (107),  Expect = 0.009, Method: Composition-based stats.
 Identities = 28/201 (14%), Positives = 70/201 (35%), Gaps = 13/201 (6%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L   +++ ++L    +        A     +      A+ L    ++   + W    + +
Sbjct  39   LFSFFVVQLLLMAVSLGVRSGRFGAGAQVMKYVYIGLALYLLVSLFVGSAVEWGLDRVML  98

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
               + +    R++      V    LL + L + V   SLLL +PG++  + +    ++LA
Sbjct  99   LRLRGECADQRTLFFYRPAVFDAILLRLFLSVRVTLWSLLLFVPGIVAAMNYAMAPFLLA  158

Query  241  DD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
             +  +G  +A+  S+ L+ G+   + G  +      +                 +   + 
Sbjct  159  QNPGMGVPEAIRVSKYLMKGYKRKLLGLLLGYAGEIII------------SLLLVVPFIY  206

Query  300  LTPFSFLYYYLIYSDLKANYR  320
            + P       + + D    + 
Sbjct  207  VMPRLKCATAVFFRDRVRLHD  227


>MAS87997.1 hypothetical protein [Micavibrio sp.]
Length=255

 Score = 46.7 bits (107),  Expect = 0.009, Method: Composition-based stats.
 Identities = 9/44 (20%), Positives = 15/44 (34%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  + CP C A     +S +       +C  C      +P   +
Sbjct  1   MI-IVCPECSARYLIQTSSVGPSGKKVKCASCKHVWHQEPETEE  43


>RZC46976.1 hypothetical protein C5167_039954 [Papaver somniferum]
Length=573

 Score = 47.8 bits (110),  Expect = 0.010, Method: Composition-based stats.
 Identities = 26/146 (18%), Positives = 54/146 (37%), Gaps = 3/146 (2%)

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
            L+        + I      VGLF    + +   G     +++  L +    L+     + 
Sbjct  382  LIVTFLWCLLIVIIYTGVAVGLFLWFFVSVNDEGQGKDKVLIFGLCLFIPFLI---GLIY  438

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
                +     +   +N  G +AL KS  LV G  +A    F++L + S  + F  + +  
Sbjct  439  MENVWSVAIVMSVLENDCGRKALGKSMKLVRGKIFASSVAFLVLHIASAGVIFAFSLLVV  498

Query  288  VGEAANLAFSLLLTPFSFLYYYLIYS  313
             G  ++L   + +    +L   ++  
Sbjct  499  YGFMSSLVGKIFVGIACYLVLLVLIH  524


>NBO64891.1 hypothetical protein [Acidobacteria bacterium]
Length=309

 Score = 47.1 bits (108),  Expect = 0.010, Method: Composition-based stats.
 Identities = 33/242 (14%), Positives = 67/242 (28%), Gaps = 29/242 (12%)

Query  94   RASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQ  153
                 G             +  +    L  +  +   +  A  F  LL K          
Sbjct  10   NWKWIGDAWRLFTSNVFTWILMQLTVVLFILITISPAVFLAGGFGFLLSKEDWSSLAGLS  69

Query  154  NWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV  213
                 ++   +  ++LG  ++   +     K   G   S             L    +L 
Sbjct  70   VVGVIVVPILLIVLVLGGIFLLAGLSRAAIKKAQGEEISYSDLFSGSDVLLPLTGFYLLY  129

Query  214  VGGGSLLLIIPGLLFCV------------------WFFFCQYVLADDNIGGLQALEKSRL  255
            V       I+P  +F V                  W FF   ++ D   G ++A+E+S  
Sbjct  130  VTASIAAGIVPRFIFGVSDVITSLIESTVRLALFGWTFFSVPLIVDRRAGVVEAIEESLR  189

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
            L    W +            + L+ +   +  +G         +   F +    + Y+++
Sbjct  190  LTLPKWSS-----------YILLALVIQILSSLGFILLFIGIFVTLHFQWTVSAVAYTEI  238

Query  316  KA  317
              
Sbjct  239  YG  240


>MBF0475646.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=333

 Score = 47.5 bits (109),  Expect = 0.010, Method: Composition-based stats.
 Identities = 11/38 (29%), Positives = 14/38 (37%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  V+CP C        +KL    +  RC  C     F
Sbjct  1   MV-VQCPSCETRYKIDDAKLKEGNTKLRCSNCNHMFTF  37


>PIR39501.1 hypothetical protein COV35_03025 [Alphaproteobacteria bacterium 
CG11_big_fil_rev_8_21_14_0_20_39_49]
Length=218

 Score = 46.3 bits (106),  Expect = 0.010, Method: Composition-based stats.
 Identities = 9/38 (24%), Positives = 13/38 (34%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  V CP+C A     +  +       +C  C  T   
Sbjct  1   MI-VGCPNCSARFIVKAQAIGENGRKVKCARCKHTWFQ  37


>NOS67550.1 hypothetical protein [Candidatus Peribacteraceae bacterium]
Length=301

 Score = 47.1 bits (108),  Expect = 0.010, Method: Composition-based stats.
 Identities = 23/250 (9%), Positives = 70/250 (28%), Gaps = 17/250 (7%)

Query  89   PEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWL  148
             +  F          +        ++          +L+ + + F   F    +     +
Sbjct  41   YQIYFLYEYFWGTGGAGFFDIEIVIYESMPHWFFWTFLIVLGVLFIIEFFFPHMALGAII  100

Query  149  NPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLI  208
                ++     +   +   L     +     I++  +         + +R++        
Sbjct  101  GLAAKSHMGEPVKGGMVLALYNFFAVFTIHEIFVLGSLSTSITLTSVIMRYIEGN-----  155

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI----  264
                 + G   L+     +   +F F +  +  D +G  +AL +S  L+  +   I    
Sbjct  156  -AKGFIIGLLWLVWALSNVLRFFFSFAEEAVVIDKVGMFEALGRSFKLIISYLGHIMFLL  214

Query  265  -------FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKA  317
                       + + +I +    +   +  +G   +   + ++     +   L+ S   A
Sbjct  215  LLLFVISLRIILNVAIIVVIPGLVIGIVLLLGTFLSTLLTWIIGGIVGIALTLVASYFFA  274

Query  318  NYRGPQHPPI  327
                 +    
Sbjct  275  YLHVFKQTVW  284


>TVQ02204.1 hypothetical protein EA381_03675 [Planctomycetaceae bacterium]
Length=519

 Score = 47.8 bits (110),  Expect = 0.010, Method: Composition-based stats.
 Identities = 33/327 (10%), Positives = 84/327 (26%), Gaps = 23/327 (7%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              VRCP C +      ++        RC +C   +   P    +++   ++AT P     
Sbjct  173  FRVRCPVCDS---VTYARTDQVGKRIRCTDCESVITVPPRPKPQSRYQPDMATAPVYRFS  229

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                    E  + T    +                       ++    S+ ++  R   +
Sbjct  230  DG-----DEHGNPTAARPQDPFRKSADDLLRAAELAEEESEETEWELPSFGVWFLRLGKV  284

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                 + I +    + + + +     ++ Q       +    V +  L ++     +   
Sbjct  285  FRDPAVAIHVGLLSLLAYVPVAIWLKVDAQAPIVALGMFAGGVVFAALVVACGFAILQSV  344

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLL----------ILLILVVGGGSLLLIIPGLLFCVW  231
                +      +   L   G   + +           ++  + G G ++     +     
Sbjct  345  ANGEERVSEWPVFDPLEWFGQVAVAMSAGAVAAGPIWMVAKLFGIGGMMAFALAMFTLYL  404

Query  232  FFFCQYVLADDNIGGLQALEKS--RLLVSGHWWA---IFGRFVLLLVISLTLSFLTARIP  286
             +    +   D             + +V               L   +     F +A  P
Sbjct  405  LYPIILMSMLDEQSVFVPFSTDVSKSIVRAPDQWGAAYLASTALFGGLFFAYLFASACTP  464

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYS  313
              G A  +  ++      F     +  
Sbjct  465  LTGAAIAIVVTIAAVFMYFGIIGQLAY  491


>MSP71201.1 tetratricopeptide repeat protein [Myxococcales bacterium]
Length=1326

 Score = 48.2 bits (111),  Expect = 0.010, Method: Composition-based stats.
 Identities = 14/65 (22%), Positives = 22/65 (34%), Gaps = 0/65 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP CGA+ N   SK+P      +CP+C  + +           T         G + 
Sbjct  2   KIICPKCGADYNINDSKIPPDGLHIKCPKCLHSFVATKDGGPAGPGTTPTGPGATGGQRP  61

Query  63  RIPSD  67
                
Sbjct  62  PTAPH  66


>KKJ71346.1 glycerophosphodiester phosphodiesterase, partial [Enterococcus 
faecium MRSN 4777]
Length=213

 Score = 46.3 bits (106),  Expect = 0.010, Method: Composition-based stats.
 Identities = 15/79 (19%), Positives = 30/79 (38%), Gaps = 0/79 (0%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
             ++    LL+ I      +   F    +   +     A+ +S  L      AI G+F+++
Sbjct  6    WIIVSSFLLVYIFLGYIGIRLIFALPEMILRDRSFRAAIRESWSLTKSRLLAITGQFIVI  65

Query  272  LVISLTLSFLTARIPYVGE  290
                L LS L   +  + +
Sbjct  66   GGTILLLSSLGYIVVILAQ  84


>OYV40783.1 hypothetical protein B7Z80_03680 [Rhodospirillales bacterium 
20-64-7]
Length=253

 Score = 46.7 bits (107),  Expect = 0.010, Method: Composition-based stats.
 Identities = 9/42 (21%), Positives = 14/42 (33%), Gaps = 0/42 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
            V CP+C  E   P + L  +     C  C    +      +
Sbjct  41  RVTCPNCSTEYEVPDAALAGRNRKLLCERCGHRWLHADVPPR  82


>MBI1300350.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=318

 Score = 47.1 bits (108),  Expect = 0.010, Method: Composition-based stats.
 Identities = 7/38 (18%), Positives = 13/38 (34%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M    CP+C A+     + +  +    +C  C      
Sbjct  1   MIL-TCPNCDAQFAVGDNLIGDEGRKVKCSSCSNVWFQ  37


>RPJ64716.1 hypothetical protein EHM20_18110 [Alphaproteobacteria bacterium]
Length=362

 Score = 47.5 bits (109),  Expect = 0.010, Method: Composition-based stats.
 Identities = 31/327 (9%), Positives = 86/327 (26%), Gaps = 21/327 (6%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
            +CP+C A+ N   + +      A+C +C Q     P       +   + +        + 
Sbjct  32   QCPNCKAKFNINEANV---GKKAKCSKCAQLFTIAPFVETPVSSEPAVKSPVPPSQTIKD  88

Query  65   PSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI  124
             + + +                 +P        S  +                      I
Sbjct  89   AAPQQQQIKNPEPVETPKVEKTPEPVAAAEPLKSPEQMPQIKGPAPAAPAPLSAKPAPNI  148

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
                     +   +   L  A ++         A +LA++  +L+       S       
Sbjct  149  GQAAQEEVVSKPSTKPSLSKAVFVYFWTGVRITAGVLASIGLVLILSKHDKSSFTAAFAA  208

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
             D+ +  S+ +           +      +  G  +  +   +F +++         ++ 
Sbjct  209  ADIFIIISLLIEFMLFYKMWAAIKDTQTSISPGKAVGFLFIPVFNIYWALLMITGFVEDY  268

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL-----SFLTARIPYV-----------  288
             G   +++  +        +F  +  + +++  L      F     PY+           
Sbjct  269  NGF--VQRRSIKTQNLPMLLFMIYSFMFILTGLLVTIPMMFAFGLFPYINAAFINYSAAA  326

Query  289  GEAANLAFSLLLTPFSFLYYYLIYSDL  315
                 +  ++    F       + +  
Sbjct  327  WALLGIVLAIGTGHFITYLLVALKTCN  353


>WP_180490098.1 zinc-ribbon domain-containing protein, partial [Escherichia fergusonii]
Length=56

 Score = 42.8 bits (97),  Expect = 0.010, Method: Composition-based stats.
 Identities = 8/33 (24%), Positives = 11/33 (33%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           + CPHC        +   A     RC  C +  
Sbjct  3   IVCPHCTTSYAVDPANFSAAGRRVRCARCQEVW  35


>MBI5826910.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=429

 Score = 47.5 bits (109),  Expect = 0.010, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  ++C  C  +     SK+       RC +C    
Sbjct  1   MI-IQCDKCHTKFRLDDSKVKGAGVKVRCTKCQNVF  35


>MBI1363544.1 hypothetical protein [Proteobacteria bacterium]
Length=251

 Score = 46.7 bits (107),  Expect = 0.010, Method: Composition-based stats.
 Identities = 10/68 (15%), Positives = 15/68 (22%), Gaps = 2/68 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    CP C      P++ L       RC  C          + +       +  P    
Sbjct  1   MIL-TCPACQTRYMVPATSL-RDGRLVRCTRCSHEWFQPAPTAAQKMAESKASQPPRPDP  58

Query  61  QRRIPSDR  68
                   
Sbjct  59  MEARRDRM  66


>HGB07418.1 DUF3426 domain-containing protein [Deltaproteobacteria bacterium]
Length=463

 Score = 47.5 bits (109),  Expect = 0.010, Method: Composition-based stats.
 Identities = 12/36 (33%), Positives = 16/36 (44%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+CP C A+ N    K+  K    RC +C    
Sbjct  1   MI-VQCPKCQAKFNLADEKITEKGLKVRCSKCKNVF  35


>MBA3821522.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=463

 Score = 47.5 bits (109),  Expect = 0.010, Method: Composition-based stats.
 Identities = 10/67 (15%), Positives = 14/67 (21%), Gaps = 0/67 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           VRC  C  E      +L     + +C  C                       P    +  
Sbjct  3   VRCEKCQTEYELDEQRLKPGGVTVKCTNCGHMFKIRKRTPTNVGPPLGDRAMPAASSRTD  62

Query  64  IPSDRLE  70
              D   
Sbjct  63  DRIDSGP  69


>PCJ60240.1 MSEP-CTERM sorting domain-containing protein [Planctomycetes 
bacterium]
Length=964

 Score = 47.8 bits (110),  Expect = 0.010, Method: Composition-based stats.
 Identities = 30/357 (8%), Positives = 83/357 (23%), Gaps = 6/357 (2%)

Query  93   FRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQN  152
             +   +    +S         F      L  +      L        L++     +    
Sbjct  120  QQWIINDFEFMSYQFFFISPGFFACIVYLSCVKSKMSQLKEFFFSFFLVVCLPVSIYFVF  179

Query  153  QNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL  212
                    L   ++     +     + +      + LF        +       ++L+I 
Sbjct  180  VVIDPFFRLIDFSFSQFIFTSFFMVVTVIFIGGFIRLFSLFLFSTHNSDLKNNTVLLIIY  239

Query  213  VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
            ++  G LLL     +   +     Y+   + + G+           G    +F  F  ++
Sbjct  240  LLTAGGLLLNNTIPIPTKFDSTWFYI--LEALNGVVLFWPW----KGRNSILFSYFGKIV  293

Query  273  VISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWL  332
            +    L F    +P++  +     +L           L     +          +     
Sbjct  294  MFPFVLYFFLVFLPFLPVSIFAIIALGGGFLMLSPIALFLLQGRIIINNFDEVSLLIGKS  353

Query  333  PLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQ  392
                 I   +L+        + ++                      +      +      
Sbjct  354  KTITFICIALLLLPAFFGLKAWRDRVIINQTMEYIYSPNYNDLSVPKLNIQTSAKILINL  413

Query  393  RLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLS  449
            ++     +  L  +          + P     + +     +  +       D  N +
Sbjct  414  KMMKQGNQYPLISKMYNQIVFNGMVLPNKKIDELYSTLTGDKIIDEYSMGYDRRNWN  470


>WP_178132558.1 hypothetical protein [Limnoglobus roseus]QEL16899.1 NINE protein 
[Limnoglobus roseus]
Length=396

 Score = 47.5 bits (109),  Expect = 0.010, Method: Composition-based stats.
 Identities = 7/32 (22%), Positives = 10/32 (31%), Gaps = 0/32 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPEC  32
           M    CP C +  +   SK        +C   
Sbjct  1   MIRFACPGCDSTFSVDDSKAGKSGKCPKCQTQ  32


>MBI5071026.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=234

 Score = 46.7 bits (107),  Expect = 0.010, Method: Composition-based stats.
 Identities = 8/32 (25%), Positives = 10/32 (31%), Gaps = 0/32 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
              C  CGA+      K+       RC  C  
Sbjct  2   RFTCDGCGAQYMISDDKVGPGGVKVRCKRCSH  33


>TMA29364.1 tetratricopeptide repeat protein [Deltaproteobacteria bacterium]
Length=475

 Score = 47.5 bits (109),  Expect = 0.011, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 14/34 (41%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            VRC  C A      +++  +  + RC +C    
Sbjct  2   EVRCDKCQARYKVDDARIGPQGLTMRCGKCQNVF  35


>MBI4701479.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=800

 Score = 47.8 bits (110),  Expect = 0.011, Method: Composition-based stats.
 Identities = 11/62 (18%), Positives = 19/62 (31%), Gaps = 0/62 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           V C  C        + + A+ ++ RC  C  +             +  + T     LQ R
Sbjct  3   VTCERCQTRYEFDEALVSARGTTVRCTCCGHSFRVHRPAGSGGFESWTVRTQAGWELQFR  62

Query  64  IP  65
             
Sbjct  63  AM  64


>HHA58865.1 hypothetical protein [Syntrophobacterales bacterium]
Length=165

 Score = 45.5 bits (104),  Expect = 0.011, Method: Composition-based stats.
 Identities = 7/41 (17%), Positives = 12/41 (29%), Gaps = 1/41 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
           M  + C  C  +     S +  +    RC  C      +  
Sbjct  1   MI-ITCERCQTKYRFDDSLVTGEGVWVRCTRCRHVFFQENP  40


>WP_006501536.1 hypothetical protein [Austwickia chelonae]GAB76785.1 hypothetical 
protein AUCHE_03_00020 [Austwickia chelonae NBRC 105200]SEW30682.1 
hypothetical protein SAMN05421595_1919 [Austwickia 
chelonae]
Length=361

 Score = 47.5 bits (109),  Expect = 0.011, Method: Composition-based stats.
 Identities = 21/145 (14%), Positives = 41/145 (28%), Gaps = 22/145 (15%)

Query  201  GSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGH  260
                      +  +      L +   L  + F     V   +  G ++AL +S  L+ G 
Sbjct  204  WFAMGGREAQLSNLTFVLWTLGLVQFLLLIPFSAAPAVSVIEGAGPVRALRRSFSLMRGF  263

Query  261  WWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL----------------------AFSL  298
                F   ++  ++   ++ + A    +G                               
Sbjct  264  LGRAFLMVLVSKLLIGIMTAVIAAPLQLGILVLSQTTGYQLMESAWSPAVNMLMDLLAGS  323

Query  299  LLTPFSFLYYYLIYSDLKANYRGPQ  323
            L  PF  + + L Y DL+    G  
Sbjct  324  LTLPFEAVVFALFYIDLRVRGEGLD  348


>KKT35684.1 hypothetical protein UW24_C0005G0032 [Parcubacteria group bacterium 
GW2011_GWA2_44_12]
Length=274

 Score = 46.7 bits (107),  Expect = 0.011, Method: Composition-based stats.
 Identities = 29/177 (16%), Positives = 54/177 (31%), Gaps = 4/177 (2%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                  GI    + L      +  +L             Q  + +      L  L     
Sbjct  47   WLDMYFGISFGIVFLLLTFFVTMSILYTLHSFLQNKAMAQSEVFITAARAFLSVLGLNFV  106

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
                      +     ++  L        L   L L +   S +     L   V + F  
Sbjct  107  LYIGLAFVVPLVGVWVLRELLFFTPLAAGLGRFLFLPLIALSGIA--FSLSMLVRYGFAI  164

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI--PYVGEA  291
            + L  + +   QA+ KS+ LV+G++W I  R V++  + +    +   +    VG  
Sbjct  165  FFLLFERVSVAQAMGKSKALVAGNFWHIALRQVVVGFVLIVGMVILGIVATMLVGAV  221


>OQX60845.1 hypothetical protein B5M51_09690 [Anaerolinea sp. 4484_236]
Length=268

 Score = 46.7 bits (107),  Expect = 0.011, Method: Composition-based stats.
 Identities = 14/147 (10%), Positives = 41/147 (28%), Gaps = 6/147 (4%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
               +  +        + + +     GLF ++           L+  +  + V    L + 
Sbjct  67   FWRFFGMNFLVSLPFIIVIVGLVGAGLFAAIAADSAAGAKEFLVGFIPAICVIFCCLFIF  126

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
               +   V        +  +++G   +L +   +   +     G  +L+ +I      + 
Sbjct  127  SLVVGMIV--QQASNAMILEDLGISASLIRGWDVFKNN----LGHLLLMAIILFIFGVIA  180

Query  283  ARIPYVGEAANLAFSLLLTPFSFLYYY  309
              I  +     L  + +          
Sbjct  181  GVILALPFLLILIPAAVSFVLGSAQST  207


>OGV19256.1 hypothetical protein A2X47_02255 [Lentisphaerae bacterium GWF2_38_69]HBM16739.1 
hypothetical protein [Lentisphaeria bacterium]
Length=775

 Score = 47.8 bits (110),  Expect = 0.011, Method: Composition-based stats.
 Identities = 20/222 (9%), Positives = 54/222 (24%), Gaps = 5/222 (2%)

Query  70   EIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGI  129
              +                   +F    S + S+  L      +    G        + +
Sbjct  285  PFRESAFWFIMAFIFIIYSYSLDFIPKDSIIHSLGILFLIRSMMLLGWGLRTRAFKDIDL  344

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
              A       L                 +IL      +    +           +  +  
Sbjct  345  KSAPFIPLFWLYFYGVILQFTDQYFVFISILWFIGLVVFYFWTKKRVRKNSLKFERSLQF  404

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQA  249
                   +  + +   ++ L IL++    L+ I   L   + + F       +    +  
Sbjct  405  LSMFLGLIAMLITCIGMVYLSILMMMAWFLICIGIQLARYINYSFHDLAEYIEGNYVI--  462

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
                ++L+ G    +   F++  +    +  +     +V   
Sbjct  463  ---FKVLLIGLSVPVLWLFIMFCIFIWGIGQVFGSYFFVPFV  501


>CDW80596.1 wd-40 repeat protein [Stylonychia lemnae]
Length=1493

 Score = 47.8 bits (110),  Expect = 0.011, Method: Composition-based stats.
 Identities = 33/421 (8%), Positives = 103/421 (24%), Gaps = 21/421 (5%)

Query  113   LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
              +    +  +  Y +  V          +      +     ++            +L L 
Sbjct  960   FYMSVLFAFIQTYHVISVYFLIAFGFYFMGFEILQMCLNKLSYLGDFWNIFDFLRVLLLI  1019

Query  173   WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
                 + F Y           +   L  +     L  L +       + L++  +   + F
Sbjct  1020  IFIYNAFKYEDWNRSQAMLQLLGTLNFLSWVRALSFLRLFQKTRIFIRLLVEVIYDMIPF  1079

Query  233   FFCQY---------VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
                               ++I  L+AL+ +  L  G +           +      F+  
Sbjct  1080  MIVLIGAVLGISLSFEVINDISMLEALKHNYRLTFGDFETDNYTTANWAL------FIIG  1133

Query  284   RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWML  343
              +       N+  +++   ++ +   ++ SD     +        + W      +     
Sbjct  1134  SVLIPLIMLNMLVAIMSDTYARVMSDVLPSDFLELNQMILEQEEIQFWNRRKGQLGYLHY  1193

Query  344   IPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLL  403
             +  L     +      +++L+  +          Q    LN+   +    + S   +   
Sbjct  1194  VTPLQRKEGNDWEGQIQKILALLQQQN------GQSPEVLNKLGDKIDLIIESQKVQFRD  1247

Query  404   SKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKV  463
             +  +    +         L                +    D  +       S   +    
Sbjct  1248  TNLKFDEQKKRFDRIQQELEIIMPKQQKDEIAEEEEDSDGDLFDEEYINDLSQIAKTPFE  1307

Query  464   LDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLEL  523
                     +      ++       +++ +                    ++ S++ +++L
Sbjct  1308  NFSYQMKQFVNYRWNQYSPTKQGKLHKNESIKAIRQSFDALYIDAPDKSEIESLIQEIDL  1367

Query  524   T  524
              
Sbjct  1368  N  1368


>MBF0571846.1 hypothetical protein [Candidatus Omnitrophica bacterium]
Length=235

 Score = 46.3 bits (106),  Expect = 0.011, Method: Composition-based stats.
 Identities = 22/112 (20%), Positives = 49/112 (44%), Gaps = 7/112 (6%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
               +  +      F   +     V+A +    +  L+++  L+ G ++  FG ++ L+++
Sbjct  109  IVTTGWMWCGTAYFSYKYALLPTVVAYEKPVFMSYLKRNAQLIKGSFYDFFGAYLALVLV  168

Query  275  ---SLTLSFLTARIP----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
                  +SF ++ IP     +    ++   + LTPF F Y Y +Y++L    
Sbjct  169  LSPVFAISFFSSSIPQPYQMILSIVSIVLWVPLTPFCFHYLYFVYTELIQRE  220


>NVN98292.1 hypothetical protein [Geobacteraceae bacterium]
Length=1134

 Score = 47.8 bits (110),  Expect = 0.011, Method: Composition-based stats.
 Identities = 67/554 (12%), Positives = 126/554 (23%), Gaps = 57/554 (10%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
              CP C       + KLP +   A CP+C  +       +  + +    +        + 
Sbjct  323  FACPECRLAGKIATEKLPEQGLLATCPKCKTSFPIGHNAASSSLSAIVKSVDSDTSALKE  382

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
            +   + + +  T +          Q  R      S        +  +W+         + 
Sbjct  383  LVLPKADSEISTRSEEFDQLINGKQLLRMKATCLSYFLLQGLAIYFAWQHGQGGVSAPII  442

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            I  L +    + +                       L        +  +  T        
Sbjct  443  IVQLFVFALPSLLAF------------GCVMAMRKFLGIVALATGVIFAITTAIAVYCFQ  490

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF--FCQYVLAD  241
                  F    L       + LL +   LV       L      +       F       
Sbjct  491  GEANKWFPLKNLVFDSAFKYALLALASGLVGVSAGCFLAAVERRYGFAPQELFRATFRRL  550

Query  242  DNIGGLQALEKSRLLVSGHWWAIF--------------GRFVLLLVISLTL---------  278
                 L A     +L+ G     F                 +   ++ +           
Sbjct  551  RENSFLIAGWSLFVLIEGIMIPQFHGEAGLIVLPIVVVVDNLPFFIVWMLARVTSTEEVP  610

Query  279  SFLTARIPYVGEAANL--------------AFSLLLTPF--SFLYYYLIYSDLKANYRGP  322
            S   A I +VG  A +              AF   +TPF    +  +++        R  
Sbjct  611  SARLASILFVGVYALMTLPLHSSTMVLNPVAFMFQITPFIGLAVMVFVMIFLSFFRTRTT  670

Query  323  QHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPD  382
               P  +  + L         +    L              S  +     L         
Sbjct  671  VGEPELKGVVSLLPVGSYVYTVRSAALGVAILAACYGSFNGSRERFDLGNLFAAQVYPEI  730

Query  383  LNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLEL  442
            +   L  +P        + +  +     +      G  TL  ++     Q P    +L  
Sbjct  731  VFTDLAGKPLGHLPVSVEFIRHEPISLLNIHAEQGGRFTLETNK-NGILQLPIRKPRLTT  789

Query  443  SDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTD--ENDLFSGI  500
                 L  A K S  + +          +Y   +             +      ++  G 
Sbjct  790  LGIAGL-FAGKRSIFLSVLDPRWQVPDSVYSPAYELRDMETRQTITCKPADIRANIMRGT  848

Query  501  RSIYLRQGTQAEQV  514
              I L      E V
Sbjct  849  HLINLMPAKTWEDV  862


>RDW79714.1 hypothetical protein BP6252_04352 [Coleophoma cylindrospora]
Length=1154

 Score = 47.8 bits (110),  Expect = 0.011, Method: Composition-based stats.
 Identities = 36/396 (9%), Positives = 67/396 (17%), Gaps = 8/396 (2%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
                  +  +L   A  +             + +   LL +S     M ++  K  V   
Sbjct  44   YIILLGWIDVLAYTALNVLTYTTGGFIDDNPSVLGLKLLYISSFLFDMGVFFPKASVLAL  103

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
                            L ++IL V  G ++     + +C            +      A+
Sbjct  104  YWDLTPPAVFPELRRALNVVILYVFTGFVIAFCLDMFWCKPISDNWNPDKLERSVWYSAV  163

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
                         I    +   +              +G  +           S   Y  
Sbjct  164  VFDTNYAFNFGSDIMIFIIPFFLFKHLSLSRPQMWALLGSFSLGVA---TMAISISRYVW  220

Query  311  IYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQ  370
             Y DL          P             G                   +      +   
Sbjct  221  QYHDLDKAIGNDPEGPATPVGPEGPDGPDGPDGPVKPGSPDTPVGPEGPDGPDDPIEPGS  280

Query  371  QRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWAD  430
                  P+     +  +  E         +                 GP           
Sbjct  281  PDTPVGPEGPDGPDAPVGPEDPDRPVGPVEPGSPDIPVGPDGPDGPDGPDDPVEPGSPDT  340

Query  431  DQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQ  490
               P      +    P     +                          E           
Sbjct  341  PVGPEGPDGPDAPVGPE-DPDRPVGPVEPGSPGTPVGPDGPDGPDDPVEPGNPDTPASPD  399

Query  491  TDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLP  526
            T                    E     +G +E + P
Sbjct  400  TPVGPEGPDGPDAPAGP----EDPDRPVGPVEPSSP  431


>ARU42875.1 hypothetical protein CCB81_01390 [Armatimonadetes bacterium Uphvl-Ar2]
Length=171

 Score = 45.5 bits (104),  Expect = 0.011, Method: Composition-based stats.
 Identities = 22/158 (14%), Positives = 47/158 (30%), Gaps = 12/158 (8%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
               A++    +      F+   + +         G     +  L+ +++ +    G L  
Sbjct  21   FAGAFLAGPGNLSLARCFVKARRGEAVTTNDATFGFSKFLAAGLMSLIIQVASQLGILAC  80

Query  222  IIPGLLFCVWFFFCQYVL-ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
             +  L+           +  D++ G   AL  S      H W     + + +++ L    
Sbjct  81   CVGILIVGALLMGSYAAMAHDESAGLGDALMNSIGAFKDHVWKATWFYFVCILVILA---  137

Query  281  LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
                    G  A     L+  P  F    L Y ++   
Sbjct  138  --------GILACGVGLLVSVPVGFGAMTLAYLNMTER  167


>WP_115937294.1 hypothetical protein [Aestuariispira insulae]RED49087.1 hypothetical 
protein DFP90_10664 [Aestuariispira insulae]
Length=245

 Score = 46.7 bits (107),  Expect = 0.011, Method: Composition-based stats.
 Identities = 23/200 (12%), Positives = 56/200 (28%), Gaps = 5/200 (3%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
                L +    +      +    +            +   A+    V   +  ++     
Sbjct  42   YAWQLHLAGWSLENYPDLVLKGGMRTDDIDWLDLASSALGALADGIVVLGVYLMAKGVPV  101

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
             F  +    +     + +          +  L    +  G L  +    L  V+FF    
Sbjct  102  SFPRLVYGVIAAAPRLVVFALVYSLLYQIARLAPGSMF-GLLFALGLYGLQFVYFFLFTP  160

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGR-FVLLLVISLT---LSFLTARIPYVGEAAN  293
            +   +N   +  + +S  L  G    + G       V+++    L  L   +PY      
Sbjct  161  ICLLENRPVIATMSRSVDLTRGKRLGLLGLAMFAFYVMTMLHVTLLQLMKGLPYPIFILQ  220

Query  294  LAFSLLLTPFSFLYYYLIYS  313
                  ++ +  +  ++IY 
Sbjct  221  ALLFAAISGYFSVLSFVIYQ  240


>TAH00994.1 hypothetical protein EAZ17_05775 [Sphingobacteriales bacterium]
Length=310

 Score = 47.1 bits (108),  Expect = 0.011, Method: Composition-based stats.
 Identities = 28/200 (14%), Positives = 56/200 (28%), Gaps = 23/200 (12%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +  + +   +     + +          +     AIL+  +A   +  S         
Sbjct  42   FLLIGVFLGAGYFMQIFSAMTAGKANTLFGDWKIWVAILVIYMAVNGMATSIYLYMQVWE  101

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILV-----------------------VGGGS  218
                       +K+  +   S  L  +L+ L                        +G   
Sbjct  102  EEDRRATPGELLKVIAKPFFSNMLYSVLMFLGLMLVMVPVVLVAGGAGSAGSIALLGLFM  161

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            L  +I  L+   +      V          A   + +L+ G+WWA  G  ++L +I    
Sbjct  162  LFAMIGLLILFPYLMLIYPVNTIAGKEFGNAFSGAAMLLRGNWWASLGYVMVLFIIYYIF  221

Query  279  SFLTARIPYVGEAANLAFSL  298
            SFL   +  +   A      
Sbjct  222  SFLVQMMLTLIFGAGALMGA  241


>HCM98355.1 hypothetical protein [Rhodobacter sp.]
Length=140

 Score = 44.8 bits (102),  Expect = 0.011, Method: Composition-based stats.
 Identities = 8/43 (19%), Positives = 14/43 (33%), Gaps = 0/43 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
            + CP+C A        +       +C  C  T +    E  +
Sbjct  2   QIVCPNCEAHYEVGYDSVGDAGRQVQCSNCGHTWLAMRQEEDQ  44


>WP_174403930.1 zinc-ribbon domain-containing protein [Desulfovibrio sp. HN2]GFM31809.1 
hypothetical protein DSM101010T_01740 [Desulfovibrio 
sp. HN2]
Length=287

 Score = 46.7 bits (107),  Expect = 0.011, Method: Composition-based stats.
 Identities = 30/283 (11%), Positives = 66/283 (23%), Gaps = 8/283 (3%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             +RCP CG  +    + L    + A CPEC     F  A  +        +         
Sbjct  2    EIRCPQCGYGKEVDEAALTPSVTFATCPECGHRFRFRDAVPENEPDFTLDSGQEEPARDS  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
             +  +     +              QP++      +  R +  + A              
Sbjct  62   AVVEEDHPHPTSHTAPEEGPMPSWGQPKKAEAEIWAYAREVGVVAALIANTRHILQRPQG  121

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT-----VAYILLGLSWMTGS  177
                +   ++F    S  L+     +  +                  A    G       
Sbjct  122  FFATMSREVSFLVPLSYYLIISLVGIGLETFWQTAVGNPILPEAFSGAVSSPGEMIQLVM  181

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
                +    +     +      +       +   + V   S    +  ++  V       
Sbjct  182  FSPVVLMLYLYALSGILHLGLVLTGAAKGGVRTTMRVVCYSSAADMLSIIPVVGSLLGGL  241

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            +        +  L       +       G   L L++ + L+ 
Sbjct  242  LRLW---IIVVGLRTVHGTTTARVLPSVGLLFLSLLVIVALTM  281


>HAA72950.1 hypothetical protein [Planctomycetaceae bacterium]
Length=462

 Score = 47.5 bits (109),  Expect = 0.011, Method: Composition-based stats.
 Identities = 28/338 (8%), Positives = 70/338 (21%), Gaps = 20/338 (6%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            + C  CG         +  +     C           A  +R  T    +        + 
Sbjct  120  ILCRLCGTRIYVRIRDVGQEVKCPDCHSQNLIDKPKTARRKRPTTDHTESDHLDLVKLQP  179

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
                      +     +       + +  +         +        +           
Sbjct  180  ETEVARPENQEMRKMAQQTLQQAREKKPTYFKEEQPGLEVETPPIVLLKFLLNPHTIGYW  239

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            + L   V   A  F   +  P   LN       W       +              + I 
Sbjct  240  VILSVGVTILASCFYFAINPPGQDLNASIAKVIWVSFWILTSVSGCLAIGFLSVTLLTIA  299

Query  184  KTDVGLFRSMKLGLRHVGSFTLL------------LILLILVVGGGSLLLIIPGLLFCV-  230
                    +++      G                   L   ++    +   +P  +  V 
Sbjct  300  GETTNGTLNIEHWPPVTGFLDWALDTLYVVNSAVIAALPGAIIAAPLIFFEVPMWISLVP  359

Query  231  ------WFFFCQYVLADDNIGGLQALEKS-RLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
                    F   +    +N      +       +  +       ++   ++ +    +  
Sbjct  360  VALSVSVLFSILFASMLENRSCFNPVALPIWRSLKANTSCWTMFYMAGPIMLIGAYAIGC  419

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
               +    AN   S  +    F+Y+ L+   +      
Sbjct  420  GYAFGSILANGFISAGIVGLLFVYFRLLGWVIWKVREQ  457


>PIT94560.1 hypothetical protein COT98_02950, partial [Candidatus Falkowbacteria 
bacterium CG10_big_fil_rev_8_21_14_0_10_39_9]
Length=100

 Score = 44.0 bits (100),  Expect = 0.012, Method: Composition-based stats.
 Identities = 25/99 (25%), Positives = 52/99 (53%), Gaps = 10/99 (10%)

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY---  287
            +++F  Y L  +++ G+ A++ S+ LVSG+WW +FGR + LL++++  SF+ +       
Sbjct  1    FYYFALYFLIFEDVKGMNAIKSSKALVSGYWWGVFGRTMFLLLVAMLASFVLSIPFMFLS  60

Query  288  -------VGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
                   +         +++ P   ++ YL+Y DL+   
Sbjct  61   EGTIMYTIYSLIQNLIWVVVIPVFMVFTYLMYKDLRGIK  99


>WP_013627030.1 hypothetical protein [Rubinisphaera brasiliensis]ADY58287.1 MJ0042 
family finger-like protein [Rubinisphaera brasiliensis 
DSM 5305]
Length=461

 Score = 47.5 bits (109),  Expect = 0.012, Method: Composition-based stats.
 Identities = 9/31 (29%), Positives = 14/31 (45%), Gaps = 0/31 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPE  31
           M    CPHC A    P S +  + +  +C +
Sbjct  1   MIQFECPHCAAVLRVPDSAMGQRGTCPKCKQ  31


>MBQ71941.1 hypothetical protein [Planctomycetaceae bacterium]
Length=487

 Score = 47.5 bits (109),  Expect = 0.012, Method: Composition-based stats.
 Identities = 35/307 (11%), Positives = 72/307 (23%), Gaps = 6/307 (2%)

Query  6    CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
            CP CG   +              CP C Q    D        T+ +  +C     +    
Sbjct  18   CPECGEHFDPTGHLFKPGSVRFCCPHCDQAYFGDGEGGHLNPTSFDCVSCGRAIDESECI  77

Query  66   SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL--LG  123
                E  S           +       ++     +                    L    
Sbjct  78   IRPREDGSHEDRTVPDLSPWHDSERTAWKRFWGTVGMSMVRPGALGRGLPMDAKLLGGFW  137

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
             ++L   +A         L                         L+ ++ +  +    + 
Sbjct  138  FFVLVNTMALVFGLGPFFLAMFLAPMFFGGGGGLGGGGIPPFAFLIPIAGLMLAWIPMLF  197

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
                     + L L+  GS T      +  +   +  L+I  + F  +       +    
Sbjct  198  IGLCFYGGIIHLVLKLSGSTTGGFGRTLTSITFSTGPLMIGAVPFLGYCLQTPAQIWVIV  257

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
               +  L      VSG         +  + + + L  +   I   G AA +  ++     
Sbjct  258  TTII--LLTQSQAVSG--VRATFAVLTPVFLGIVLYAVFLGIMMFGTAARIPVAVPPAMA  313

Query  304  SFLYYYL  310
                  +
Sbjct  314  QGKAIAV  320


>TFH30533.1 hypothetical protein E4G97_05570, partial [Deltaproteobacteria 
bacterium]
Length=113

 Score = 44.0 bits (100),  Expect = 0.012, Method: Composition-based stats.
 Identities = 7/33 (21%), Positives = 13/33 (39%), Gaps = 1/33 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECC  33
           M  + C  C A      S++    +  +C +C 
Sbjct  1   MI-IECHTCHARFRLDESRIKGSGARVKCRKCG  32


>SMO97843.1 hypothetical protein SAMN06265173_13929 [Lutimaribacter litoralis]
Length=228

 Score = 46.3 bits (106),  Expect = 0.012, Method: Composition-based stats.
 Identities = 13/110 (12%), Positives = 31/110 (28%), Gaps = 18/110 (16%)

Query  527  LAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDR-TDLLNVHASNSHAEPLR  585
             +I +L+ + +D      +      +  + S A      G    D++ V A ++    + 
Sbjct  79   TSITTLEFSSSDNLFGDDVQNVSFRISDIDSGADPYTASGTSMLDVVTVRAYDASGNLIG  138

Query  586  EIGFTWQ-----------------KSGDAFSLRQMFDGNIESITVLVAGD  618
                                     +    S+     G +  I +  A +
Sbjct  139  VNFTAGSAVTAAGDTLTGGPMNYEPTDGDASVLVEIAGPVSRIEIEYANE  188


>MSP94805.1 DUF3426 domain-containing protein [Alphaproteobacteria bacterium]
Length=251

 Score = 46.7 bits (107),  Expect = 0.012, Method: Composition-based stats.
 Identities = 12/64 (19%), Positives = 19/64 (30%), Gaps = 1/64 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    CP CG       ++  A   + RC +C       P  ++     + I   P    
Sbjct  1   MIL-SCPSCGTRYQADGARFAAPGRNVRCAKCAHVWFQTPPAAEIEPEPEPIIHAPPQPE  59

Query  61  QRRI  64
               
Sbjct  60  SFAR  63


>MBI2374423.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=52

 Score = 42.4 bits (96),  Expect = 0.012, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 13/34 (38%), Gaps = 0/34 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M    C  CGA  +    +L  +    RC +C  
Sbjct  1   MLPFACDSCGALYSVERKRLGPRGFRIRCGQCGA  34


>KWT82062.1 hypothetical protein ASN18_2548 [Nitrospirae bacterium HCH-1]
Length=231

 Score = 46.3 bits (106),  Expect = 0.012, Method: Composition-based stats.
 Identities = 30/162 (19%), Positives = 59/162 (36%), Gaps = 7/162 (4%)

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG  215
             + I         L                   +F S    L+ +G      IL+++ + 
Sbjct  58   LFMISALNYFLQALSHCVTIAMAIEIENNRMCNVFDSSLTVLKRIGRILPASILILIFLT  117

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
             G +LLI+P L  C  F     ++ +++I    A+ KS   +  ++       +   + S
Sbjct  118  AGLMLLILPALYVCFAFMLTFIIIMNEDITPAMAMMKSYRAMKDNYSR--SIVLFFFLSS  175

Query  276  L-----TLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
            L      ++ L  ++ Y G  ++L  S L   F+ +     Y
Sbjct  176  LWITVSIINLLLGQLHYFGVISSLFLSGLFMAFATITLLKFY  217


>VDO31919.1 unnamed protein product [Brugia timori]
Length=766

 Score = 47.8 bits (110),  Expect = 0.012, Method: Composition-based stats.
 Identities = 31/342 (9%), Positives = 71/342 (21%), Gaps = 17/342 (5%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
             CP C        + L A  +     +                             +   
Sbjct  417  TCPVCFTVYIATYASLIATLAK-WKNDKPGPDGRISGAVCGQDPRYCDDPVSVKPYEWPD  475

Query  65   PSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI  124
               +     +       N                          D  +           +
Sbjct  476  DITQWPTCKQMAFLVPENVVHMQCEITGRPCCIQLHGQCRITTRDYCDFVEGYFHENATL  535

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
                  L+        L K       +     +         + + +  +       +  
Sbjct  536  CSQVSCLSEVCGMLPFLRKDQPDQWYRLFIPLFLHAGIIHCILTIFIQILYMRDLEKLLG  595

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ---YVLAD  241
                    M  G+    +  + +     V   GS + +   +   V + +        A 
Sbjct  596  WARIALLYMVSGVGGYLAGAIFVPYRPEVGPAGSHVGMFAAMYVDVLYSWNLLERPWHAV  655

Query  242  DNIGGLQ-ALEKSRLLVS-GHWWAIFGRFVLLLVISLTLSF-----------LTARIPYV  288
              +     AL     L    +W  +FG    +L+    L +           +   +  +
Sbjct  656  VQLSLFTLALFTIGTLPWVDNWAHLFGFIFGILISLAVLPYIQTKRHNRTRRIIIVVTSL  715

Query  289  GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
              A +L   LL   +    +  +Y +         H    + 
Sbjct  716  TTALSLFIVLLAVFYWPSGFNCVYCEYFNCIPYTDHFCDNQG  757


>WP_135244332.1 hypothetical protein [Polymorphobacter arshaanensis]TFU05607.1 
hypothetical protein EUV02_00795 [Polymorphobacter arshaanensis]
Length=275

 Score = 46.7 bits (107),  Expect = 0.012, Method: Composition-based stats.
 Identities = 19/172 (11%), Positives = 47/172 (27%), Gaps = 4/172 (2%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            W  L +  +  ++    + +                    +           L+ +  ++
Sbjct  59   WSFLIVNYVLALIGTLTVAALAGATAELRGQSVGTVLAGTVKPMMKLLAASLLALLIMTV  118

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
                    +GL   +   L  V          +L +       I   L   +       +
Sbjct  119  AFTAIGFALGLLAGIVGILSGVKLPDTQSAGWVLFLILIFTAFIGVELYVVLRLSALPGI  178

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
               + IG   +L ++  +  GH        + L +I +  +     + Y+G 
Sbjct  179  ALFERIGIKASLRRTWRMTRGHM----LTLLALALIYIVAAVPIVALVYLGT  226


>WP_121500478.1 hypothetical protein, partial [Pseudomonas aeruginosa]
Length=192

 Score = 45.9 bits (105),  Expect = 0.012, Method: Composition-based stats.
 Identities = 33/175 (19%), Positives = 60/175 (34%), Gaps = 0/175 (0%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             +   ++   + F        +   +  +P        I       +          + I
Sbjct  1    FVVYSVVIQTITFILTLLFGGIGLMSEADPIAFAVGQMIASLVATAVGYPFFTGLTMIGI  60

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                     F  M      +    L  +L++L+V  G  LLIIPGL   V +     ++A
Sbjct  61   RRAADQPATFNEMFNYFGMLVPLLLTGLLMMLMVYVGFFLLIIPGLYLSVAYMLALPLVA  120

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
            +  +   QALE SR  +S HW+ +FG  ++L ++ L                 + 
Sbjct  121  ERGLTPWQALETSRKAISRHWFKVFGLLLVLSLLMLVSMIPLFIGLIWTGPLFVV  175


>CDD09235.1 unknown [Clostridium sp. CAG:349]
Length=320

 Score = 47.1 bits (108),  Expect = 0.012, Method: Composition-based stats.
 Identities = 24/221 (11%), Positives = 62/221 (28%), Gaps = 8/221 (4%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
               +              ++  +V++F    +     PA      N              
Sbjct  83   FYHTVIDLMNATNATAAFWVTVVVVSFFIRLAMSFCYPAISDVISNFMSSNMSYGLLSNI  142

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
            +         ++F  I    + +   + +     G   L   ++ +     +L+  +  +
Sbjct  143  LKNFSLCAKYALFHTI----ITMVTDIAIFFAIYGVTKLFFPIIGVFAFALALVCAVVLI  198

Query  227  LFCVWFFFCQYVLAD--DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
               + F                  AL+     +  ++  IFG   +++ +  +L  LT  
Sbjct  199  ALRLTFSSGVIPEMVVGGEKKYFGALKTGFSYMKKYFGKIFGANCIVIFVIYSLIMLTTL  258

Query  285  IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
            I +    A      +L  +  +   + Y + K+        
Sbjct  259  ITF--GVAFFVLGSMLVTYIHVMQLVFYYESKSMRYYTDAY  297


>RMF19123.1 tetratricopeptide repeat protein [Candidatus Dadabacteria bacterium]
Length=1066

 Score = 47.8 bits (110),  Expect = 0.012, Method: Composition-based stats.
 Identities = 11/32 (34%), Positives = 15/32 (47%), Gaps = 1/32 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPEC  32
           M  + CP CGA  N P  K+       +C +C
Sbjct  1   MI-IACPECGARYNLPDEKIAGGAVKVKCKKC  31


>MBA3954950.1 hypothetical protein [Candidatus Dependentiae bacterium]
Length=250

 Score = 46.3 bits (106),  Expect = 0.012, Method: Composition-based stats.
 Identities = 27/184 (15%), Positives = 66/184 (36%), Gaps = 6/184 (3%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             I  L             ++   T     +       + +    +   +S +   + + +
Sbjct  40   LIATLTWRSFDLTDLLLDVITGTTSTYDFSALIVPFSIWSACVLLKCYISLIIIQLTLQL  99

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
               +  +  +    L+    + + ++ + L+V     L++ PG++    +FF     ADD
Sbjct  100  YDNNKAVKLNCLPSLKTFIKYVIAVMAIGLLVTLPWFLVV-PGMIMLTKYFFVSVASADD  158

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT-----LSFLTARIPYVGEAANLAFS  297
            +    QA ++S     G  W +FG  +++ V+ L       S+    +  V +      S
Sbjct  159  STSLKQAFKRSSRATFGVRWTLFGYTLMVGVLILLSLQFFSSYFFVFVNAVLQLILTLAS  218

Query  298  LLLT  301
            + + 
Sbjct  219  VYVY  222


>PYN59115.1 thiol reductase thioredoxin, partial [Candidatus Rokubacteria 
bacterium]
Length=52

 Score = 42.4 bits (96),  Expect = 0.012, Method: Composition-based stats.
 Identities = 9/32 (28%), Positives = 14/32 (44%), Gaps = 0/32 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPEC  32
           M  +RCP CG     P ++  +     RC + 
Sbjct  1   MIELRCPWCGTTNRIPDTRAGSPARCGRCGQP  32


>WP_150448468.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Microbacterium rhizomatis]KAA9111702.1 hypothetical 
protein F6B43_06310 [Microbacterium rhizomatis]
Length=338

 Score = 47.1 bits (108),  Expect = 0.012, Method: Composition-based stats.
 Identities = 12/84 (14%), Positives = 30/84 (36%), Gaps = 1/84 (1%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
                L  I   L            +  ++     A+ +S  L+ G +WA  G  VL+ ++
Sbjct  51   VLAVLAAIPLTLWLSTKLLLVPATIILEHASIRTAIVRSWTLIRGRFWAALGVIVLISLV  110

Query  275  SLTLSFLTARIP-YVGEAANLAFS  297
               ++ + +    ++    +   +
Sbjct  111  FGAVAQVVSIPFSFLSTGLSTVIA  134


>MBF0572433.1 zinc-ribbon domain-containing protein [Desulfamplus sp.]
Length=211

 Score = 45.9 bits (105),  Expect = 0.013, Method: Composition-based stats.
 Identities = 11/43 (26%), Positives = 15/43 (35%), Gaps = 1/43 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
           M  + C  C  + N   S L    S  RC  C       P ++
Sbjct  1   MI-ITCEKCSTKFNLDESVLKKDGSKVRCSMCKHIFKAYPPDN  42


>PIE56796.1 hypothetical protein CSA34_02200 [Desulfobulbus propionicus]
Length=59

 Score = 42.4 bits (96),  Expect = 0.013, Method: Composition-based stats.
 Identities = 8/35 (23%), Positives = 15/35 (43%), Gaps = 1/35 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQT  35
           M  + C  C  + N   +K+ + ++   C EC   
Sbjct  1   MIAI-CEECSKKYNVDETKIKSNRARFACFECGHM  34


>NLE57325.1 hypothetical protein [Planctomycetes bacterium]
Length=131

 Score = 44.4 bits (101),  Expect = 0.013, Method: Composition-based stats.
 Identities = 10/28 (36%), Positives = 13/28 (46%), Gaps = 0/28 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARC  29
              RCP C A    P+S +  +   ARC
Sbjct  8   IETRCPACDARYRVPASSIGHRARCARC  35


>PHQ80397.1 hypothetical protein COB66_04855 [Coxiella sp.] [Coxiella sp. 
(in: Bacteria)]
Length=282

 Score = 46.7 bits (107),  Expect = 0.013, Method: Composition-based stats.
 Identities = 27/226 (12%), Positives = 67/226 (30%), Gaps = 23/226 (10%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
             +GL        + A +   +  +   A  ++         ++   ++      +     
Sbjct  36   FYGLSVFVEFNYLFAESLNLAKNITPTALTVSLTIMVLLLLVITNLLSCTFYVCTLHRLQ  95

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF----F  233
                     +   +   L +    S  L++ ++   V        I  +   V +     
Sbjct  96   NKPCTIGQGLNESKRFILKMLAWSSIKLVMDIIFRAVASLGRFGRIVEIFMQVSWGILNL  155

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL------------  281
            F   ++   N+G + AL++S  +V  +W   F    LL ++++    +            
Sbjct  156  FVLPIMIMQNVGPITALKRSGAMVKKNWRKNFSISFLLSLVTIAFVVIGYFLMNSMQVSP  215

Query  282  -------TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                      +  +     L FSL++ P   +    +Y        
Sbjct  216  FAYSTRTVYLLGTLVTIWFLLFSLVMRPLLQISNAALYLFNSDQKE  261


>RPJ23292.1 hypothetical protein EHM26_00200, partial [Desulfobacteraceae 
bacterium]
Length=279

 Score = 46.7 bits (107),  Expect = 0.013, Method: Composition-based stats.
 Identities = 9/39 (23%), Positives = 16/39 (41%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  + C  CG +     S++ A  + A+C  C   +   
Sbjct  1   MIII-CEECGKKYQIDPSRIKATGAKAKCTACGNMMTIP  38


>MBL42700.1 hypothetical protein [Rhodospirillaceae bacterium]
Length=320

 Score = 46.7 bits (107),  Expect = 0.013, Method: Composition-based stats.
 Identities = 11/44 (25%), Positives = 14/44 (32%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  V CP C  +       L       RC  C  T    P + +
Sbjct  1   MI-VTCPSCDTKYEVLRDILIPNGRRLRCVRCKHTWTERPPDDE  43


>NBU28287.1 hypothetical protein [Caulobacteraceae bacterium]
Length=278

 Score = 46.7 bits (107),  Expect = 0.013, Method: Composition-based stats.
 Identities = 8/42 (19%), Positives = 12/42 (29%), Gaps = 1/42 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
           M    CP C         ++ A   + +C  C       P  
Sbjct  1   MIL-TCPECATRYFVGDDQVAASGRTVKCAACKTRWTAHPEP  41


>RME97676.1 hypothetical protein D6773_15455, partial [Alphaproteobacteria 
bacterium]
Length=280

 Score = 46.7 bits (107),  Expect = 0.013, Method: Composition-based stats.
 Identities = 10/55 (18%), Positives = 15/55 (27%), Gaps = 1/55 (2%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP  56
             + CP C      P + +P      RC +C       P +  R           
Sbjct  7   FLLECPSCRTRYEVPVA-IPEGGRKVRCAKCEHVWTVMPEDLIRPGALPAWDEEE  60


>WP_161808505.1 hypothetical protein [Methyloglobulus morosus]
Length=107

 Score = 44.0 bits (100),  Expect = 0.013, Method: Composition-based stats.
 Identities = 23/98 (23%), Positives = 36/98 (37%), Gaps = 8/98 (8%)

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-------  286
               Y +  D++GG  +++ S  LV GHWW     F +  +I   L  +   +        
Sbjct  3    LSIYFIVLDSLGGYASIKASHKLVWGHWWKTATVFTIPTIIICILYGIFGALAAYMGTDK  62

Query  287  -YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                +      S   TPF     Y+ + DLK    G  
Sbjct  63   KLAIDITIQIISAFTTPFLVSVGYVQFHDLKLRKSGSD  100


>HHI68480.1 hypothetical protein [Planctomycetes bacterium]
Length=294

 Score = 46.7 bits (107),  Expect = 0.013, Method: Composition-based stats.
 Identities = 25/202 (12%), Positives = 57/202 (28%), Gaps = 35/202 (17%)

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL---VVGGGSLLL  221
                L ++ +      Y+    + +    + G+  +     + I+       +  G +L 
Sbjct  92   FLSFLLMAGVVAGAVEYLAGESLPMGTCFRRGIVRLFPALGVSIVSGFLSLFILVGGILA  151

Query  222  --------------IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
                           +  L    WFF        +  G + AL++S  LV   ++ +F  
Sbjct  152  LSFLKLPSPLILVPYLLSLAATSWFFVAVPAAVGERRGVIGALKRSLALVKDRFFTVFLS  211

Query  268  FVLLLVISLTL--SFLTARIPY----------------VGEAANLAFSLLLTPFSFLYYY  309
             + + ++   +    L                      V     L   L  + F      
Sbjct  212  LLAVSLLQGAVDRLLLNGLHALHLQKTWISEVNMTWVRVYMVLTLLAGLFFSLFYAALSA  271

Query  310  LIYSDLKANYRGPQHPPIKRQW  331
            +I+S L           + + +
Sbjct  272  VIFSRLVEEKEKRSLRDLAKVF  293


>WP_198265140.1 efflux RND transporter permease subunit [sulfur-oxidizing endosymbiont 
of Gigantopelta aegis]
Length=731

 Score = 47.5 bits (109),  Expect = 0.013, Method: Composition-based stats.
 Identities = 37/418 (9%), Positives = 106/418 (25%), Gaps = 26/418 (6%)

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH---VGSFTLLLILLILVVGGGSLLLIIP  224
             L    +   +  +   + +  F ++ +             L   +   V  G ++++  
Sbjct  24   ALIGLSLVLLVTWFFLGSRIAFFTTIGIPFTLAATFWILHALGQTVNNAVLLGVVIVLGM  83

Query  225  GLLFCVWFFFCQYVLADDNIG----GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
             +   +      Y    + +      ++A+++    V+         F+ L+++   +  
Sbjct  84   LVDDAIVVVESMYTRMREGVDSVQAAIEAIQEVFAPVTASVLTTMAAFLPLMLLPGIMGE  143

Query  281  LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFG  340
                IP V   A     +        +  +   +  +     +       +         
Sbjct  144  FMLVIPLVVTIALAISLIEAYWMLPAHLIVSKPNYSSEAHASKIRSRFGLYRVQIFRERF  203

Query  341  WMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYK  400
               +  + +  L       + +L     +        Q    +      EP  +   + K
Sbjct  204  TQKLKSVYIRFLIGSLRQPKIMLLGLVLLFSSAIYALQSGKIVIDFFAGEPFPVIYVNVK  263

Query  401  LLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEI  460
            +        T      L                           F  +           I
Sbjct  264  MPEGTPLDYTLTAVQQLENNIKAQLARDDYRD----MASYAGFYFTQVEPLFGEHYGQII  319

Query  461  DKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQ-AEQVHSILG  519
              + D   +DL     + +             +    +G++ + L Q         ++  
Sbjct  320  LSLNDISPQDLTHLHQTIKQ------------QFKKMTGVKEMSLLQLKDGPPVNRAVSM  367

Query  520  KLELTLPLAIESLQLTRND-IGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHA  576
            K++     +I++  +     +     +    + L   GS  + L+   D      +  
Sbjct  368  KVQGADFESIQAAVVDLKRFLESLSDVENISV-LDSAGSMELALQLKQDAIHRAGLSP  424


>WP_013759537.1 DUF975 family protein [Treponema brennaborense]AEE17836.1 protein 
of unknown function DUF975 [Treponema brennaborense DSM 
12168]
Length=315

 Score = 46.7 bits (107),  Expect = 0.013, Method: Composition-based stats.
 Identities = 21/143 (15%), Positives = 43/143 (30%), Gaps = 12/143 (8%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
             +  + W T    I++    +G         R   + T+  +L +   G   L   I   
Sbjct  109  AITAMLWATLWQLIWMIPLIIGAIVFFTASFRVTKAETVGSMLFMFSGGVLYLAGFIILY  168

Query  227  LFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
               + +    ++LAD+  +   +AL  S  +  G    I    +                
Sbjct  169  AKMLSYSMIYFILADNPELSVRRALRLSIAVTKGSRENILVMALSFYGWL----------  218

Query  286  PYVGEAANLAFSLLLTPFSFLYY  308
              +G        L + P+  +  
Sbjct  219  -LLGFLTLGIGFLWIGPYMTMAM  240


>WP_089343722.1 zinc-ribbon domain-containing protein [Paracoccus seriniphilus]SNT72942.1 
MJ0042 family finger-like domain-containing protein 
[Paracoccus seriniphilus]
Length=289

 Score = 46.7 bits (107),  Expect = 0.013, Method: Composition-based stats.
 Identities = 10/44 (23%), Positives = 14/44 (32%), Gaps = 0/44 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
            ++ CP C AE   P   +P       C  C +         Q 
Sbjct  4   ISLICPGCAAEYRIPPDAIPEGGRDVECTACNRIWFVPGPYQQP  47


>HIJ62604.1 thioredoxin [Rhodospirillaceae bacterium]
Length=76

 Score = 42.8 bits (97),  Expect = 0.013, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 12/37 (32%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V CP C    N   + L  +    RC  C     
Sbjct  1   MI-VTCPSCETHFNLAPAALGPEGRVMRCARCGHKWH  36


>WP_002772529.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Leptonema illini]EHQ06838.1 Glycerophosphoryl 
diester phosphodiesterase, membrane domain-containing protein 
[Leptonema illini DSM 21528]
Length=303

 Score = 46.7 bits (107),  Expect = 0.013, Method: Composition-based stats.
 Identities = 19/203 (9%), Positives = 47/203 (23%), Gaps = 0/203 (0%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W +     + +       I        +++      + +  +       L      I + 
Sbjct  78   WIILTIMVFPVFFASYTVIGYRLIRREASVGDLFLGFRSYGSTLVTAISLFILYYLITIP  137

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
             S            T+  +                +     +             L F  
Sbjct  138  FSLPQMLYLFEGFPTENVMENMSSYMAGLKAKQEQIYDAENIWRVFLGYAGYPFALYFWG  197

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
                   ++     G  QA + S  L   H   +     L+ ++ +    + A    +  
Sbjct  198  RVQMVLPLVILRESGIGQAFKSSWQLTGPHHLKLALYLFLITIVMIIGFAILAFGIALSA  257

Query  291  AANLAFSLLLTPFSFLYYYLIYS  313
                   +L+    FL  ++   
Sbjct  258  LTGSLAIILIPACVFLLLFVFAY  280


>NOY87574.1 DUF3426 domain-containing protein [Deltaproteobacteria bacterium]
Length=359

 Score = 47.1 bits (108),  Expect = 0.014, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+C  C  +      K+  K    RC +C    
Sbjct  1   MI-VQCDKCRTKYRVADEKVTGKGVRVRCAKCEHIF  35


>KAF7127327.1 hypothetical protein RHSIM_Rhsim11G0075400 [Rhododendron simsii]
Length=508

 Score = 47.5 bits (109),  Expect = 0.014, Method: Composition-based stats.
 Identities = 29/226 (13%), Positives = 70/226 (31%), Gaps = 11/226 (5%)

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
               F   F  L      +        +   +   ++ +      +  +  ++     +  
Sbjct  270  YSYFFLGFYLLSTSAVVYTVASIYTGRDVTIGIVMSVVFKIEKRLMVTFALFFLLLSLYT  329

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQA  249
               + +    V   T+   + ++ +    +L  +  +   + +     +   +++ GL+A
Sbjct  330  VVLVAVLALVVWCLTIEYKVGLVFLVVIVILCTVGFVYTTLVWQLASIISVLEDMKGLKA  389

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT-------ARIPYVG----EAANLAFSL  298
            + KSR L+ G  W     F +L    + +  L             VG        L F  
Sbjct  390  MTKSRNLIKGKMWIAVVIFSMLNFTMVGIGILFESQVVHTGSFGVVGRLGFGFLCLLFLS  449

Query  299  LLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLI  344
             +  F F+   +IY   K+ +            L +  A    ++ 
Sbjct  450  KVFLFGFVIQTVIYFVCKSYHHENIDKSSLSDHLEVYLAENVPLMA  495


>TDJ02067.1 hypothetical protein E2O73_03155 [Deltaproteobacteria bacterium]TDJ09684.1 
hypothetical protein E2O71_01510 [Deltaproteobacteria 
bacterium]
Length=299

 Score = 46.7 bits (107),  Expect = 0.014, Method: Composition-based stats.
 Identities = 10/40 (25%), Positives = 17/40 (43%), Gaps = 0/40 (0%)

Query  5   RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           RCP C +       K+  + +  RC +C      +PA+  
Sbjct  8   RCPQCESRYRVAPEKVGPQGARLRCTKCGNVFRVEPAQMD  47


>NOU32819.1 hypothetical protein [Polyangiaceae bacterium]
Length=302

 Score = 46.7 bits (107),  Expect = 0.014, Method: Composition-based stats.
 Identities = 26/209 (12%), Positives = 54/209 (26%), Gaps = 31/209 (15%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            +Y L I +         + +  +  + +         +            ++      + 
Sbjct  48   LYTLAIFVPSYLPTVLAMARVFSISSVEFHVTSLVFEVIATLSSTYLFLGISRFGLASVK  107

Query  184  KTDVGLFRSM--KLGLRHVGSFTLLLILLILVVGG------------------GSLLLII  223
                G+      +   R    + LL  L  L                       + L ++
Sbjct  108  GETPGIGHLFGLRGLGRMFVLYMLLNTLSFLGTTVSIVAAAIDMTELHVLAGALAFLSLV  167

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
               +   +       + D ++  +QA+  S  +  G+   IF              FL  
Sbjct  168  VLAVAWPFISMSPLFILDKDLSIVQAIRASLDVTRGNRLNIF-----------VAGFLAG  216

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             I   G  A         P   L + +IY
Sbjct  217  LIMIAGMFACGIGLFATMPLGTLVFVVIY  245


>HAH08606.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=354

 Score = 47.1 bits (108),  Expect = 0.014, Method: Composition-based stats.
 Identities = 10/40 (25%), Positives = 15/40 (38%), Gaps = 1/40 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M  V CP+C       ++ +       RC  C  T   +P
Sbjct  1   MI-VSCPNCSTRYVLDAAMIRPPGRHVRCARCQLTWFQEP  39


>WP_058273742.1 hypothetical protein [Ruegeria atlantica]CUH43713.1 hypothetical 
protein RUM4293_02608 [Ruegeria atlantica]
Length=101

 Score = 43.6 bits (99),  Expect = 0.014, Method: Composition-based stats.
 Identities = 17/97 (18%), Positives = 33/97 (34%), Gaps = 8/97 (8%)

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE--------AANL  294
             + G   L +S  L  G+ W IFG  +L+ + +  ++FL      +              
Sbjct  4    RVSGFGGLGRSAALTKGYRWPIFGALLLVGICAGIVNFLAGVFAGILAGVGSWAKIVGFS  63

Query  295  AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQW  331
              + L    + +   LI + L+    G         +
Sbjct  64   IIAALSEGLTGISIALISARLREIKEGVIVDQTASVF  100


>KIV61629.1 hypothetical protein SZ55_4949 [Pseudomonas sp. FeS53a]
Length=138

 Score = 44.4 bits (101),  Expect = 0.014, Method: Composition-based stats.
 Identities = 29/148 (20%), Positives = 56/148 (38%), Gaps = 11/148 (7%)

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
               + +         F  +           +L  L+ +++  G LLL+IPG+  C+ +  
Sbjct  1    MYLLAMRHVTGQPVNFNQIFSQFGKFIPLAILNALIPVLIYLGLLLLVIPGIYLCIAYML  60

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
               ++ +  +   QA+E SR  +S  W+  FG F LL +I +           +      
Sbjct  61   AMPLVVERGLSPWQAMEASRRAISQRWFKCFGLFALLGLIVM-----------LSALPLF  109

Query  295  AFSLLLTPFSFLYYYLIYSDLKANYRGP  322
               +   P  F+   L+Y  +    + P
Sbjct  110  IGLVWTLPMCFVVVALLYQRIFGIQQFP  137


>HIN76518.1 hypothetical protein [Rhodospirillales bacterium]
Length=284

 Score = 46.7 bits (107),  Expect = 0.014, Method: Composition-based stats.
 Identities = 11/86 (13%), Positives = 22/86 (26%), Gaps = 1/86 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP C          +       RC +C +     P   +R Q ++     P    
Sbjct  1   MF-ISCPKCSTSYTIDGRDIDISGRGVRCFKCGEAWHQYPEPVERAQISEPRLKVPKPET  59

Query  61  QRRIPSDRLEIQSKTVNCRRCNRSFC  86
           + +     +   +             
Sbjct  60  ELKQGIAEVPSSNPQYYGPGYPPPSM  85


>WP_162175824.1 zinc-ribbon domain-containing protein [Dongia sp. URHE0060]
Length=243

 Score = 46.3 bits (106),  Expect = 0.014, Method: Composition-based stats.
 Identities = 9/38 (24%), Positives = 14/38 (37%), Gaps = 0/38 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
             V CP C    +  +S L     + RC +C      +
Sbjct  6   VIVSCPACATRFSLDASLLGPNGRNVRCAKCAHRWRQE  43


>OGP60032.1 hypothetical protein A2V67_08745 [Deltaproteobacteria bacterium 
RBG_13_61_14]
Length=693

 Score = 47.5 bits (109),  Expect = 0.014, Method: Composition-based stats.
 Identities = 10/65 (15%), Positives = 15/65 (23%), Gaps = 1/65 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  ++C  C  +      K+  K +   CP C    I                       
Sbjct  1   MI-IQCNKCQKQYKVNPEKVTEKGTKITCPSCGHQFIVRRKAEAPPPEPKVKTPPCRVCG  59

Query  61  QRRIP  65
           Q    
Sbjct  60  QPSTH  64


>PAV71546.1 hypothetical protein WR25_17868 [Diploscapter pachys]
Length=190

 Score = 45.5 bits (104),  Expect = 0.015, Method: Composition-based stats.
 Identities = 18/125 (14%), Positives = 48/125 (38%), Gaps = 1/125 (1%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
            +V    +          + + +    +  ++K        + L ++++ +  G G  LL+
Sbjct  15   SVIAYAITYFGTALLYCLLLDRNRPTVGEALKRTAVMFPRYLLAMVVVSIPSGAGMYLLL  74

Query  223  IPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            +PGL     F     +L  +  +G + A+ +S  L     +++ G  V + + ++     
Sbjct  75   LPGLWLMSRFMLAGPILFAEAPVGAMAAVGRSWRLTRKAQFSLLGAIVTVYLGAVLAGQP  134

Query  282  TARIP  286
               + 
Sbjct  135  FMLVA  139


>NJL08224.1 hypothetical protein [Methylacidiphilales bacterium]
Length=111

 Score = 44.0 bits (100),  Expect = 0.015, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V CP C +    P + +     + RC  C    
Sbjct  1   MLLV-CPSCSSAFRIPPAAITVAGRAVRCSHCQHVW  35


>VVB56624.1 Uncharacterised protein [uncultured archaeon]
Length=266

 Score = 46.3 bits (106),  Expect = 0.015, Method: Composition-based stats.
 Identities = 31/220 (14%), Positives = 66/220 (30%), Gaps = 26/220 (12%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
             W +   +L G+++    +    LL P    N        A+ L          + +   
Sbjct  25   PWMIALPFLSGLLMLALIVSFIFLLLPVADSNVATWGAVLALYLIGYFLFYFTQAMIVFG  84

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL---------------VVGGGSLLLI  222
                    D    +     + H  +  +L  +                      G +L  
Sbjct  85   AKERFAGRDPTFGQCFSSAMAHAPTLLILAAIGATIGLLMQVIGRDKNGRPTLFGQILSS  144

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            + G+ + +  +F   V+ +++ G L + ++S  LV+  W       + L +++L    L 
Sbjct  145  LIGVAWTLVSYFSLPVILNEDKGVLDSFKRSVELVTKAWGEGMSANLSLSLLTLPGIVLM  204

Query  283  ARIPYV-----------GEAANLAFSLLLTPFSFLYYYLI  311
                +V              A  A S+          Y+ 
Sbjct  205  GLGLFVGFLPLALLGLLAFVAGYALSIPAKAVISQALYVY  244


>PIR04431.1 hypothetical protein COV59_01120 [Candidatus Magasanikbacteria 
bacterium CG11_big_fil_rev_8_21_14_0_20_39_34]
Length=354

 Score = 46.7 bits (107),  Expect = 0.015, Method: Composition-based stats.
 Identities = 28/243 (12%), Positives = 70/243 (29%), Gaps = 10/243 (4%)

Query  59   GLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRG  118
                 I        +  +       +F              L +    +  + +L+    
Sbjct  54   YGSFAINWFYARFLNVPILPHVLASTFSPIGFFYLMGYSHELPNTWSFIKGTIDLYRHNM  113

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
               L   L+  + +      +  ++               I L     ++L    ++ + 
Sbjct  114  KIFLEYSLVFFIFSVVVTSLSTYIEVWENHIGNTSGILLGIFLLFRLLVILFGFVLSIAF  173

Query  179  FIYICKTDVGLFRS-----MKLGLRHVGSFTLLLILLIL-----VVGGGSLLLIIPGLLF  228
               +    +G+        +        S   + ++L++      +      + I   + 
Sbjct  174  LRVVAARIIGVAPHHMSSEIFDAFSLFWSCVFVTLVLMVLLFVLNITQAFFPIFIFLAIL  233

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
             +WF F  + +  +NI G  A   S+ LV G  W++    V L  +   +++    I   
Sbjct  234  IIWFIFSIHAVVIENIRGFSAFGYSKKLVRGRAWSVLWILVALTGLFAFIAYFIQGILVA  293

Query  289  GEA  291
               
Sbjct  294  PFF  296


>WP_172187921.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Lentilactobacillus kribbianus]
Length=601

 Score = 47.1 bits (108),  Expect = 0.015, Method: Composition-based stats.
 Identities = 18/167 (11%), Positives = 46/167 (28%), Gaps = 4/167 (2%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            ++ + ++    AF  +    +    +    Q      A L        +        +  
Sbjct  80   IVILIVIYWQFAFILLSVRNIQHGKSETLMQVLRQTVASLKVASPLTFVFFLGYFIIILP  139

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
            +          +       +  F L   L    +    L++        +  F     + 
Sbjct  140  FGSLFLRTPLLNKVKIPGFLMDFFLQNPLYTAGLVLFYLVIG----YIGIRLFLTLPFMI  195

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
              +     A++ S     G  W     F+L+  +S  ++ +   + Y
Sbjct  196  LSHTSAKTAIKLSWQKTRGRLWFYLASFLLIAGVSSMVTAVVYALIY  242


>NLI33605.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=73

 Score = 42.8 bits (97),  Expect = 0.015, Method: Composition-based stats.
 Identities = 12/57 (21%), Positives = 18/57 (32%), Gaps = 1/57 (2%)

Query  3   TVRCPHCGAERNTPSSKLPAK-KSSARCPECCQTLIFDPAESQRTQTTDNIATCPHC  58
             +CP C A    P  K+P        CP+C   +     E    ++       P  
Sbjct  2   KFQCPGCQAGFVVPDEKIPTGRGVRILCPKCKFPIEVKEIEPDPQESGRAHGPAPSN  58


>NBC58701.1 hypothetical protein [Bacteroidetes bacterium]
Length=123

 Score = 44.0 bits (100),  Expect = 0.015, Method: Composition-based stats.
 Identities = 11/61 (18%), Positives = 22/61 (36%), Gaps = 0/61 (0%)

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
             +N    +A  K   L+  +WW  F  F++  ++   L F+     +V        +   
Sbjct  1    MENESISEAFTKCFTLIKNNWWITFATFLVFGILIAILGFIFQLPAFVFSMVEGFTAFEQ  60

Query  301  T  301
             
Sbjct  61   A  61


>WP_088259215.1 DUF3566 domain-containing protein [Fimbriiglobus ruber]OWK36162.1 
hypothetical protein FRUB_08725 [Fimbriiglobus ruber]
Length=261

 Score = 46.3 bits (106),  Expect = 0.015, Method: Composition-based stats.
 Identities = 14/86 (16%), Positives = 20/86 (23%), Gaps = 3/86 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    CP CGA      SK      S +CP+C          +Q               +
Sbjct  1   MIRFPCPGCGATFTVDDSK---GGRSGKCPKCQGPFTIPMPGAQAPAPAPAADPNEPVEI  57

Query  61  QRRIPSDRLEIQSKTVNCRRCNRSFC  86
                       + +          C
Sbjct  58  APCPGCQMRLTVAASNLGMNVECPGC  83


>NQV33552.1 hypothetical protein [Phycisphaeraceae bacterium]
Length=486

 Score = 47.1 bits (108),  Expect = 0.015, Method: Composition-based stats.
 Identities = 8/72 (11%), Positives = 18/72 (25%), Gaps = 3/72 (4%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    CP C    +   +         +CP+C   ++     +      ++         
Sbjct  1   MIRFACPSCHKSIHVDDN---HAGKKGKCPKCGHAVVVPEQSTLIEFDCEDCGHTIKVPG  57

Query  61  QRRIPSDRLEIQ  72
                + R    
Sbjct  58  DYAGKTGRCPKC  69


>OBQ27600.1 hypothetical protein AN483_19895, partial [Aphanizomenon flos-aquae 
MDT14a]
Length=218

 Score = 45.9 bits (105),  Expect = 0.015, Method: Composition-based stats.
 Identities = 17/164 (10%), Positives = 58/164 (35%), Gaps = 5/164 (3%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
               +          +  L   LAF  + +      +      ++ W++ + +  +  I +
Sbjct  44   WLLVPIYGWVKFYALSALISRLAFGELVNQPESVSSGKRFVNSRLWEFFVNMILMLAISI  103

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
            G+      +   +      L   ++          +L+ L+++++    +L      +  
Sbjct  104  GIMLGFVLIGSLLIGIPTVLLGGLQDANPANTGIIILITLVVMIITFSGILW-----IGT  158

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
             ++     +  +D++ G  ++ +S  L  G+ + I    ++  +
Sbjct  159  RFYLVDVPLAIEDDVNGSSSINRSWELTKGNIFRILLISLIGFL  202


>XP_001430535.1 hypothetical protein [Paramecium tetraurelia strain d4-2]CAK63137.1 
unnamed protein product [Paramecium tetraurelia]
Length=432

 Score = 47.1 bits (108),  Expect = 0.015, Method: Composition-based stats.
 Identities = 27/271 (10%), Positives = 64/271 (24%), Gaps = 19/271 (7%)

Query  4    VRCPHCGAERN---TPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            ++CPHC                     +C +  +  +        T   +++        
Sbjct  159  IKCPHCDIPIRDIDVLEHSQELYTKYIKCHQNLEIAMNPNKAWCPTINCNSVIEFKQLST  218

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSW--------E  112
                   ++E+  +        +S     ++          +                  
Sbjct  219  VATCAYCQIEVCKRCKQRAHPLQSCEENLQQVLNEWQENRDTQQCPRCKIIVEKINGCNH  278

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
            + C+               +           PA       Q     + L    +I   L 
Sbjct  279  MTCQFCQHEWCWICGSDYTSIHYAIFNPFGCPALMPGWIRQKDWSYVKLIIWRFICFILL  338

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
             +   M + +    + + + ++       +  +   LLI     G  L+ I  ++F V  
Sbjct  339  IILTPMVVLVIAPILCIAKLIQTRFYRDKNCWIQACLLIFAFIIGIALIPIAIIVFAVAL  398

Query  233  FF----CQYVLADDNIGGL----QALEKSRL  255
                        D+          AL +   
Sbjct  399  VPSIIGIIVFYYDERKRLEFRHQTALSRHFQ  429


>HHO51860.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=546

 Score = 47.1 bits (108),  Expect = 0.015, Method: Composition-based stats.
 Identities = 26/204 (13%), Positives = 58/204 (28%), Gaps = 3/204 (1%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            W    + +   ++  A    +                   +L+               + 
Sbjct  312  WAPSLLVIPAALITVAFAMVSSGQASGMQPFIPALCQLAWVLVWASFGGAAAAIVWVRAG  371

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
                    V   + +    R             + +      +I+PG+ + + + F   V
Sbjct  372  DAASRGEPVDAGQILAEVPRRTLEIAAPHG-ARIQIVSIGFQVILPGIFYALQYAFVDMV  430

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
               D      AL +S  L  G    +F  F++  +++   S   A +   G+ +     +
Sbjct  431  AVLDPQR--SALRRSGQLTWGMRGRLFRMFLIYWLVTGFASLGLAMLIDGGQLSQRFTEM  488

Query  299  LLTPFSFLYYYLIYSDLKANYRGP  322
            L+ P  F    L+  +L       
Sbjct  489  LMNPTGFSLPVLVAQELMWALGTW  512


>RLB29263.1 hypothetical protein DRG66_02270, partial [Deltaproteobacteria 
bacterium]
Length=103

 Score = 43.6 bits (99),  Expect = 0.015, Method: Composition-based stats.
 Identities = 10/63 (16%), Positives = 17/63 (27%), Gaps = 1/63 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + C  C  + N     L    S  RC  C       P + +     ++        +
Sbjct  1   MI-IECERCHTKFNLDEKLLKETGSKVRCSICKHIFTAFPPKPEIEIEENSTERLEKQEI  59

Query  61  QRR  63
              
Sbjct  60  VPP  62


>NOY64397.1 hypothetical protein [Nitrospirae bacterium]
Length=49

 Score = 42.1 bits (95),  Expect = 0.015, Method: Composition-based stats.
 Identities = 9/37 (24%), Positives = 16/37 (43%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  + CP C  +      K+    +  +CP+C   L+
Sbjct  1   MIVI-CPKCRVKLKIADEKVSPGGTRFKCPKCATILM  36


>PVX25248.1 hypothetical protein CW691_05195 [Candidatus Bathyarchaeota archaeon]
Length=212

 Score = 45.9 bits (105),  Expect = 0.015, Method: Composition-based stats.
 Identities = 27/188 (14%), Positives = 62/188 (33%), Gaps = 9/188 (5%)

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
             L +      + +  LL      N  N    WA          +    +       I   
Sbjct  19   NLLLFAPPLVLLAIQLLFQFLVYNFPNIWLVWAGRFIVGLIGFIAYCIVVDMTNDAINGQ  78

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG  245
             + L +S+   +  +    L+ I+ +L      L+ +   L               +   
Sbjct  79   PLNLNKSLNAIMGRLAELILVAIVTVLCALTILLIPLALFLRTITV---------VEKTD  129

Query  246  GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSF  305
              Q + KS   V  +   +    ++++++++ +SF  + IP+VG   +   ++L+     
Sbjct  130  TNQTISKSVDFVRNNLGEVVFFAIIVIIVAVFISFGFSLIPFVGAYLSDFLNMLVNVVFT  189

Query  306  LYYYLIYS  313
                 +Y 
Sbjct  190  TASVHLYF  197


>WP_141332527.1 zinc-ribbon domain-containing protein, partial [Myxococcus sp. 
AB025B]
Length=693

 Score = 47.5 bits (109),  Expect = 0.015, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+C  C      P  K+  K    RC +C  T 
Sbjct  1   MI-VKCARCQTRFKIPDEKVTEKGVKVRCTKCQNTF  35


>WP_194869038.1 zinc-ribbon domain-containing protein, partial [Myxococcus sp. 
AB025B]
Length=134

 Score = 44.4 bits (101),  Expect = 0.016, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 12/34 (35%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP C  +       LP   +S +C  C    
Sbjct  2   QIACPQCSMQYALDPRLLPPGGASVQCTRCGHVF  35


>WP_126401825.1 zinc-ribbon domain-containing protein [Blastochloris tepida]BBF94769.1 
thioredoxin [Blastochloris tepida]
Length=363

 Score = 46.7 bits (107),  Expect = 0.016, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V CP C +    P + + A   + RC  C    
Sbjct  1   MLLV-CPSCTSAFRIPPAAITAAGRTVRCSHCQHVW  35


>RKZ30952.1 hypothetical protein DRQ36_03750 [bacterium]
Length=439

 Score = 47.1 bits (108),  Expect = 0.016, Method: Composition-based stats.
 Identities = 12/40 (30%), Positives = 15/40 (38%), Gaps = 0/40 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
            + CP+CG+    P  KLP K    RC  C          
Sbjct  2   KITCPNCGSSSQIPDEKLPEKPVQIRCKSCGTAFTIQRPP  41


>HCF59164.1 hypothetical protein [Myxococcales bacterium]
Length=1003

 Score = 47.5 bits (109),  Expect = 0.016, Method: Composition-based stats.
 Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            V C  CGA       ++PAK + A+CP+C    
Sbjct  2   RVTCEKCGAAYAVDEKRIPAKGARAQCPKCKGVQ  35


>THU71494.1 hypothetical protein C4D60_Mb04t02030 [Musa balbisiana]
Length=307

 Score = 46.7 bits (107),  Expect = 0.016, Method: Composition-based stats.
 Identities = 31/225 (14%), Positives = 55/225 (24%), Gaps = 14/225 (6%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
            +       F        G       L    IF  +                 A       
Sbjct  60   MPFPMATGFDDYLDFEFGATHDIKELLAITIFVLVEHTLYLATITVTIYAVSAASSTAAV  119

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
              L     M      +I +  V +       L    +  + + +  +V+G  S+L     
Sbjct  120  THLTLSGVMRSLKACFITRLWVTMLELAFNLLLASVNMMMGIWIFGIVLGAVSVLQNYLR  179

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL-------LLVISLTL  278
                V +     V   ++  G+ AL ++  LV G+W   +   V          V     
Sbjct  180  ----VVWVLALVVSVVEDYSGVAALRRALQLVRGNWAQTWLLSVFAFQTGIHFFVARNLA  235

Query  279  SFLTA---RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
            S L      +                  +     + Y + K   R
Sbjct  236  SSLVRRRPIVGVAMSVFLAVGVAFSVMLAVAAEVVFYGECKKKRR  280


>WP_011985291.1 zinc-ribbon domain-containing protein [Anaeromyxobacter sp. Fw109-5]ABS25185.1 
MJ0042 family finger-like protein [Anaeromyxobacter 
sp. Fw109-5]
Length=588

 Score = 47.1 bits (108),  Expect = 0.016, Method: Composition-based stats.
 Identities = 15/73 (21%), Positives = 22/73 (30%), Gaps = 1/73 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V C  C A+      K+  + + ARC  C    +  P        +   A  P    
Sbjct  1   MI-VACTSCRAKFRIADEKIGPRGAKARCSRCQTVFVVHPELGAVPPPSPGPAAPPARDS  59

Query  61  QRRIPSDRLEIQS  73
           +   P  R     
Sbjct  60  RPGSPQPRRPEPR  72


>WP_021693655.1 zinc-ribbon domain-containing protein [Limimaricola cinnabarinus]GAD55551.1 
hypothetical protein MBELCI_1603 [Limimaricola 
cinnabarinus LL-001]
Length=534

 Score = 47.1 bits (108),  Expect = 0.016, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 15/34 (44%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP+CGA+       +P +    +C  C  T 
Sbjct  2   RLTCPNCGAQYEVARDAVPTEGRDVQCSNCGLTW  35


>WP_068041938.1 MULTISPECIES: zinc-ribbon domain-containing protein [unclassified 
Rickettsia]ASX28565.1 hypothetical protein BA173_04950 
[Rickettsia sp. MEAM1 (Bemisia tabaci)]ODA36883.1 hypothetical 
protein A8V33_03745 [Rickettsia sp. wb]ODA38549.1 hypothetical 
protein A8V34_03970 [Rickettsia sp. wq]
Length=213

 Score = 45.5 bits (104),  Expect = 0.016, Method: Composition-based stats.
 Identities = 7/98 (7%), Positives = 21/98 (21%), Gaps = 0/98 (0%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            + CP+C  +    ++++       +C +C          +    +  +           +
Sbjct  3    ISCPNCQTKFIVSNNQIGINGRRVKCSKCRHIWYQKLDYNTSKLSDFDEDKFEAVKTPIK  62

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLR  101
                     +  V                   S     
Sbjct  63   NGYGNQFSANVPVILPYIPPKPKYNVFPFLWTSFIIFC  100


>PIN85665.1 hypothetical protein COV47_01090 [Candidatus Diapherotrites archaeon 
CG11_big_fil_rev_8_21_14_0_20_37_9]
Length=269

 Score = 46.3 bits (106),  Expect = 0.016, Method: Composition-based stats.
 Identities = 19/156 (12%), Positives = 50/156 (32%), Gaps = 12/156 (8%)

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             +   I K  +  F  + + L        +  + +  +      +I   L   +   F  
Sbjct  105  HIANTIKKRIIPFFAYLVITLLFYSIIITIPTVPLYFLFQTIGAIIGIILGAIIVLAFIP  164

Query  237  YVLA------DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF------LTAR  284
             ++         +   ++++     L   ++        ++++  L +S         + 
Sbjct  165  IIIMIPPIIALRDDTIIESIRNGIRLGLHNYTYNLVAIAIMIIAGLVISIPSMIASFLSI  224

Query  285  IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
            IP +G   ++  S+L  PF+     L+       Y 
Sbjct  225  IPGIGVFLSIIGSILTIPFTIYATILLTLYTLKIYE  260


>WP_173743586.1 DUF975 family protein [Blautia wexlerae]NSF74130.1 DUF975 family 
protein [Blautia wexlerae]
Length=340

 Score = 46.7 bits (107),  Expect = 0.016, Method: Composition-based stats.
 Identities = 27/303 (9%), Positives = 86/303 (28%), Gaps = 21/303 (7%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                 ++ I +  I      ++  +       LN     ++        A  +L    + 
Sbjct  56   YVFSFIISILINVISAGMLYMYLNIARNKEFSLNDLFYFFKKYPDRVITATFVLAFINVL  115

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
              +   I    +         +  +    +   +L+++      ++ IP  +        
Sbjct  116  TMLPYTIYGYSLPTD---ISDMNLLMEIAVKSGVLMIIGTVVYEIITIPLEMTYYILADN  172

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
                      G+ AL++S  ++ G++W  F   +  +            + ++       
Sbjct  173  PQ------TKGMDALKESIEMMRGNFWRYFLLKLSFV-----------PLMFLSVFTFYI  215

Query  296  FSLLLTPFSFLYYYLIYSDLKAN-YRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSR  354
              L + P+  +   + Y DLK    R  +  P    +               ++     +
Sbjct  216  ALLWILPYMTMTETMFYRDLKGELKRSGETEPPTYGYQSEPYFKSVDTEAEAIVNPEADQ  275

Query  355  QNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGG  414
             + + +  L   +   +   +   ++ +  ++  E  +     + +           +  
Sbjct  276  DDRAEKADLWNIQPDSESRESADNESTEQPQTAEEIQEEKKPQEEQPEEEDNEPKPWDEY  335

Query  415  LSL  417
             + 
Sbjct  336  FNR  338


>RLA85112.1 hypothetical protein DRG31_03810 [Deltaproteobacteria bacterium]
Length=209

 Score = 45.5 bits (104),  Expect = 0.016, Method: Composition-based stats.
 Identities = 12/44 (27%), Positives = 16/44 (36%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  V C  CG +     + L  +    RC  C  T    P E +
Sbjct  1   MI-VECGACGTKYRFDETLLRPEGVRVRCSRCGFTWTLYPEERE  43


>WP_027308803.1 hypothetical protein [Caloramator sp. ALD01]
Length=257

 Score = 46.3 bits (106),  Expect = 0.016, Method: Composition-based stats.
 Identities = 23/120 (19%), Positives = 46/120 (38%), Gaps = 0/120 (0%)

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
            L   + L +  +GS  +       V   G L+ ++      +       +L  ++ G   
Sbjct  115  LIIYIFLSIFIIGSLYVASKGSAGVFIIGILIFVLILGFVAILVTPVIPILVVEDEGFTG  174

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYY  308
            A ++S      +++ I G FV  ++I L +  +   +  +    N   S LLT +    Y
Sbjct  175  AFKRSFEFGFSNFFNILGVFVFNMIIGLLVGIIFKNVKIIQTLINSYMSTLLTVYIINKY  234


>OYV07719.1 Uncharacterized protein CG444_193 [Methanosaeta sp. ASP1-2]
Length=170

 Score = 45.1 bits (103),  Expect = 0.016, Method: Composition-based stats.
 Identities = 18/118 (15%), Positives = 49/118 (42%), Gaps = 5/118 (4%)

Query  188  GLFRSMKLGLRHVGSFTLLLILL-----ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
             +   + + L +  +  L ++L        ++    L+ +   +   + + F  Y +AD 
Sbjct  9    TIIFYLIITLPYFTTIFLSMLLGKETSEDYILVPAMLISLAAAVYLHLKYQFFGYFIADR  68

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
              G ++AL++S  +  G    +   ++ + ++   +S + + +  V E       LL+
Sbjct  69   GSGPIEALKQSGRMTKGVLKNLLIFWIEMGLVIGFVSAIASIVSIVVEIPMTMILLLV  126


>HGO75033.1 STAS domain-containing protein [Phycisphaerae bacterium]
Length=299

 Score = 46.3 bits (106),  Expect = 0.017, Method: Composition-based stats.
 Identities = 8/29 (28%), Positives = 9/29 (31%), Gaps = 0/29 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARC  29
           M    C HC A  N    K        +C
Sbjct  56  MIPFTCTHCQARLNVSDDKAGKSGKCPKC  84


>PIP81684.1 hypothetical protein COR54_18990 [Elusimicrobia bacterium CG22_combo_CG10-13_8_21_14_all_63_91]PJB25180.1 
hypothetical protein 
CO113_09900 [Elusimicrobia bacterium CG_4_9_14_3_um_filter_62_55]
Length=276

 Score = 46.3 bits (106),  Expect = 0.017, Method: Composition-based stats.
 Identities = 34/218 (16%), Positives = 65/218 (30%), Gaps = 17/218 (8%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               I +L  V  +   F  +L+      +P   N   A+ +     +   L  +  +  I
Sbjct  41   FFSICVLTRVPPYLLRFLYILVFIDAADDPGPLNLSAAVAVILYEGLSWVLGALALAAAI  100

Query  181  YI--CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
                    + +  + + G   + +      L  + VG G   L+IPGL+    F F  + 
Sbjct  101  SAADQGRPLSVIGAFRAGFARLSAGLKTAALGGIFVGVGLAALLIPGLILLYQFSFAWFA  160

Query  239  LADDNIGGLQALEKSRLLVSGHWWA-IFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            +A + + G  AL+ SR LV  +    +    +   +           +          F 
Sbjct  161  VAVEGLEGKAALDSSRALVRAYPTRTLATLGLAAALSLGVAGAAMGSLNLALGFVYGLFD  220

Query  298  LL--------------LTPFSFLYYYLIYSDLKANYRG  321
            L                  F  +   +      A    
Sbjct  221  LPENSLATAFVFDLARRVVFQCVPVAVAVYWWVAYAEF  258


>WP_166820645.1 DUF4190 domain-containing protein [Rubinisphaera sp. JC658]
Length=195

 Score = 45.5 bits (104),  Expect = 0.017, Method: Composition-based stats.
 Identities = 27/196 (14%), Positives = 48/196 (24%), Gaps = 4/196 (2%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIA-TCPHCGL  60
              V C +CG     P     A    A+CPEC   ++      +  +  D           
Sbjct  3    IEVECRNCGRILRAPDD---AAGRRAKCPECSAVVLVHEDGDEFDEFFDTDFLPTEEPSG  59

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                  +      + V        +        R  G  LR                   
Sbjct  60   FHEDYEETPPRDRRDVRPCPVCGEYIKSRATRCRFCGESLRESRHARPGDRSEGWAIASL  119

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +LGI  L +           L+     +    +  Q     + +A   L    +  + ++
Sbjct  120  VLGIVSLVLFCLAPISVPLALMAVVFSVLEMVKASQEHRDRSGMAIAGLICGLIPLAFWM  179

Query  181  YICKTDVGLFRSMKLG  196
             I    +     M   
Sbjct  180  AILIAAMTDNARMFNF  195


>NBQ82794.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=202

 Score = 45.5 bits (104),  Expect = 0.017, Method: Composition-based stats.
 Identities = 6/34 (18%), Positives = 11/34 (32%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + C  C       ++ +P +    RC  C    
Sbjct  7   LLTCAKCETIFKIDAAAIPKEGRRVRCTICDHVW  40


>QIQ85562.1 hypothetical protein G9473_01845 [Erythrobacter sp.]
Length=290

 Score = 46.3 bits (106),  Expect = 0.017, Method: Composition-based stats.
 Identities = 25/176 (14%), Positives = 50/176 (28%), Gaps = 18/176 (10%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            + L  +  +    A   + + L                  +L   V   +  L      +
Sbjct  80   YVLFLLVTIAQQAAMTALATPLSQPGFGDALGAGFKSAPTLLATAVLLTVAALVAGVVWI  139

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
             +    +  G    +   L  +  F L L L   +     ++                  
Sbjct  140  VLAAILSLAGDAGILANVLAVLMVFPLALYLACRLAVLVPVVA-----------------  182

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
              D   G ++A+ +S  +  G    I   F+L  VI + L  +   + + G A   
Sbjct  183  -VDGERGPIRAIRRSWAITEGKVLGILVVFLLATVIMIVLGLVPFLLIFAGGAMAG  237


>GEU92936.1 hypothetical protein [Tanacetum cinerariifolium]
Length=357

 Score = 46.7 bits (107),  Expect = 0.017, Method: Composition-based stats.
 Identities = 28/264 (11%), Positives = 71/264 (27%), Gaps = 12/264 (5%)

Query  103  ISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLA  162
             +  L++       R      IY+L   +      S +          +  ++  +    
Sbjct  63   HTLHLSEHISTVDHRTVVYELIYILIAYMLTLCAISMITYSTHQSFLGKPVSFLTSFKSL  122

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG--GGSLL  220
             +++  +  + +  +  +++      +F    L       F      +  +       ++
Sbjct  123  ALSFFPIVSTAVVANALLFLILLSFLVFVGSLLMFGKTVGFVTDDNSVCFLSFSVIAGVV  182

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
              +  + F V           ++  G +AL +S  LV G         +   VI   L  
Sbjct  183  FFVVMMYFYVNLSLVYVASVTESKWGFEALTRSWYLVKGMRLVSLKLLLFYGVIDGLLVA  242

Query  281  L---------TARIPYVGEAANLAFSLLLTPF-SFLYYYLIYSDLKANYRGPQHPPIKRQ  330
            +               V      +  L+L    S +   ++Y+  KA +        +  
Sbjct  243  IYSYYLMKYGLGNWTSVFHIIYGSGFLILLMLQSSVAITVLYNYCKALHGELVIEVAEGF  302

Query  331  WLPLTAAIFGWMLIPGLLLVSLSR  354
                         +       +  
Sbjct  303  VCDYITLGDDGEKVGLGTYGDVGY  326


>MBI1209912.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=329

 Score = 46.7 bits (107),  Expect = 0.017, Method: Composition-based stats.
 Identities = 23/227 (10%), Positives = 62/227 (27%), Gaps = 4/227 (2%)

Query  78   CRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIF  137
              R +    +    +  +      +                  L+ I ++ + L    +F
Sbjct  44   YWRLSEMIEMMQHFDPGSMFKVPTTQPLASLGWPVSVALYVLQLISIAVVAVALHRVILF  103

Query  138  SALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGL  197
            +                + +A++  T   +++ +  +    F+ +         +     
Sbjct  104  NERKPGVWFSFPFGRTEFFYALMAVTALILVVVVGALLALPFVVVITPLGLTPEAYLQPQ  163

Query  198  RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLV  257
                    L            +++    +   +        L  +   G   L ++  L 
Sbjct  164  TWSMIAMSLAQKPHWFFVVALVVVYGGVIWLLLRLAAWPPTLVAEGGFG---LARAWSLT  220

Query  258  SGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
             G+     G F+L+ V    +  + A      +  ++   LL TP  
Sbjct  221  RGNALRYLGLFLLVFVFFAAIG-VAAYFFVAAQHISIMAWLLPTPVQ  266


>WP_161636928.1 hypothetical protein [Erysipelothrix rhusiopathiae]NBA01304.1 
hypothetical protein [Erysipelothrix rhusiopathiae]
Length=208

 Score = 45.5 bits (104),  Expect = 0.017, Method: Composition-based stats.
 Identities = 38/204 (19%), Positives = 76/204 (37%), Gaps = 12/204 (6%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            +    + +    +F  L+     +L+     +  A++L  +  + + L  +   +  +  
Sbjct  6    LLHNYLQIFIISVFILLIPFALGFLDFGVNPFISAVILILIGLVSMLLHGLVEMVVFFSR  65

Query  184  KTDVGLFRSMKLGLRHVGSFTLL---LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
              D  L   +   +     +       +L  L +  G +LL++PG+   +      +VL 
Sbjct  66   LEDAELNWDLVKQVAAYVDWGRALKFYLLSALAITIGLVLLLVPGIYIAIPLSILSFVLV  125

Query  241  D--DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE----AANL  294
            D  D+    +   K   +  G+   IFG  + L++I      L   I   G       + 
Sbjct  126  DFPDDHNLFE---KCFEISKGYRLRIFGFTLALVIIQFLFKALLDSIFGPGTAQGNILDH  182

Query  295  AFSLLLTPFSFLYYYLIYSDLKAN  318
            A SLLL P S  Y+  +Y +    
Sbjct  183  ALSLLLAPISVNYFTNLYLEATGR  206


>WP_176437060.1 zinc-ribbon domain-containing protein, partial [Myxococcus sp. 
AM011]NVJ26764.1 zinc-ribbon domain-containing protein [Myxococcus 
sp. AM011]
Length=500

 Score = 47.1 bits (108),  Expect = 0.017, Method: Composition-based stats.
 Identities = 10/38 (26%), Positives = 14/38 (37%), Gaps = 0/38 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
            V CP C    N    ++P   +  +C  C  T    P
Sbjct  2   KVSCPSCQTNYNIDDKRIPPGGAKLKCARCQTTFPIRP  39


>OHV87881.1 hypothetical protein ORS3428_03620 [Mesorhizobium sp. ORS 3428]
Length=284

 Score = 46.3 bits (106),  Expect = 0.017, Method: Composition-based stats.
 Identities = 28/222 (13%), Positives = 59/222 (27%), Gaps = 5/222 (2%)

Query  75   TVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFA  134
                 R     CL          S L   +  +                   +       
Sbjct  19   CAVIXRNFWLCCLLALIFRILPQSILSLCAWAIFVHGVPDHSVKGIAFSAVHIIGYPGLF  78

Query  135  PIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG---LFR  191
             +    L   A       +      L   +  +L        +  +      +G   +  
Sbjct  79   GMLHVALAFVAIEDWXGRRPTLRGCLGIALRRLLPATGIWLXAYELLRIGGAIGSHAIED  138

Query  192  SMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALE  251
                    +    +   LL  +     +   IPGL+    +F     L  +  G  ++L 
Sbjct  139  HAIRFYVRLLPSHIPFPLLAAIFAAIPIPAFIPGLVLWARWFVAIPTLISERSGIFRSLL  198

Query  252  KSRLLVSGHWWAIFGRFVLLLVISLTLSFLT--ARIPYVGEA  291
            +SR L  G  W + G ++ +   S+ +  ++    +P+    
Sbjct  199  RSRDLTRGSRWPLAGFWLGMYAGSVLVELVSRHQILPFGITL  240


>WP_049757760.1 zinc-ribbon domain-containing protein [Magnetococcus marinus]
Length=1209

 Score = 47.5 bits (109),  Expect = 0.017, Method: Composition-based stats.
 Identities = 12/54 (22%), Positives = 19/54 (35%), Gaps = 1/54 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIAT  54
           M  V C +C A  +     L +K    +C +C       P E +  Q   +   
Sbjct  1   MI-VTCENCDARFDVDPQLLGSKGRKLKCSQCHHIFFQAPPEPKSAQPPASEQP  53


>HCU59267.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=237

 Score = 45.9 bits (105),  Expect = 0.017, Method: Composition-based stats.
 Identities = 17/115 (15%), Positives = 34/115 (30%), Gaps = 0/115 (0%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             ++CP C        S +P      RC  C +     P ++   + T          + +
Sbjct  2    LIKCPKCQVVYELDDSLVPENGLKMRCNSCGEVFKAYPEDAVDDEQTAAQKKLNIINMFK  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRR  117
            R   ++ ++ +        NR   ++         S    +  LL      F   
Sbjct  62   RFAGEKEDLFTPDTPDVVQNRPPKVRIVHLTHYKNSINYLLILLLLVLMAAFMYF  116


>MBC7266751.1 hypothetical protein [Coriobacteriia bacterium]
Length=288

 Score = 46.3 bits (106),  Expect = 0.017, Method: Composition-based stats.
 Identities = 28/191 (15%), Positives = 55/191 (29%), Gaps = 28/191 (15%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
               +       S +  +    +          +K GL   G + L   L+ ++V   ++ 
Sbjct  85   WLLLLGRAYLDSAILAAYPQMLRGRVSDEKAFLKAGLSRWGWYLLTAYLVQMIVSLAAVA  144

Query  221  LIIPGLLFCVWFFFCQYVLA----DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
                     +  +    + A     +     QA+++S  LV  H+W   G   ++  +  
Sbjct  145  SFFALFAGGLAAWALLSLAATVTVLERANTAQAIQRSYALVKRHFWRTLGYLAIVTTLVA  204

Query  277  TLSFLTARIPYVGEA------------------------ANLAFSLLLTPFSFLYYYLIY  312
                  A    V +                            A   L+ P   L    +Y
Sbjct  205  LFEGALASPLIVRQIVVAAQNPDAVFQQTPLVWKIVEGLVLGAAMTLVAPIMPLALASLY  264

Query  313  SDLKANYRGPQ  323
            +DL+A   G  
Sbjct  265  ADLRARSEGMD  275


>WP_176423706.1 adventurous gliding motility protein GltJ, partial [Myxococcus 
sp. AM009]NVI99717.1 adventurous gliding motility protein 
GltJ [Myxococcus sp. AM009]
Length=363

 Score = 46.7 bits (107),  Expect = 0.017, Method: Composition-based stats.
 Identities = 8/32 (25%), Positives = 11/32 (34%), Gaps = 0/32 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
              C  C A+      K+  K    RC +C  
Sbjct  2   RFVCDSCRAQYMISDDKIGPKGVKVRCKKCGH  33


>NLK39116.1 DUF975 family protein [Clostridiales bacterium]
Length=303

 Score = 46.3 bits (106),  Expect = 0.017, Method: Composition-based stats.
 Identities = 19/181 (10%), Positives = 57/181 (31%), Gaps = 4/181 (2%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            + G+YLL  +      +   L    +     + +       +    I +   ++      
Sbjct  93   VFGLYLLLAMPVGVGFYLFALRSVESAGRAPSVSVILEPFSSAKLLIRIYRCFLAYLWRG  152

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +   ++ +  S+   +          +   +++     + +I      +       ++A
Sbjct  153  ALIIGEIAVGISIACRVNDYYLSWGYPVAGAVMMLCICAVTLIFAHFLALILCNVYPMMA  212

Query  241  D----DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
                  ++     ++ S+LL+ GH + +FG        +L  +F    +     A     
Sbjct  213  VAMKNRDLSIRGVVKISKLLMKGHRFELFGLTFSFGFWALLSAFTAGIVFAAYAAPYFVA  272

Query  297  S  297
            +
Sbjct  273  A  273


>WP_025028154.1 spore cortex biosynthesis protein YabQ [Bacillus mannanilyticus]
Length=198

 Score = 45.5 bits (104),  Expect = 0.017, Method: Composition-based stats.
 Identities = 21/196 (11%), Positives = 56/196 (29%), Gaps = 7/196 (4%)

Query  154  NWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH-VGSFTLLLILLIL  212
                 + +      +         +     +  + L + +   L + +  F  L  +   
Sbjct  6    QALTMLHMVGAGVYIGLAYETYSRLRWKKKQQLITLIQDLFFWLLNVICIFLWLHYVNQG  65

Query  213  VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
             +    +L ++ G           Y    +    ++++     LV      +F    ++ 
Sbjct  66   EMRIYVILSLLCGYAMYKALLQQIYRNILEK--IIRSVIYLYRLVV-RMIRLFIIQPIVW  122

Query  273  V---ISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
            +   I   + F+   +  + +        L  PF  LY+Y+    ++           K+
Sbjct  123  LYKLIMALIFFIFGLLTKMVQLLYSILLFLARPFIRLYHYVKKKMVQKRKEFQNKKVAKK  182

Query  330  QWLPLTAAIFGWMLIP  345
                      G  L+ 
Sbjct  183  TNTKKRNFKMGGELVV  198


>MBA2115874.1 hypothetical protein [Planctomycetes bacterium FF15]
Length=289

 Score = 46.3 bits (106),  Expect = 0.018, Method: Composition-based stats.
 Identities = 15/177 (8%), Positives = 42/177 (24%), Gaps = 0/177 (0%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  + CP CG   N     +       +C    Q +     +       +          
Sbjct  1    MIRITCPCCGVGINAEERLIGQTLRCPKCLSTTQVVRPKSDDDDLAPVIEPNYETRAYDE  60

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
             +  P+D  +        +   +      + E         ++++      ++       
Sbjct  61   DKPQPTDCPKCGKVIPAQQYLCKGCGWHVKLEAYFEDLTEEALAKDDEPKTKMEKWLEEQ  120

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
            L  +      L  + + +  +                  ++  +    L   W+   
Sbjct  121  LHDLVTPRDFLIASGLCAVFVGFIFVVAGRIFLGALGGTIVGLILAAGLAFGWIILM  177


>MBI1949005.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=280

 Score = 46.3 bits (106),  Expect = 0.018, Method: Composition-based stats.
 Identities = 11/32 (34%), Positives = 17/32 (53%), Gaps = 1/32 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPEC  32
           M  V CP C ++    +  +PA  +  RCP+C
Sbjct  1   MI-VTCPGCSSKYRVRNESVPADGARMRCPKC  31


>WP_152644732.1 hypothetical protein [Corynebacterium argentoratense]
Length=354

 Score = 46.7 bits (107),  Expect = 0.018, Method: Composition-based stats.
 Identities = 31/331 (9%), Positives = 71/331 (21%), Gaps = 43/331 (13%)

Query  14   NTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQS  73
            + P    PA    +                            P        P        
Sbjct  20   SAPQPSEPAGGFDSTPSYPAYPGDAAQQSGSNNSAPYPQPGQPASVTPPTAPQQSGAGAY  79

Query  74   KTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAF  133
              +                       +         ++  +      +  + +   V  F
Sbjct  80   GEIPAYGAPVYPNYNTAPAGARVFDSVGYGFNSFFANFGPWILFSLIMFAVQIPNGVYGF  139

Query  134  APIFSALLLKPATWLNPQNQNWQ--WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFR  191
               F+    +P   +   +        + + TV    +  +         +    + +  
Sbjct  140  VESFTRSYPEPGAEVTVADTMSVGGTIVQVITVLATFVITAIAALGAIKQVNGRKITIGE  199

Query  192  SMK-LGLRHVGSFTLLLILLI-----------------------------LVVGGGSLLL  221
                +    V    LL+ ++                               +  G + + 
Sbjct  200  MFSGVPFGRVIGLQLLVAVVQGLAMIPLAFGLIPLAFTADDDPNGALAGIAIFFGAAAVS  259

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
             I  LL    F    Y L D+N+   QA ++   +   ++    G            + +
Sbjct  260  AILYLLISPLFSLMVYALVDENLSVGQAFKRGFNVAKNNYLKTLG-----------FTVI  308

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
               +   G        LL  P + L     +
Sbjct  309  IGLLNSAGALLCGFGLLLSMPAAVLASAHFW  339


>PJC73259.1 hypothetical protein CO013_07260, partial [Syntrophobacterales 
bacterium CG_4_8_14_3_um_filter_58_8]
Length=126

 Score = 44.0 bits (100),  Expect = 0.018, Method: Composition-based stats.
 Identities = 6/39 (15%), Positives = 10/39 (26%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  ++C  C        + +       RC  C       
Sbjct  1   MI-IQCRKCETRFRFDDTLIEGDGVWVRCSRCQHVFFQQ  38


>MBE6357106.1 hypothetical protein [Lentisphaerae bacterium]
Length=189

 Score = 45.1 bits (103),  Expect = 0.018, Method: Composition-based stats.
 Identities = 14/90 (16%), Positives = 20/90 (22%), Gaps = 1/90 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    CP C          LP       C +C + +I    +        +     H   
Sbjct  1   MIEFSCPCCNERYQLDGDALPD-GMKFECGKCGKKVIHKGNKLIIFAFEVSRGEQSHIAA  59

Query  61  QRRIPSDRLEIQSKTVNCRRCNRSFCLQPE  90
                         T  C+  N S      
Sbjct  60  CPHCSIRYEFPYDSTGICQCPNCSGDFFIH  89


>MBJ22024.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=302

 Score = 46.3 bits (106),  Expect = 0.019, Method: Composition-based stats.
 Identities = 11/60 (18%), Positives = 17/60 (28%), Gaps = 0/60 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           V C  C        +++P   +  RC  C          + R  T  +IA          
Sbjct  10  VTCEECSTSFQLDEARIPVSGAQVRCSRCKHAFFLPNPSAGRPDTVHSIAAQAASAPDDP  69


>WP_045387631.1 zinc-ribbon domain-containing protein [Falsirhodobacter sp. alg1]
Length=191

 Score = 45.1 bits (103),  Expect = 0.019, Method: Composition-based stats.
 Identities = 10/35 (29%), Positives = 13/35 (37%), Gaps = 0/35 (0%)

Query  6   CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           CP CGA     +  +PA      C  C    +  P
Sbjct  5   CPRCGAHYEVSADAIPASGREVECSACGHEWVQYP  39


>NNF08013.1 hypothetical protein [Candidatus Eisenbacteria bacterium]
Length=174

 Score = 44.8 bits (102),  Expect = 0.019, Method: Composition-based stats.
 Identities = 14/44 (32%), Positives = 20/44 (45%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  VRCP C  +    SS++P      RCP+C        +E +
Sbjct  1   MI-VRCPDCLTKYEIASSRVPESGIKVRCPKCKAVFPVHKSEEE  43


>NNL00831.1 hypothetical protein [Eudoraea sp.]
Length=200

 Score = 45.5 bits (104),  Expect = 0.019, Method: Composition-based stats.
 Identities = 16/155 (10%), Positives = 52/155 (34%), Gaps = 3/155 (2%)

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
                   A+      + ++    +   +             ++   L +   + L+ +L 
Sbjct  34   FPWEGFGAVAFGLWVFYVISAFLILVLLNWAFNDFFANTVANLDKELLNSFVYGLIYLLG  93

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
            + ++   + +L++   +         + +   ++  + AL  +  L     W  +    +
Sbjct  94   MPLLIVLTFILVVGIPIGLFLLVLYIFSILFGHL--IAALLMAHYLNKDRNWNFWTIVFV  151

Query  271  LLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSF  305
             + I++ L  L   IP++G   ++    +     F
Sbjct  152  SIGIAIVLR-LLTLIPFLGGLVSIVVIAIAYGLFF  185


>OEU61243.1 hypothetical protein BA870_11650 [Desulfuromonadales bacterium 
C00003094]OEU74705.1 hypothetical protein BA869_06420 [Desulfuromonadales 
bacterium C00003107]
Length=384

 Score = 46.7 bits (107),  Expect = 0.019, Method: Composition-based stats.
 Identities = 12/44 (27%), Positives = 20/44 (45%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  ++CPHC A    P +K+    +  RC +C    +  P +  
Sbjct  1   MV-IQCPHCQACFKLPQNKVKPGGTKVRCTKCKIIFMVTPPQDD  43


>EHI60156.1 hypothetical protein HMPREF9473_01828 [ [Hungatella hathewayi 
WAL-18680]
Length=139

 Score = 44.4 bits (101),  Expect = 0.019, Method: Composition-based stats.
 Identities = 23/144 (16%), Positives = 39/144 (27%), Gaps = 11/144 (8%)

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            + G LQAL +SR ++ G+    FG  +  L I                       L + P
Sbjct  7    DKGILQALSESRQMMRGNRCRYFGLGLSFLGILALAYMSFG-----------IGMLWIVP  55

Query  303  FSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQL  362
            +        Y DLK      Q           T     +  +PG   V      +  +  
Sbjct  56   YLICTNVFFYLDLKPVVEVYQPQWEMAGMQGETFVEAEFTEVPGQAPVEPGYVEIPGQAP  115

Query  363  LSAGKDIQQRLGTQPQQTPDLNRS  386
                +           ++ +    
Sbjct  116  AEPEQPQSSAQPDDMYESYESQNW  139


>WP_109355172.1 hypothetical protein [Sphingorhabdus sp. EL138]
Length=175

 Score = 44.8 bits (102),  Expect = 0.019, Method: Composition-based stats.
 Identities = 19/113 (17%), Positives = 44/113 (39%), Gaps = 0/113 (0%)

Query  147  WLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL  206
                       ++ +      +L  + +  +    +   DV +       ++   S   L
Sbjct  63   DYWLAFMVSGISVFITYFILAVLLQAMLIVATIRDMRGQDVDIGLCFAEAMKRFLSLIGL  122

Query  207  LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
             IL +L +  G +L I+PG++  + +      +  +N+    +L++S  L SG
Sbjct  123  AILSLLGIILGLILFIVPGVILMLMWMVAVPAMVVENLSITDSLKRSAELASG  175


>WP_197430411.1 zinc-ribbon domain-containing protein, partial [Methylobacterium 
sp. CCH5-D2]
Length=37

 Score = 41.3 bits (93),  Expect = 0.019, Method: Composition-based stats.
 Identities = 8/35 (23%), Positives = 15/35 (43%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            + CP C +     + ++  +  S RC  C +T  
Sbjct  2   LITCPTCASSYRVETGRVGMEGRSVRCAACRETWF  36


>WP_086052668.1 zinc-ribbon domain-containing protein [Pseudoruegeria sp. SK021]OSP55646.1 
hypothetical protein BV911_05895 [Pseudoruegeria 
sp. SK021]
Length=429

 Score = 46.7 bits (107),  Expect = 0.019, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 17/36 (47%), Gaps = 0/36 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
            + CP+C A  + P++ +P +    +C  C  +   
Sbjct  2   RIICPNCTAAYDIPANAVPTQGREVQCSSCTHSWYQ  37


>MBI5846300.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=735

 Score = 47.1 bits (108),  Expect = 0.019, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 14/37 (38%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V CP+C    +   S +  K+   RC  C     
Sbjct  1   MI-VTCPNCSTGLSLDDSLIKGKQVKVRCANCRHVFP  36


>OQB87135.1 hypothetical protein BWX88_00545 [Planctomycetes bacterium ADurb.Bin126]
Length=434

 Score = 46.7 bits (107),  Expect = 0.019, Method: Composition-based stats.
 Identities = 42/341 (12%), Positives = 78/341 (23%), Gaps = 55/341 (16%)

Query  1    MPTVRCPH--CGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHC  58
            M  V+CP+  C  +     S         RCP C    +                     
Sbjct  1    MV-VKCPNPQCHIKLKIDDSL---AGREGRCPHCGTNFMIP------------ALGGADH  44

Query  59   GLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRG  118
               R          +                     A+                      
Sbjct  45   ASHRPERHREHPPAAAPAPPPPVAPPVAPPVAPPVAAAQPHRSPPPARAPRPALPSKALV  104

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
             G +      +       F               Q       L     + LGLS +    
Sbjct  105  DGTISSLAAALSPRKLAFFVIGWAVTMLLCFLLLQLDVCLTTLIIAVVLALGLSAVVAGG  164

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF----------  228
               +  T++GL  +    +R   SF L  +L+ L +   S ++    LL           
Sbjct  165  IARMAHTNLGLGDAFTYCMRKFFSFFLAGLLVPLALLIVSAIINGLMLLISSETSAGSYV  224

Query  229  -CVWFFFC-----------------QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
              V+F                       +  +  G  QA+ +    +      +  +  L
Sbjct  225  AAVFFGPQFLINLLLVVAGLVTVIVPCAIVVEETGVFQAVGRLLFCIRRQTGTVMTQVAL  284

Query  271  LLVISLTLSFLTARIPYV---------GEAANLAFSLLLTP  302
             +  +  ++ +   + +          G A +   S    P
Sbjct  285  GVFFAGMITMILTVLTFSALMPTASTNGGAVSGLGSSSFLP  325


>MBC5827089.1 hypothetical protein [Candidatus Eremiobacteraeota bacterium]
Length=313

 Score = 46.3 bits (106),  Expect = 0.019, Method: Composition-based stats.
 Identities = 19/179 (11%), Positives = 55/179 (31%), Gaps = 17/179 (9%)

Query  157  WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG  216
              ++L  +    +  + +  ++ I I    + +   +    R   +      L ++V   
Sbjct  121  HVLILIVMWLFAMIGAAIALTLVISIAAFAIAVATGLLAHARTFFAI-----LGVVVGVA  175

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL--LVSGHWWAIFGRFVLLLVI  274
                ++    +  + F F       +    ++A   +         +W      + L+ I
Sbjct  176  AVCAIVAAMTMLYLAFAFSFVSTVLEKTDPVRAFSSAFARIFSKHQFWRSAAVGIALVGI  235

Query  275  SLTLSFLTARI----------PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
             L +  +   +          P      +    L+ + F ++   + Y D++    G  
Sbjct  236  GLGVDIVFLGVGAYASYATKSPLAYFIVSGLAGLMFSAFGYIAVSVYYYDVRVRREGLD  294


>PYV23996.1 hypothetical protein DMG27_14115 [Acidobacteria bacterium]
Length=232

 Score = 45.5 bits (104),  Expect = 0.020, Method: Composition-based stats.
 Identities = 27/159 (17%), Positives = 52/159 (33%), Gaps = 9/159 (6%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL-  169
            + L+    W   GI +L  ++         +                A  +  + ++ + 
Sbjct  19   FSLYREHFWLFAGIMVLPQLVVLVVAVPLRVASRTQPNISTTTIPTHAASVIFLLFLFVT  78

Query  170  --------GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                     L+    ++         G+  +             + IL  L+VG G LL 
Sbjct  79   AFLAMHMTALAATVSAVSEMQLGRPTGVRLAYCFLRDKWWRVLWVAILNGLIVGLGFLLF  138

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGH  260
            +IPGL+  V           +N  G  A++++ LL   H
Sbjct  139  VIPGLIILVRTAVAIPAAVLENQRGWAAIKRADLLTERH  177


>OYT54045.1 hypothetical protein B6U72_04025 [Candidatus Altiarchaeales archaeon 
ex4484_2]
Length=287

 Score = 46.3 bits (106),  Expect = 0.020, Method: Composition-based stats.
 Identities = 23/238 (10%), Positives = 61/238 (26%), Gaps = 7/238 (3%)

Query  89   PEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWL  148
             +  F    +        +     L     + ++ +  +      A              
Sbjct  44   MQHFFDFPFNWDSYSINYIVGKGWLHISLLFLMVFLNTIIFCFIGAFFTLVFFHSIKKDG  103

Query  149  NPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLI  208
              ++ N                +  +   +   +    V L  S+   L          +
Sbjct  104  LAEDINSYLKWKKLAKFISEGYIKMLLLFIPFTLVFLIVTLMLSLPSVLLWGVLAQSTQV  163

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
              ++++    LL  +      +   F   +L  +    +++L++S  L  G +  I+   
Sbjct  164  NQLVILFIVGLLDSLILSGIFLKIGFSFPILVYEKREVIESLKRSERLTKGFFIEIWTTI  223

Query  269  VLLLVISLT-------LSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
               + I          +  L   +  +    N    + L        YL   +   + 
Sbjct  224  TFFMFIVFVSNFLVSYVGMLLGYMTLLQFLFNSILVVPLGFILTTRIYLNLKEFAYHK  281


>MBE6224901.1 hypothetical protein [Bacteroidales bacterium]
Length=230

 Score = 45.5 bits (104),  Expect = 0.020, Method: Composition-based stats.
 Identities = 25/184 (14%), Positives = 62/184 (34%), Gaps = 1/184 (1%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            + ++ +           I            +     +    LL  + Y L     +   +
Sbjct  37   YQVICLSDFPAHEYMNAIQHDDPELLLALYSGFLVKYIKLSLLCGILYALFFGGILNIVL  96

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
             +   +         K+ ++   +F L+ I++ +    G+++ IIPG+   +        
Sbjct  97   KLTKGEMHEFNLSGFKMPVQTYVNFILIGIIIGIFSTIGTIMCIIPGIYVYIRLSLAMIH  156

Query  239  LADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            + +    G  + L+KS  +  G++W IF   +L ++      F      +   A      
Sbjct  157  VLEHPEDGLSETLKKSWNMTKGNFWNIFLLALLYVLFYYLGIFCCCIGAFFSMALGSFMI  216

Query  298  LLLT  301
            ++  
Sbjct  217  VVTY  220


>MBA3392926.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=826

 Score = 47.1 bits (108),  Expect = 0.020, Method: Composition-based stats.
 Identities = 8/42 (19%), Positives = 11/42 (26%), Gaps = 0/42 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           VRC  C  E      +L     + +C  C             
Sbjct  3   VRCEKCQTEYELDEQRLKPGGVTVKCTNCGHMFKIRKRTPTN  44


>WP_182076551.1 zinc-ribbon domain-containing protein [Deefgea sp. CFH1-16]MBA5671844.1 
zinc-ribbon domain-containing protein [Deefgea sp. 
CFH1-16]
Length=118

 Score = 43.6 bits (99),  Expect = 0.020, Method: Composition-based stats.
 Identities = 9/66 (14%), Positives = 16/66 (24%), Gaps = 1/66 (2%)

Query  1   MPTVRC-PHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCG  59
           M  + C P+C         +L A +   RC  C                 +   +     
Sbjct  1   MNQITCCPNCSTAFRVTDQQLSAHQGKVRCGRCAFVFHAPDFMQAPIAGNEQNQSDDVIV  60

Query  60  LQRRIP  65
            +    
Sbjct  61  REIPPC  66


>WP_131925204.1 hypothetical protein [Hazenella coriacea]TCS93869.1 hypothetical 
protein EDD58_10578 [Hazenella coriacea]
Length=300

 Score = 46.3 bits (106),  Expect = 0.020, Method: Composition-based stats.
 Identities = 31/273 (11%), Positives = 69/273 (25%), Gaps = 21/273 (8%)

Query  73   SKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLA  132
               +     +  F +  E             +    D+             ++ + +   
Sbjct  28   MIWLFSLAISLFFNIFLEWWSWDLFHPDIGGANPNGDAVNKIIMFFLIKGLVWFISLYPL  87

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
               +   L+       +   +N       A +A+ +  + W+      +           
Sbjct  88   LQILAIILVQDTEQSFSQTIKNIWTHSGKAILAHGIALIGWVVIFFIFFSIIGLPSYLIF  147

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
                     +     +   L    G      P LL  + F     +L   N       +K
Sbjct  148  QAESFLSQEAAFWTGLYTTLFFFFG------PALLLFIRFSLVIPLLVTGNAQLKDVFKK  201

Query  253  SRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY---------------VGEAANLAFS  297
            S  L  G  + +FG    L++IS+ +  L   I +               +         
Sbjct  202  SWFLTKGSTFKVFGGIFGLVIISMIVKTLNVVITFLPDLFGASTTLIWEMIFTILIFLVD  261

Query  298  LLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
              + P   +Y+ + Y +              +Q
Sbjct  262  ASIIPLIPIYFAIFYFNELIRKEALDIQIQLKQ  294


>NCA13882.1 zinc-ribbon domain-containing protein [Proteobacteria bacterium]
Length=402

 Score = 46.7 bits (107),  Expect = 0.020, Method: Composition-based stats.
 Identities = 25/167 (15%), Positives = 59/167 (35%), Gaps = 1/167 (1%)

Query  107  LADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
               +W     R WGL  + L+  ++      +   +        + +       L +   
Sbjct  193  YGHAWATVRERFWGLFVVGLVATIIGSLIGGAITGIGALIAATTEVEVVTVVAQLLSQVL  252

Query  167  ILL-GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
                  + +  +    +      +        R + S      L  L+V  G  LL++PG
Sbjct  253  GTWPIWAGVQYANLQTVRSGRAEIDDIFVAYRRGLASLIAAQFLTTLLVVVGLALLVVPG  312

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
            ++  +   F   ++ D+ +G ++A+ +S    +G  W + G  ++  
Sbjct  313  IIVGLRLSFVTLLVVDEGLGPVEAIGESWRRSAGFGWTLLGIGLMAF  359


>MBK50175.1 hypothetical protein [Chloroflexi bacterium]MQG39568.1 hypothetical 
protein [SAR202 cluster bacterium]
Length=304

 Score = 46.3 bits (106),  Expect = 0.020, Method: Composition-based stats.
 Identities = 25/198 (13%), Positives = 60/198 (30%), Gaps = 20/198 (10%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYIL-LGLS  172
            F            L  ++    I  +L     T    +       I   T    +    +
Sbjct  25   FSISIARFKFFVALVAIIQGPAIILSLYFSGLTEQENRYDIILQIISYLTAFLAMTFSSA  84

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG-----------------  215
             +  ++  +     V L    K     V S +   ILL   +                  
Sbjct  85   LIIAAIGQHYAHNKVTLSFCFKRTWWRVLSISFWAILLTTPLFAITYLANNAFDTQPNTL  144

Query  216  --GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
                 +++++  + F ++  F    +  +    L  +++S  L+      +F   +L+ +
Sbjct  145  SMLAVVIVMLIFIPFLIYTSFYTQTVIIEGFDLLNGIKRSATLIHRKLLKVFTAIILIGL  204

Query  274  ISLTLSFLTARIPYVGEA  291
            +++ LS +      + E 
Sbjct  205  LAIGLSVIVNIPFALLEL  222


>OUS04399.1 hypothetical protein A9Q96_15750 [Rhodobacterales bacterium 52_120_T64]
Length=383

 Score = 46.3 bits (106),  Expect = 0.020, Method: Composition-based stats.
 Identities = 7/36 (19%), Positives = 12/36 (33%), Gaps = 0/36 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
            + CP C  + +   S +       +C EC      
Sbjct  2   RITCPTCSTQYDVDESDIAFTGQDVQCTECMTIWTQ  37


>MBA3701820.1 hypothetical protein [Rubrobacteraceae bacterium]
Length=120

 Score = 43.6 bits (99),  Expect = 0.020, Method: Composition-based stats.
 Identities = 21/107 (20%), Positives = 39/107 (36%), Gaps = 8/107 (7%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
                + LL+IPG+     +     V+ +++IG L A  +S  LV G +W +F    +   
Sbjct  7    TTIATGLLVIPGVWLYTRWSLTTPVIREEDIGPLAATRRSNELVRGRFWLVFMTATVAYY  66

Query  274  I--------SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
            +        +L     T    +         + L  P +     L +
Sbjct  67   LEGVVIHEGALVAGSFTGSQTWGAWMGGSIVATLAMPLAAFATSLAH  113


>NET71271.1 hypothetical protein [Sphaerospermopsis sp. SIO1G2]
Length=313

 Score = 46.3 bits (106),  Expect = 0.021, Method: Composition-based stats.
 Identities = 12/66 (18%), Positives = 18/66 (27%), Gaps = 2/66 (3%)

Query  1   MPTV--RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHC  58
           M  V  RCP C +    P   +       RC  C       P +  R+ ++         
Sbjct  1   MVAVLLRCPACLSRFRVPDGAVAGDGREVRCGSCGHQWHATPDQLIRSTSSPTQVPLDTS  60

Query  59  GLQRRI  64
                 
Sbjct  61  VASTPP  66


>WP_148921784.1 zinc-ribbon domain-containing protein [Oceanicella actignis]TYO91351.1 
Meckel syndrome type 1 protein [Oceanicella actignis]
Length=379

 Score = 46.3 bits (106),  Expect = 0.021, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 13/34 (38%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP C A+ + P +  P +     C  C    
Sbjct  2   QITCPSCAAQYDAPDAAFPIEGRVVECSACGARW  35


>WP_152889271.1 DUF975 family protein [Clostridium tarantellae]MPQ43597.1 DUF975 
family protein [Clostridium tarantellae]
Length=256

 Score = 45.9 bits (105),  Expect = 0.021, Method: Composition-based stats.
 Identities = 28/213 (13%), Positives = 67/213 (31%), Gaps = 21/213 (10%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
               +      ++ I  L ++  F              L     + ++ + +A  +  ++ 
Sbjct  47   IGAYLTPALAIVYILCLKMLKDFKGKGVPKYRYVDISLAQIWNSIKYTLWIALFSLPMVI  106

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            L ++     I+     V +     L      S  LL+I+ I+ +           ++  +
Sbjct  107  LIFIALIPLIFKILNSVYMGDPYYLLNDFSFSINLLIIVYIVGIV--------YCIILNL  158

Query  231  WFFFCQYVLADDN--IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
             F    Y++ +          + ++  L+ GH W  F   +  ++  +           +
Sbjct  159  MFSLTPYIIMERKNHDTVRDLMTEAHKLIKGHMWNYFLFKLSFILWYI-----------L  207

Query  289  GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
            G        L +TP+  +     Y  LK     
Sbjct  208  GIITLGIGFLWITPYIEMCKIEYYYKLKEIKGN  240


>MBI4613963.1 hypothetical protein [Planctomycetes bacterium]
Length=104

 Score = 43.2 bits (98),  Expect = 0.021, Method: Composition-based stats.
 Identities = 10/39 (26%), Positives = 15/39 (38%), Gaps = 3/39 (8%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  + CP CG   +   + LP      RC +C   +   
Sbjct  1   MVKITCPGCGKSYDV--TNLPP-GRRLRCGKCNTHIERP  36


>WP_144997406.1 hypothetical protein [Polystyrenella longa]QDU81780.1 hypothetical 
protein Pla110_35300 [Polystyrenella longa]
Length=654

 Score = 46.7 bits (107),  Expect = 0.021, Method: Composition-based stats.
 Identities = 38/393 (10%), Positives = 97/393 (25%), Gaps = 15/393 (4%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              +RC  C A    P + +       +CP+C   +     +     +    +       +
Sbjct  3    IKIRCKVCEAGLKLPDAAM---GKVVKCPKCANRIKVPSRQPGSEGSAKPPSQSSPGRKR  59

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
            R   S    + +         +        +     +G+     +  ++ +L  +R   +
Sbjct  60   REAASSTGFMTALGNLPLEDQQMKICPRCGQDVDPETGVCESCGIDVETGQLTEKRKRLI  119

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                +              +     +           +    +A     + +  G     
Sbjct  120  AVKGVDPRDFFKNVANDFFVYPFKNFGLIGRSILISMLAYIFIAIAHFFIHFSDGLTVEI  179

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                 V +  +   G      +T+   + +      + L             F  +++  
Sbjct  180  FMWAFVSVSTAFFFGW----FWTIASEMAVYTFQYHAALKKKKKTRPFKPKKFDIFMVFQ  235

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS-----FLTARIPYVGEAANLAF  296
            +    L  +  +   V  H   +F   +++    L        FLT     +G +  L  
Sbjct  236  NGYRFLAWVICN---VFVHALVLFPLILVVGFSLLASGSGSKTFLTIIGILLGFSLTLVG  292

Query  297  SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQN  356
              + +    +   + +                 Q L        +  IP           
Sbjct  293  LSIPSAVGHMSMPVTWPGWNPFKLAQLTMRTIGQSLVWGLLTLMFAAIPIAGYAVAYNYV  352

Query  357  LSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPE  389
                  +     +  R+    +  P     LP+
Sbjct  353  ADDLDEVWEPMRMNARITWAQRLNPGPKEVLPD  385


>RDV37346.1 hypothetical protein DV096_15350 [Bradymonadaceae bacterium TMQ3]
Length=225

 Score = 45.5 bits (104),  Expect = 0.021, Method: Composition-based stats.
 Identities = 21/179 (12%), Positives = 57/179 (32%), Gaps = 5/179 (3%)

Query  140  LLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH  199
            +L             + + +   T A  +L +  +       +    V      +L ++ 
Sbjct  1    MLPFFGVIGLWLAAVYLYPVGAITAALPVLLMLVVGWVHLHRVGYEAVYAAEGARLPMKS  60

Query  200  VGSFTLLLILLILVV-GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
                 +  +++   +   GS+   +P L+  +        +  +    ++AL+ +  L  
Sbjct  61   WFVRVMRGVVVAFGLNFIGSMFFFVPSLVAQILLMPYLLFIMVEGEEPIEALKINVRLAG  120

Query  259  GHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN----LAFSLLLTPFSFLYYYLIYS  313
                ++F  ++ L V ++ L  L    P     A        S++   +       +  
Sbjct  121  NRMGSLFVFWLTLRVGAVMLMILGGMAPVAAYIALSDAPGPISIVEMEWVMWTMVALGC  179


>WP_169691242.1 zinc-ribbon domain-containing protein, partial [Vibrio parahaemolyticus]
Length=53

 Score = 41.7 bits (94),  Expect = 0.021, Method: Composition-based stats.
 Identities = 11/35 (31%), Positives = 15/35 (43%), Gaps = 1/35 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQT  35
           M  +RC +C        SK+P    + RCP C   
Sbjct  1   MI-IRCDNCSVSLQLDESKIPNGNFTVRCPRCQNM  34


>MBI5186362.1 zinc-ribbon domain-containing protein [Nitrospinae bacterium]
Length=657

 Score = 46.7 bits (107),  Expect = 0.021, Method: Composition-based stats.
 Identities = 9/31 (29%), Positives = 12/31 (39%), Gaps = 0/31 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           V C  C        +K+P   +  RCP C  
Sbjct  3   VTCGFCNTVFRVDDAKIPPAGAKVRCPSCKN  33


>MBI5485606.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=252

 Score = 45.9 bits (105),  Expect = 0.021, Method: Composition-based stats.
 Identities = 23/187 (12%), Positives = 46/187 (25%), Gaps = 0/187 (0%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             + CP C       ++++P   S+ RCP C       P ++           CP CG  +
Sbjct  2    KIECPECHFSAEADAARIPPGGSNTRCPRCATVFTVTPVKTGVPDGVQGKVVCPKCGAWQ  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                            R       +          S    +     DS+     RG  + 
Sbjct  62   ETAESCGLCGLIYEKFRIAEEHRQMANGSSGTEMPSSAAPLPATGQDSFHFGYGRGDLMT  121

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
                  +     P    +          +      A+ +  +      + +   +     
Sbjct  122  WASEGYLAADALPQALRIAGTLPRAPEWRRLLDSLALWMGAIFLAAAVIFFFAYNWKELG  181

Query  183  CKTDVGL  189
                 G+
Sbjct  182  HFARFGI  188


>HHW99952.1 DUF975 family protein [Acholeplasmataceae bacterium]
Length=226

 Score = 45.5 bits (104),  Expect = 0.021, Method: Composition-based stats.
 Identities = 25/211 (12%), Positives = 63/211 (30%), Gaps = 13/211 (6%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
                   L+      ++ ++    ++                          +  + T  
Sbjct  16   YGYRMLYLWQLLLPLIIILFFGAFLIPAILSRILGGPNVTNDGKLGLIIAPISSWILTRL  75

Query  166  YILLGLSWMTGSMFIYICKTDVGL-FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
               L  +         +      L    +     H+     + ++  L+V  G  L I+P
Sbjct  76   IWPLFRAKRILIGKELLTSNYSDLDKIKLVKEPLHILKLICVSLIFTLMVIIGLGLFIVP  135

Query  225  GLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            G+   V F    +++ ++ +I  + A + S  L   +   +     L+  +S    F+  
Sbjct  136  GIFLLVIFSMANHMMVENKDISIIAAFKYSYRLTKYYKKDV-----LIFFLSFIPHFILG  190

Query  284  RIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
             +          + L   P+  + Y  ++ D
Sbjct  191  FLTL------GIYFLFFYPYFEVAYTNLFID  215


>HCI62569.1 hypothetical protein [Erythrobacter sp.]
Length=95

 Score = 42.8 bits (97),  Expect = 0.021, Method: Composition-based stats.
 Identities = 17/81 (21%), Positives = 33/81 (41%), Gaps = 1/81 (1%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDN-IGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
              G ++  +      V       V+A +     ++AL +S  L  G  + +F  ++LLLV
Sbjct  9    VLGGIVAFVVTCYIVVKLSLVAPVIAIEGERNPVRALRRSWTLTRGQSFRLFLFYLLLLV  68

Query  274  ISLTLSFLTARIPYVGEAANL  294
              + LS +   +  +  A   
Sbjct  69   AFMVLSVVITLVLGLPFALAG  89


>MBI4603044.1 hypothetical protein [Planctomycetes bacterium]
Length=364

 Score = 46.3 bits (106),  Expect = 0.021, Method: Composition-based stats.
 Identities = 27/187 (14%), Positives = 49/187 (26%), Gaps = 19/187 (10%)

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS-----FLTARI  285
              F     L  +      AL +SR L  G      G F ++ ++   +S     FL    
Sbjct  180  MLFVAVPALVLERASVGAALRRSRDLTRGFRRRTLGLFGVMYIVGTAISTAVRLFLEGGT  239

Query  286  PYV-----GEAANLAFS-----LLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLT  335
              +     G+ A    S     ++      +   ++Y DL+    G     + +    L 
Sbjct  240  ALLVRSEAGQIAATVLSYGFNQVVTGGLFGVLTVVVYFDLRVRSDGFDLENLAQ----LV  295

Query  336  AAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLS  395
              I      P L   + +          S                    R  P      +
Sbjct  296  DVIAEREGAPPLASAATAATAWEPPAGSSGEAAPPSEPPAVSWGEAAAPREPPATSTGEA  355

Query  396  SADYKLL  402
            +   +  
Sbjct  356  AGPSEPP  362


>HDY19325.1 hypothetical protein [Gemmata sp.]
Length=131

 Score = 44.0 bits (100),  Expect = 0.022, Method: Composition-based stats.
 Identities = 11/35 (31%), Positives = 16/35 (46%), Gaps = 3/35 (9%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
             V CP CGA    P +   A   + +CP+C   +
Sbjct  3   ILVACPSCGARLKVPDN---AAGKTVQCPKCGTNM  34


>MBI2128803.1 hypothetical protein [Candidatus Woesearchaeota archaeon]
Length=159

 Score = 44.4 bits (101),  Expect = 0.022, Method: Composition-based stats.
 Identities = 29/157 (18%), Positives = 67/157 (43%), Gaps = 0/157 (0%)

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            +    ++K G+++   F  L ++  L +    +L +IPG++F V++ F    L  +N   
Sbjct  1    MSFNDAVKGGMKYYLRFLGLALVKGLALLALFILFVIPGIIFSVYWAFSSIALVGENKAI  60

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFL  306
            +++++KS+ +V G WW++FG  +L  ++   +S +   +  +               + L
Sbjct  61   IESMKKSKEVVKGKWWSVFGYLLLFFLLISVISSVVMVVGLILGVLLTLIIAYTAGGNIL  120

Query  307  YYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWML  343
               +  +++           +      L A  F   L
Sbjct  121  VSAMYAANIIYMVLIMLVNAVAWPMSILFAKNFYMEL  157


>HCR85950.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=80

 Score = 42.4 bits (96),  Expect = 0.022, Method: Composition-based stats.
 Identities = 9/41 (22%), Positives = 16/41 (39%), Gaps = 0/41 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
            + CP+C      P+  L     + +C +C  T   +P   
Sbjct  2   LISCPNCKTSFALPAKALGINGRTLKCSKCSHTWFQEPTNF  42


>WP_149254325.1 hypothetical protein [Labrys sp. KNU-23]QEN90133.1 hypothetical 
protein FZC33_29190 [Labrys sp. KNU-23]
Length=303

 Score = 45.9 bits (105),  Expect = 0.022, Method: Composition-based stats.
 Identities = 34/232 (15%), Positives = 69/232 (30%), Gaps = 0/232 (0%)

Query  53   ATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWE  112
                H         + +   +    CRR          R    +     +    L     
Sbjct  8    RQKCHAVGISARVENLVPCDNIVEVCRRIPGKVRPAMSRSENVTLVRPATHRFGLGLVLR  67

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
                        + L  + A  P    L   P            W+ LL ++A   +  +
Sbjct  68   ASWAIYRMRFLRFTLLALAAVLPPPLILTFFPIPQDLLLTSAEYWSTLLLSLALSAMAGA  127

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
                 +   +      L  +      +  +     I+++     G ++L++PGL+    +
Sbjct  128  MSCLGVQQVLDHQPFSLRAAFTHAGSNCLALFATTIIVLGAFYLGLVVLVVPGLMMLCRY  187

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            F    V   +  G LQ+L++S  L  G  W +    + LL++ +        
Sbjct  188  FVAGPVCVIERTGPLQSLKRSAELTKGFRWKLLILGLFLLLLQIAPRLAINF  239


>RKY52211.1 hypothetical protein DRP93_08535, partial [Candidatus Marinimicrobia 
bacterium]
Length=32

 Score = 40.9 bits (92),  Expect = 0.022, Method: Composition-based stats.
 Identities = 12/33 (36%), Positives = 18/33 (55%), Gaps = 1/33 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECC  33
           M  + C  CG +      K+ ++K+S RCPEC 
Sbjct  1   MIVI-CEECGKKYRIDEEKINSEKASLRCPECG  32


>HBC53542.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=86

 Score = 42.8 bits (97),  Expect = 0.022, Method: Composition-based stats.
 Identities = 8/41 (20%), Positives = 12/41 (29%), Gaps = 1/41 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
           M    C  C        + +  K    RC +C      +P 
Sbjct  1   MIL-TCSACTTRFLVDPAAVGRKGRHVRCAKCLHVWFQEPP  40


>HGH09376.1 hypothetical protein [bacterium]
Length=209

 Score = 45.1 bits (103),  Expect = 0.022, Method: Composition-based stats.
 Identities = 8/31 (26%), Positives = 17/31 (55%), Gaps = 0/31 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPEC  32
             ++CP C ++     +K+P +    +CP+C
Sbjct  3   IIIQCPGCKSKYKMEKAKIPDEGKKVKCPKC  33


>XP_010680474.1 PREDICTED: uncharacterized protein LOC104895612 [Beta vulgaris 
subsp. vulgaris]
Length=121

 Score = 43.6 bits (99),  Expect = 0.022, Method: Composition-based stats.
 Identities = 18/109 (17%), Positives = 38/109 (35%), Gaps = 7/109 (6%)

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG-------EAAN  293
             + I G +AL+KS+ L+ G     F   +++ +  +    L  ++P  G          +
Sbjct  1    MEKICGFKALKKSKELIKGKMGIAFAILLVINLCGIPTMILIDKLPRFGIVEKFCFGIFS  60

Query  294  LAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWM  342
            L      T F  +   + Y   K+ +            L +    +  +
Sbjct  61   LVLWTFFTLFGLVVQTVFYLVCKSYHHESIDKTTLADHLDVYLGEYVPL  109


>KAB2603648.1 hypothetical protein D8674_004653 [Pyrus ussuriensis x Pyrus 
communis]
Length=356

 Score = 46.3 bits (106),  Expect = 0.022, Method: Composition-based stats.
 Identities = 33/260 (13%), Positives = 71/260 (27%), Gaps = 8/260 (3%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
               + L        VL F+ + ++ ++     +    +     ++          +    
Sbjct  88   WATYWLFNAAYFTFVLIFSLLSTSAVVYTIACIYTAREITFKKVMSVVPKVWKPVMVTFL  147

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
             +   + C   V L   +   L        +    +++     ++  I      + +   
Sbjct  148  CTFMAFFCYNIVALIVLIIWVLS-----MGISGASVIIGFLLLIVYFIGFGYLTLIWQLA  202

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY---VGEAA  292
              V   +   G++A+ KS+ L+ G+ W     +  L V++  L F   ++         A
Sbjct  203  SVVSVLEEARGIKAMAKSKELLKGNMWVATIIYFKLNVLAALLQFGFQQLVVSGRFFAIA  262

Query  293  NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSL  352
                   L  F       I     A  +         + L  T        I  L     
Sbjct  263  TGILRAYLLLFLPFSIVTIIRPHAAAPKPVLLILTTWRLLRNTTGSAAATAIVHLAHNGN  322

Query  353  SRQNLSAEQLLSAGKDIQQR  372
            S  N  A          +  
Sbjct  323  SDANWLAICQQFGDFCWKTS  342


>XP_022318699.1 uncharacterized protein LOC111121634 isoform X2 [Crassostrea 
virginica]
Length=350

 Score = 46.3 bits (106),  Expect = 0.022, Method: Composition-based stats.
 Identities = 28/309 (9%), Positives = 83/309 (27%), Gaps = 12/309 (4%)

Query  12   ERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATC------PHCGLQRRIP  65
             R+      P+     +  +  + +              +                  I 
Sbjct  42   SRDLYDRNNPSAGKGTKPYDWGKPIHQKKHSHGVFHHYYSYFGNNIVEAIHVVFQVYDIG  101

Query  66   SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIY  125
                ++            +  +            +  +  ++  S  L  +   GL+ ++
Sbjct  102  QSENDVNWGYFLVFISLYAHRIPIMMFLVLWFVVMDKLDLVVMISISLKAKHFPGLIFLW  161

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
            ++ + +      +  L           QN    +L  T + ++  + +            
Sbjct  162  MIYLCILNYVYQNLTLGLRLRDFVFSLQNSSSILLYITTSLVITAIIFWIYKPHQNTITK  221

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG  245
             + L         H+        L+ ++     L+     +L  + +   ++    ++  
Sbjct  222  YMILSIQQASAFLHIHKAQPEKPLIDVLFFQFMLIGYYDLILHFIRYVLYKFYSCLED--  279

Query  246  GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSF  305
               AL+ +  L           F+ + +    + FL   +    +A  + F++      F
Sbjct  280  ---ALKWAWNLFLSIIRKCVVNFMNITL-YFIVIFLLEFLFTNSQALIICFNITKLTLMF  335

Query  306  LYYYLIYSD  314
               + I  D
Sbjct  336  GILFKICID  344


>XP_028991998.1 trace amine-associated receptor 13c-like [Betta splendens]
Length=409

 Score = 46.3 bits (106),  Expect = 0.023, Method: Composition-based stats.
 Identities = 25/209 (12%), Positives = 55/209 (26%), Gaps = 7/209 (3%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
              +    I  L   + F+ I +++        +              V    +       
Sbjct  180  CWYLGNHICFLQGYVTFSIISASIAGMVLISADRYAAICDPLHYSTRVTVSRVKKCVCLC  239

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
                +     +     +K GL +      ++ +  ++      L  +      ++ +   
Sbjct  240  WFCAFTYNIILFREELIKPGLYNSCYGMCVVYIDRIMSYVDVALTFVVPFSSIIFLYMRV  299

Query  237  YVLADDNIGGL----QALEKSRLLVSG---HWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
            ++ A      +     A+                  G  V   +I     +L   IPY  
Sbjct  300  FLTAVSQARAMRSHVAAVTLHHSGTKRSELKAARTLGVVVAWFLICFLPYYLLTLIPYTL  359

Query  290  EAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
             A    F L L+PF     Y ++      
Sbjct  360  PAVFSLFLLCLSPFLNPVIYALFYPWFRK  388


>KAA3632145.1 hypothetical protein DWP97_11630 [Calditrichaeota bacterium]
Length=246

 Score = 45.5 bits (104),  Expect = 0.023, Method: Composition-based stats.
 Identities = 24/204 (12%), Positives = 52/204 (25%), Gaps = 4/204 (2%)

Query  99   GLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWA  158
                    L             +L  + +  +     I   LL     +L    ++    
Sbjct  28   WEMFKKHFLILMGGFIVFFAITILIQFTMIGMFVSIAIMPPLLGGWILFLLNAARDSNPR  87

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            +      +   G++     +   I    +        G        L   L         
Sbjct  88   VGDLFKGFDRYGMTMGVYWLSTLIVGILLFPSFLYIFGNLLFSPEFLSGNLFYSEFFEIY  147

Query  219  LLLII---PGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              +        L  + F    +V  D+   G +    +S  L+ GH  +     +L  ++
Sbjct  148  FFVFCNAAIIFLIFIRFVLFWFVAMDEMERGIVNVFNESARLMKGHVVSYLQLSILGFLL  207

Query  275  SLTLSFLTARIPYVGEAANLAFSL  298
            +L    L      +  A  +   +
Sbjct  208  TLGGFILLVIPGLIIMAVFILAQI  231


>TKR67062.1 hypothetical protein L596_023271 [Steinernema carpocapsae]
Length=2506

 Score = 47.1 bits (108),  Expect = 0.023, Method: Composition-based stats.
 Identities = 28/280 (10%), Positives = 62/280 (22%), Gaps = 23/280 (8%)

Query  350  VSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSAD-YKLLLSKQRK  408
               +              +  +   +          +         S D  +   + +  
Sbjct  579  YIAANIPTKEPHAAIPTAETGKPSQSVNGPDGLPLATDTSGNYIAPSGDKVETDENGRPL  638

Query  409  TTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDA  468
              +   L       +       D  P    + +   +P +    +        + L  D 
Sbjct  639  GPNGSVLPTDDTGKYIYPVLGPDGQPLPTDENQRPIYPAVEPDGRPLPTDSEGRPLGSDG  698

Query  469  RDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSI----YLRQGTQAEQVHSILGKLELT  524
            +       + +  A     +    + ++      I     L      + +  I G   L 
Sbjct  699  Q-PLPTNAAGQPVAPDGSPLPTDAQGNVVFDQEKIKEVKPLPTDESGKMILPINGPDGLP  757

Query  525  LPLAI-------ESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVH--  575
            LP          +   +  ++ G+ L   G  L              +G     L     
Sbjct  758  LPTDSSRKPVNAQGEVIPTDESGRPLGPDGSVLPTDD--DGKYIYPAVGPDGKALPTDDN  815

Query  576  ------ASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIE  609
                  A ++  +PL           D   L     G   
Sbjct  816  GRPIYPAVDADGQPLPTDSEGKPLGSDGKPLPTNAGGKPV  855


>WP_042278998.1 MULTISPECIES: hypothetical protein [Nonlabens]ALM20391.1 hypothetical 
protein AAT17_03665 [Nonlabens sp. MIC269]ARN70546.1 
hypothetical protein BST91_02160 [Nonlabens sediminis]GAK97389.1 
hypothetical protein JCM19294_1435 [Nonlabens sediminis]
Length=312

 Score = 45.9 bits (105),  Expect = 0.023, Method: Composition-based stats.
 Identities = 21/115 (18%), Positives = 41/115 (36%), Gaps = 1/115 (1%)

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             +F       +G F  + LG   V    ++ +++I +   G        + +  W     
Sbjct  124  LVFKRALGKMLGSFVILFLGAICVIPVAIVGVIMIFIPILGLFAFAGLIITYFTWIGLSV  183

Query  237  YVL-ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            +     D I    A+ +   L+   +W  F    + LV+   LS L   +P +  
Sbjct  184  FCYAYTDEISMTDAMGRGWQLLFFKFWKAFFTSFVALVVLYILSILFQILPAMLI  238


>RLG14639.1 hypothetical protein DRN66_01465 [Candidatus Nanohaloarchaeota 
archaeon]
Length=284

 Score = 45.9 bits (105),  Expect = 0.023, Method: Composition-based stats.
 Identities = 30/212 (14%), Positives = 62/212 (29%), Gaps = 9/212 (4%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
                      +    L   LA       L        +   +       ++    +L   
Sbjct  63   PTMLFYSLWAIAKDYLIYSLAILFAIIILHYYIKATYSVIAKAIHQNKPVSIFEALLESK  122

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
              +   M        +     + + +  +     L  + I+++      L   GL++ V 
Sbjct  123  KRIVALMITDALLILIAFMAVIAVLVIEIPICIFLGPIGIILLLFVDFFLFFAGLVYLVL  182

Query  232  FFFCQY-VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA-------  283
                   V+  D   GL A+++S  L+       +   +L+ +IS    FL +       
Sbjct  183  LSIISISVVVLDKKSGLDAIKESYRLIKYEKLDSWLLIILVFIISSIYIFLVSAVVEIFS  242

Query  284  -RIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
              I            + L  F ++ Y L+Y  
Sbjct  243  LLIGPYAVLVGTILHVFLPVFGYICYTLLYFH  274


>WP_161790246.1 hypothetical protein, partial [Streptacidiphilus carbonis]
Length=277

 Score = 45.9 bits (105),  Expect = 0.023, Method: Composition-based stats.
 Identities = 21/98 (21%), Positives = 32/98 (33%), Gaps = 0/98 (0%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
                   LV+    L+  +  L   V       V+  +N     AL ++  L  G WW  
Sbjct  172  GSGAAFGLVLLLFWLVTSVTVLYVTVRLVPLVPVVVLENQRPFAALRRAWRLNEGAWWRS  231

Query  265  FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            FG   L+ ++      +      V     L   L  TP
Sbjct  232  FGIPYLIGLVGSIAGQVIMVPAVVIGTLPLFDQLGRTP  269


>WP_181885661.1 zinc-ribbon domain-containing protein, partial [Trinickia dinghuensis]
Length=158

 Score = 44.4 bits (101),  Expect = 0.023, Method: Composition-based stats.
 Identities = 13/93 (14%), Positives = 19/93 (20%), Gaps = 1/93 (1%)

Query  1   MPTVR-CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCG  59
           M     CPHC        + L A    ARC  C Q       +      +   A      
Sbjct  1   MVLATRCPHCETVFRVQEALLAASGGFARCGHCQQVFDARSNQLDPHAGSHPAAHESESA  60

Query  60  LQRRIPSDRLEIQSKTVNCRRCNRSFCLQPERE  92
                            +    +     +    
Sbjct  61  PLPPAAEHEPTHDDPHASAAEAHAQAEREAREH  93


>WP_136799889.1 hypothetical protein [Desulfopila sp. IMCC35004]
Length=288

 Score = 45.9 bits (105),  Expect = 0.023, Method: Composition-based stats.
 Identities = 21/161 (13%), Positives = 49/161 (30%), Gaps = 11/161 (7%)

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG-------  216
                +           + + K +   F  + +    + S      L   +          
Sbjct  86   YFISMFFNVATVQCAIMRMNKQNPTFFDGISVAFYRLPSVLCWSFLSSTIGFLLQFVEER  145

Query  217  ----GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
                G+L+    GL + +  +    ++  + +G L +L++S  ++ G W           
Sbjct  146  SNWVGNLMAKFIGLTWTLASWLVLPIMVYEKVGPLTSLKRSAKIIKGTWVVALLEEFTFG  205

Query  273  VISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
            ++   LS      P          SL++  F  + Y  +  
Sbjct  206  LLISVLSSKAIFFPLFALLFGTYKSLIIAAFISILYLAVLW  246


>OAI45068.1 hypothetical protein AYO44_13170 [Planctomycetaceae bacterium 
SCGC AG-212-F19]
Length=628

 Score = 46.7 bits (107),  Expect = 0.023, Method: Composition-based stats.
 Identities = 12/72 (17%), Positives = 18/72 (25%), Gaps = 3/72 (4%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    CP C    N   SK        +CP+C Q +                 + P    
Sbjct  1   MIRFACPSCQTAFNVDPSK---AGLRTKCPKCGQRVAVPQPRDVTLPGILLTGSPPADDT  57

Query  61  QRRIPSDRLEIQ  72
                +  +   
Sbjct  58  PTGGQTAPIPEW  69


>WP_068135767.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Limnochorda pilosa]
Length=513

 Score = 46.7 bits (107),  Expect = 0.023, Method: Composition-based stats.
 Identities = 15/101 (15%), Positives = 30/101 (30%), Gaps = 11/101 (11%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            + V    +  +   L     +     ++ D N+   +AL  S   + G    + G   +L
Sbjct  398  VGVLVFFVAGVWLMLYLGFGYMITIPLVLDQNLSPWRALTTSARALRGRRLTVLGVLAVL  457

Query  272  LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             +           I  +G        L+  P        +Y
Sbjct  458  FL-----------INVLGALPFGLGLLVTAPLGPTAIVAVY  487


>WP_051533940.1 hypothetical protein [Desulfitibacter alkalitolerans]
Length=252

 Score = 45.5 bits (104),  Expect = 0.023, Method: Composition-based stats.
 Identities = 27/171 (16%), Positives = 58/171 (34%), Gaps = 10/171 (6%)

Query  158  AILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG  217
             + +  +           GS+   +      +   ++ G    G F L  +++ L +   
Sbjct  80   FMFIIFLLLSSFLKGGFLGSVLSGLNGESFNMDTFIRYGKDFFGRFLLQFLIIFLAMFVL  139

Query  218  SLLLIIPGLLFCVWFFF----------CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
                 I G L  ++               YV+  +N   +Q  +KS  LVS +   +FG 
Sbjct  140  IPFAFILGPLALLFLLVFLVLFFMLLFWDYVIVVENTDVIQGAKKSWNLVSNNIGTVFGF  199

Query  268  FVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
             + ++V++   S L   +        +  S+    F  +  + + S     
Sbjct  200  ILPVVVVAAIFSILANFMVGAAPILAVMASIGYAYFGTVVIFAMMSFYHEI  250


>MBA4071726.1 hypothetical protein [Gemmatimonas sp.]
Length=455

 Score = 46.3 bits (106),  Expect = 0.023, Method: Composition-based stats.
 Identities = 14/62 (23%), Positives = 22/62 (35%), Gaps = 0/62 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           V CP C +      SK+P     ARC  C   +   P  +   +      + P   ++  
Sbjct  3   VSCPECRSVFRVDPSKVPTVGVRARCSVCGGVITMAPPAAAVAEQEGRRTSMPATAVRPM  62

Query  64  IP  65
            P
Sbjct  63  AP  64


>WP_191346790.1 DUF975 family protein [Candidatus Gallimonas merdae]
Length=232

 Score = 45.5 bits (104),  Expect = 0.023, Method: Composition-based stats.
 Identities = 16/123 (13%), Positives = 45/123 (37%), Gaps = 1/123 (1%)

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
            ++   ++ +     I+I    + +  +          F     L  ++    +++ I+  
Sbjct  80   FMRALVATLIRFAIIWIPIVILMIIDAALCAPLLADGFQSDASLAFVLTMVFAIVCILYY  139

Query  226  LLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            L+     +F  YV+ D+ ++ G   +++S  +  G  W + G  +  L   +        
Sbjct  140  LIMNAKMYFTYYVMNDEVDLSGWNCIKRSMEMAKGRIWKLIGFELSFLGWWILCIITLGA  199

Query  285  IPY  287
            +  
Sbjct  200  LTL  202


>KAF2556170.1 hypothetical protein F2Q68_00012976 [Brassica cretica]
Length=2011

 Score = 47.1 bits (108),  Expect = 0.024, Method: Composition-based stats.
 Identities = 50/480 (10%), Positives = 124/480 (26%), Gaps = 16/480 (3%)

Query  25    SSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRS  84
                RC            E+     +D IA  P+    +     R    +        N  
Sbjct  683   WDKRCFRLG---WPMKPEADFFIHSDEIAQHPNERRDQVPHGKRKPKTNFVEARTFWNLY  739

Query  85    FCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKP  144
                     F         I         L        L +  + I  AF  +  A L   
Sbjct  740   RSFDRMWMFLVLSLQTMMIVAWSPSGSILAIFEEDVFLNVLTIFITSAFLNLLQATLDII  799

Query  145   ATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT  204
              ++   ++  +   +   T   +    + +    +    +   GL +     ++     T
Sbjct  800   LSFGAWKSLKFSQILRFITKFLMAAMWAIILPIAYSKSVQNPTGLVKFFSSWVQSWPHQT  859

Query  205   LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
             L    + L V         P +L  V+F         +       +    L++      +
Sbjct  860   LYNYAIALYVL--------PNILAAVFFLLPPLRRIMERSN----MRIVTLIMWWAQPKL  907

Query  265   FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
             +    +   +     +    +  +      +F + + P       LI+     NY+  + 
Sbjct  908   YVGRGMHEEMFALFKYTFFWVMLLLSKLAFSFYVEILPLVKPTK-LIWDMSGVNYQWHEF  966

Query  325   PPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLN  384
              P     + +  +I+G +++   +   +     S       G                  
Sbjct  967   FPNATHNIGVIISIWGPIVLVYFMDTQIWYAIFSTIFGGIYGAFSHLGEIRTLGMLRSRF  1026

Query  385   RSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSD  444
             R +P       +        ++          +   +   ++F    ++  L     L  
Sbjct  1027  RFVPSAFCSKLTPSPPGRAKRKHLDEQVDENDIARFSQMWNKFIYTMRDEDLISDRILER  1086

Query  445   FPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIY  504
               + S   +   + +  + ++   ++   R+          V  +  +        R + 
Sbjct  1087  AHHQSGDIENEKKEQRFEKINLGGQNDSWREKVVRLLLLVTVKESAINIPQSLEARRRMT  1146


>MBL91301.1 hypothetical protein [Myxococcales bacterium]
Length=676

 Score = 46.7 bits (107),  Expect = 0.024, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 14/34 (41%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            V C  CG E +   +++P    + +C  C    
Sbjct  2   EVVCSSCGTEYDFDEARVPEGGVTVKCTTCGHVF  35


>KAB2836129.1 hypothetical protein F9K49_02430, partial [Caedimonadaceae bacterium]
Length=183

 Score = 44.8 bits (102),  Expect = 0.024, Method: Composition-based stats.
 Identities = 9/58 (16%), Positives = 11/58 (19%), Gaps = 1/58 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHC  58
           M  + CP C     T            RC  C                   +   P  
Sbjct  1   MI-IVCPQCDRRYLTNDVDFKETGRHVRCSGCGHEWFHASVMESDDAEPLYVDVDPKH  57


>OQA12877.1 hypothetical protein BWY64_03851 [bacterium ADurb.Bin363]
Length=308

 Score = 45.9 bits (105),  Expect = 0.024, Method: Composition-based stats.
 Identities = 23/217 (11%), Positives = 71/217 (33%), Gaps = 16/217 (7%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            L    + I +    +    ++   + +                 +  +    MT  ++ +
Sbjct  73   LFFLCVPIWIIILNLSQGFIINIISHIISDRPFSLSETWKEFFKFEKIFNLLMTMFLYGF  132

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            I    +     + +    + + +    LL ++     ++L+I  ++F + + F   V+  
Sbjct  133  IMSLPIIPCVVLFICFPFIVTSS-YKALLTVLFFIILIILVIVIMIFALVYNFLVPVIVL  191

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL-------  294
            +      A++++  L+      +    +LL ++   +    +          +       
Sbjct  192  EKKAYFSAIKRAMTLIIKDPLKVISVTLLLSMLVQIIQGAFSVPFVFLSIFLMQYHKGLY  251

Query  295  --------AFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                      +++L P  F+   L+Y D++    G  
Sbjct  252  LVIQMLPQLSAIILVPVLFVGNTLLYYDVRFRKEGYD  288


>HHJ03925.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=481

 Score = 46.3 bits (106),  Expect = 0.024, Method: Composition-based stats.
 Identities = 9/51 (18%), Positives = 16/51 (31%), Gaps = 0/51 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIA  53
            VRCP C        + +  +  + +C  C       P ++    T     
Sbjct  2   EVRCPKCQTMYQLDETHITQEGVNVQCTNCHNIFRVLPPKAAPAPTQMGWH  52


>KPA16453.1 magnetosome protein Mad28-2 [Candidatus Magnetomorum sp. HK-1]
Length=620

 Score = 46.7 bits (107),  Expect = 0.024, Method: Composition-based stats.
 Identities = 10/117 (9%), Positives = 31/117 (26%), Gaps = 0/117 (0%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
              +C  C    +    ++P+K  + +C  C       P E +     +         +  
Sbjct  2    KFKCDKCKTTYSINDDRIPSKGGNVKCRVCKHIFKIYPPELKEIINNNEEKPVDPVIIFT  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGW  119
               +   +++      +     F    E +        +   + +         + +
Sbjct  62   PDQNKIKQLEKLFQQSQVPVTFFNNFKEAQQYFKIDTSQDFPKDIPQKLPQEFHQEY  118


>WP_163302419.1 zinc-ribbon domain-containing protein [Desulfovibrio sulfodismutans]NDY57374.1 
hypothetical protein [Desulfovibrio sulfodismutans]QLA12445.1 
hypothetical protein GD606_09235 [Desulfovibrio 
sulfodismutans DSM 3696]
Length=559

 Score = 46.7 bits (107),  Expect = 0.025, Method: Composition-based stats.
 Identities = 16/70 (23%), Positives = 24/70 (34%), Gaps = 0/70 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP CG  R+ P  K+PA  + A CP+C     F   E     ++             
Sbjct  2   RITCPECGFSRDVPDGKVPAATAVATCPKCRHRFRFRDFEDGFDPSSSAYEQPASEPRFS  61

Query  63  RIPSDRLEIQ  72
               D    +
Sbjct  62  GEERDGYAPR  71


>HGU84329.1 hypothetical protein [Gemmatimonadetes bacterium]
Length=150

 Score = 44.0 bits (100),  Expect = 0.025, Method: Composition-based stats.
 Identities = 13/34 (38%), Positives = 16/34 (47%), Gaps = 1/34 (3%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           V+C  CGA      +K+PA    ARC  C   L 
Sbjct  3   VQC-RCGATYRIDPAKVPAGGVRARCAYCTAVLW  35


>NBB94226.1 hypothetical protein [Planctomycetes bacterium]
Length=691

 Score = 46.7 bits (107),  Expect = 0.025, Method: Composition-based stats.
 Identities = 36/368 (10%), Positives = 92/368 (25%), Gaps = 16/368 (4%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFA--PIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
              +      L  I  L   LA       +   +    +   +  +    +  +   ++  
Sbjct  333  FRWMSLQHWLFAIIFLVWTLAIVACFGGAISRMTALHFAREEKISIGQGLRFSRGKFLSF  392

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
              + +     I +    + L   +   +  +G   + L+  + ++ G  +  +   +   
Sbjct  393  FSAPLVPIAVIVLTGLLISLGGFLFSLVPWIGPILMGLLFGLALLLGIGIAFM--AIGLA  450

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
              +      +A +      A+ +S   V    +               ++ +   I Y+ 
Sbjct  451  AGWPLMYPTIAVEGSDSFDAISRSFSYVFARPFR--------AAFYGVVAAVYGTITYLF  502

Query  290  E-AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLL  348
                     L +  F  L ++ +Y D                 L    + +    +    
Sbjct  503  VRLFAYLTLLAVHAFVKLGFWDLYFDYG---ESLSPNADMIDVLWHRPSFWDLQTVNWAA  559

Query  349  LVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRK  408
            L  +         +  A              T           +R+ + D   +  ++ +
Sbjct  560  LSGVGWVAAILVGVWVAAVVGAVLGYVWTYFTAASTSIYFVLRRRVDATDLDDVYIEEEQ  619

Query  409  TTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDA  468
                                          K E    P+   A+K     E D   D+  
Sbjct  620  EPLAPAEESVVEGDVEGDGADKADEEAEEEKDEGQSEPSEESAEKDQPEDEADGAADESD  679

Query  469  RDLYDRQH  476
            +D    + 
Sbjct  680  KDEASDED  687


>PZR07111.1 hypothetical protein DI536_29080 [Archangium gephyra]
Length=791

 Score = 46.7 bits (107),  Expect = 0.025, Method: Composition-based stats.
 Identities = 6/33 (18%), Positives = 10/33 (30%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           + C  C        S +P +    +C  C    
Sbjct  3   ITCEKCSTTYVLDDSLIPPQGVPVQCTRCAHVF  35


>NBX66208.1 hypothetical protein [Proteobacteria bacterium]
Length=226

 Score = 45.1 bits (103),  Expect = 0.025, Method: Composition-based stats.
 Identities = 11/66 (17%), Positives = 15/66 (23%), Gaps = 0/66 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            V CP C  E + P   L  +    RC +C      +P                      
Sbjct  2   KVVCPECAHETDVPDDVLGDRGKRVRCAQCKHIWFQEPVVEATDFGKFRRFDESLDIEPI  61

Query  63  RIPSDR  68
                 
Sbjct  62  PQSVHP  67


>MBI3550627.1 hypothetical protein [Elusimicrobia bacterium]
Length=318

 Score = 45.9 bits (105),  Expect = 0.026, Method: Composition-based stats.
 Identities = 21/161 (13%), Positives = 43/161 (27%), Gaps = 12/161 (7%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL---  219
                I+        S+   +        + +          T   +    V+    L   
Sbjct  104  GTWLIVFFNIAFLLSVRRLLNGQPASFSQGLYAAWERRAEITKWAVFSATVMMVIELARN  163

Query  220  ---------LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
                     LL    L + +  F    VLA +    + A+ +S  L    W       V 
Sbjct  164  VSENWLTRRLLGAINLGWTLATFLAIPVLAVEGGAPIPAIRRSAELFRRTWGQTLAAGVG  223

Query  271  LLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
            L  ++  L      +P +        +L+     +    ++
Sbjct  224  LGALNAILFVALGAVPAIALFIVGINALVGNGGGWSMLAMM  264


>MAB62548.1 hypothetical protein [Marinovum sp.]MBU12724.1 hypothetical protein 
[Rhodobacteraceae bacterium]OAH08885.1 hypothetical protein 
pfor_3c0202 [Rhodobacteraceae bacterium SB2]HAW14414.1 
hypothetical protein [Cellvibrionales bacterium]MBF20845.1 
hypothetical protein [Marinovum sp.]
Length=89

 Score = 42.4 bits (96),  Expect = 0.026, Method: Composition-based stats.
 Identities = 14/59 (24%), Positives = 20/59 (34%), Gaps = 0/59 (0%)

Query  6   CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
           CP CG+    P  K+P+K     C  C      +   ++R     N A        R  
Sbjct  5   CPKCGSGYRIPKDKIPSKNRVVMCSSCNHMWKQNFVPARRNYAIKNQAAQYAPVPAREP  63


>NOR85605.1 hypothetical protein [archaeon]
Length=91

 Score = 42.4 bits (96),  Expect = 0.026, Method: Composition-based stats.
 Identities = 11/87 (13%), Positives = 31/87 (36%), Gaps = 8/87 (9%)

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-PYVGEAAN------  293
             +N   + A++++  +   + +  +   ++L+V+ +  + L   I   +           
Sbjct  3    LENKSLIDAIKRAYEITKNNKFNAWIFVIILMVLLMIFNMLIGLITNALSSIIGYYTFGI  62

Query  294  -LAFSLLLTPFSFLYYYLIYSDLKANY  319
             +   L    F+ L     Y D+    
Sbjct  63   DMIVMLFTGTFASLATNFFYFDILKTK  89


>GAB06159.1 hypothetical protein GOAMR_48_00700 [Gordonia amarae NBRC 15530]
Length=267

 Score = 45.5 bits (104),  Expect = 0.026, Method: Composition-based stats.
 Identities = 13/91 (14%), Positives = 37/91 (41%), Gaps = 0/91 (0%)

Query  198  RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLV  257
                       L+ L+    S L+I+  +   + F     ++A +  G ++A+ +S  L 
Sbjct  113  PLWILADSGNSLVALLGYMVSRLVILVAVALSIRFALVGPIIALERCGPVRAVRRSWQLT  172

Query  258  SGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
            + ++ ++    +L  V++  +      +  +
Sbjct  173  ADNYLSLAAVLMLCGVVAAAILVPMGLLCVL  203


>HIH43664.1 hypothetical protein [Candidatus Methanoperedenaceae archaeon]
Length=334

 Score = 45.9 bits (105),  Expect = 0.026, Method: Composition-based stats.
 Identities = 11/74 (15%), Positives = 31/74 (42%), Gaps = 4/74 (5%)

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
             +  +  G   ++++S  +V  +     G  +L L+  L +S +     ++ +     F+
Sbjct  212  CIVIEGRGIFASMKRSAEIVRTNA----GSTLLFLLFELVISIVFGIFYFLADMVAGLFN  267

Query  298  LLLTPFSFLYYYLI  311
              ++PF  +    +
Sbjct  268  SDISPFFTMVSAFV  281


>HCF62182.1 hypothetical protein [Myxococcales bacterium]
Length=196

 Score = 44.8 bits (102),  Expect = 0.026, Method: Composition-based stats.
 Identities = 9/33 (27%), Positives = 11/33 (33%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           VRC  C  E      K+     + RC  C    
Sbjct  3   VRCDRCRTEYEFDDEKITEAGVTVRCTVCGHVF  35


>CCG41759.1 conserved hypothetical protein [Phaeospirillum molischianum DSM 
120]
Length=178

 Score = 44.4 bits (101),  Expect = 0.026, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 15/36 (42%), Gaps = 0/36 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
            + CP+CG   + P + L  +    +C +C      
Sbjct  2   LITCPNCGTNFSIPDAALGTEGRKLKCAKCEHKWFQ  37


>WP_013321181.1 hypothetical protein [Gloeothece verrucosa]ADN13074.1 conserved 
hypothetical protein [Gloeothece verrucosa PCC 7822]
Length=311

 Score = 45.9 bits (105),  Expect = 0.027, Method: Composition-based stats.
 Identities = 23/189 (12%), Positives = 50/189 (26%), Gaps = 3/189 (2%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W L     + L+                + L        P+        L       +L 
Sbjct  71   WFLMVLMPFWLIFFAYCAAQFLTNTGVISRLAFSLLINKPETVRQARQELKPRRLQFILN  130

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
                    F       +       +    + +  L L++  +      +           
Sbjct  131  HFLYLILFFFIFRVWQIIQVMVFYVPASFIPNPKLQLMIQWVGYLFFVIGFSWFY---AR  187

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
             FF    +  + N+    A+ +S  L SG  W+I     L++++++ L  L+        
Sbjct  188  LFFPEVPLAIEKNLQLGDAIIRSWKLTSGFAWSILFIIFLVILLTMPLYLLSGIPLIFAA  247

Query  291  AANLAFSLL  299
               +     
Sbjct  248  VYAMTGLFN  256


>MBH23226.1 hypothetical protein [Myxococcales bacterium]
Length=430

 Score = 46.3 bits (106),  Expect = 0.027, Method: Composition-based stats.
 Identities = 11/54 (20%), Positives = 16/54 (30%), Gaps = 1/54 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIAT  54
           M  V C  C ++      KLP    + +CP C       P       +      
Sbjct  1   MI-VVCERCSSKYRINEDKLPTGGGNIKCPSCQHVFFVAPPNQAPPSSQLPRRP  53


>XP_007224075.2 uncharacterized protein LOC18792349 [Prunus persica]
Length=438

 Score = 46.3 bits (106),  Expect = 0.027, Method: Composition-based stats.
 Identities = 27/155 (17%), Positives = 56/155 (36%), Gaps = 10/155 (6%)

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +   ++I +     GL  ++ L L  +       IL   ++     L     L   V + 
Sbjct  258  VVSWLYIRLLDLGYGLLVALLLLLLVLIFNITFTILAFSMILLVVALFFRTYLD--VVWN  315

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY------  287
                V   ++I G++AL K+  LV G     F   + +       + L            
Sbjct  316  LALVVSVLEDICGIEALGKAARLVKGSKLRGFFLNLFVAFSMSMFNGLVTIGTVAFPENA  375

Query  288  --VGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
              +    + + + L+T F ++ Y ++Y + K  + 
Sbjct  376  RMIPSLFSYSITCLITMFLYMSYTVLYYECKKTHG  410


>WP_085440340.1 hypothetical protein [Magnetofaba australis]
Length=85

 Score = 42.4 bits (96),  Expect = 0.027, Method: Composition-based stats.
 Identities = 11/65 (17%), Positives = 16/65 (25%), Gaps = 0/65 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V CP CG+E      K P      RC        +           + +        
Sbjct  3   MIPVTCPSCGSENTVKYGKRPNGTLRFRCDNPECKRRYFQLTYAYKGNEEGMGEKILDME  62

Query  61  QRRIP  65
           +    
Sbjct  63  KWPHY  67


>PYT06293.1 hypothetical protein DMF49_11630 [Acidobacteria bacterium]
Length=624

 Score = 46.3 bits (106),  Expect = 0.027, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 12/36 (33%), Gaps = 0/36 (0%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
              + CP CG   +     +P     A C  C +   
Sbjct  144  VVLSCPACGKRFSVQDELVPRAGRRAHCASCGEDFW  179


>PID73240.1 hypothetical protein CSB33_04800 [Desulfobacterales bacterium]
Length=1282

 Score = 46.7 bits (107),  Expect = 0.027, Method: Composition-based stats.
 Identities = 12/36 (33%), Positives = 15/36 (42%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V C  C A+  T  SK+    +  RC  C Q  
Sbjct  1   MI-VTCDKCQAQFETDESKISPSGTRMRCAGCNQVF  35


>TVP89066.1 hypothetical protein EA347_04860 [Thioalkalivibrio sp.]
Length=257

 Score = 45.5 bits (104),  Expect = 0.027, Method: Composition-based stats.
 Identities = 25/143 (17%), Positives = 51/143 (36%), Gaps = 3/143 (2%)

Query  145  ATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT  204
                        +A ++A +       +         +     G F   +  L  V  + 
Sbjct  58   LIEAGELPGPAFYAAVIAGLVGNAYFWAVALLRADGVLAGGGEGSFSRARGMLLPVLGYA  117

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
            L+     L+V  G +LL +PG+   V       ++   + G   AL++S +LV G WW  
Sbjct  118  LV---YALLVMLGLVLLALPGIFLAVLLAPGVMLIVLRDSGVFAALKRSAVLVWGSWWFS  174

Query  265  FGRFVLLLVISLTLSFLTARIPY  287
             G  + + ++++    +      
Sbjct  175  LGILLTVTLVAVIPVSIAEAFLV  197


>NOY91010.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=314

 Score = 45.9 bits (105),  Expect = 0.027, Method: Composition-based stats.
 Identities = 31/203 (15%), Positives = 62/203 (31%), Gaps = 30/203 (15%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               +Y++  + ++ P     L          N    W+  +A   +  +G + +T   F 
Sbjct  50   HPLLYVIVGLGSYLPGGVIDLAYTLNGQVSTNPLGFWSTRIADTIFTSIGFAMLTHLTFD  109

Query  181  YICKTDVGLFRSMKLGLR--------------------------HVGSFTLLLILLILVV  214
             +   DV L   MK GL                            +   +   ++ +L+ 
Sbjct  110  VVEGRDVSLGAYMKPGLSDAFSAFSVQLQSGILIVLSAIPGAILVLIGLSGGGLVAVLLD  169

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLAD----DNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
              G +L+ +  +   + +        D          QAL +S  LV      +F  F  
Sbjct  170  VLGGVLVFVASVYVYLGYSLALPAYIDSSRSHRKPPSQALRRSWSLVKHRRGRLFLAFAG  229

Query  271  LLVISLTLSFLTARIPYVGEAAN  293
            L ++   ++ L      V     
Sbjct  230  LNLVLFVVACLGLIPVGVAGVFG  252


>WP_193388349.1 zinc-ribbon domain-containing protein, partial [Anaeromyxobacter 
sp. PSR-1]
Length=221

 Score = 45.1 bits (103),  Expect = 0.027, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M    C HC A+      K+  + +  RC  C    
Sbjct  1   MIA-HCTHCQAKFRIADEKIGPRGAKVRCSRCKTVF  35


>WP_123779737.1 DUF975 family protein [Aerococcus sp. SJQ22]RPA60774.1 DUF975 
family protein [Aerococcus sp. SJQ22]
Length=263

 Score = 45.5 bits (104),  Expect = 0.027, Method: Composition-based stats.
 Identities = 25/255 (10%), Positives = 68/255 (27%), Gaps = 12/255 (5%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
             + +   L FA ++  L L  A             +L   V +    + ++   M     
Sbjct  18   AWFIIFTLIFAGLYWVLGLVQAAITASIFPVIIVFLLFNLVFFAGTEMVYLRSLMIKNNQ  77

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
            ++       ++        +     +  + +   SL+   P +   + +    +++ D  
Sbjct  78   QSLHFFRDMVRPFFNDALRYGWANFIKGVYLVLWSLVFFFPVIYKWMAYALSNFLIIDFP  137

Query  244  I-GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            +    +A+ +SR L+ G         +  +   + +                     + P
Sbjct  138  LLSTNEAITESRRLMRGRKGRFIWLHIRFIGWFILI-----------PITLGLAYFYVKP  186

Query  303  FSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQL  362
            +  L     Y         P+           +       +      V ++++       
Sbjct  187  YYTLALINFYEQALEEDGYPEKIRRFEAGDSASTNRHRQEVKRPRRKVYITKRKPQEPTH  246

Query  363  LSAGKDIQQRLGTQP  377
                 D Q    +  
Sbjct  247  TYKHTDYQYDDDSWD  261


>WP_161779186.1 hypothetical protein, partial [Proteus sp. G2675]NBL95514.1 hypothetical 
protein [Proteus sp. G2675]
Length=204

 Score = 44.8 bits (102),  Expect = 0.028, Method: Composition-based stats.
 Identities = 20/142 (14%), Positives = 45/142 (32%), Gaps = 0/142 (0%)

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
            G  + L     LV   ++   G  ++  +++L +S +   IP +G   +   S +L    
Sbjct  27   GATEWLGIGWDLVKERFFMWVGAVIIYFLVNLIISSILGFIPLIGPIISPFISAVLVAGL  86

Query  305  FLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLS  364
             +  +  Y   + N          +++L +  A    +LI     +     + S    ++
Sbjct  87   MMIAHQQYETGQLNLDNLFSGFQNKRYLSIMGAYGISVLIILAGFILALLISGSLLMEVA  146

Query  365  AGKDIQQRLGTQPQQTPDLNRS  386
                             D    
Sbjct  147  QAALNDAPTYYIQGLLADNLTP  168


>KKB96300.1 hypothetical protein SZ25_00622 [Candidatus Arcanobacter lacustris]
Length=224

 Score = 45.1 bits (103),  Expect = 0.028, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + C  C A    P S +  K    +C +C    
Sbjct  1   MIII-CEKCTARYAIPDSAISEKGRLVKCAKCAHQW  35


>TAF43635.1 hypothetical protein EAZ64_08585 [Sphingobacteriales bacterium]TAF79035.1 
hypothetical protein EAZ51_08370 [Sphingobacteriales 
bacterium]
Length=250

 Score = 45.5 bits (104),  Expect = 0.028, Method: Composition-based stats.
 Identities = 23/159 (14%), Positives = 45/159 (28%), Gaps = 19/159 (12%)

Query  152  NQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLI  211
             +    +    +    +  L     S    I    + +          V +  L+  L  
Sbjct  91   FKTIFPSWKNISSFISITILLGFVVSTITLIYTQLLRINTFKLFINNAVETPYLMAALAF  150

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
                   L ++         F F    + DDN   L +L +SR L  G+   I    +L+
Sbjct  151  FAFIIFVLAVM--------RFMFFPCFIVDDNSSSLHSLRQSRELTYGNITHILTILLLV  202

Query  272  LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
            +                G        ++  PF+ +   +
Sbjct  203  IGFIAV-----------GFLCFGVGVIVTYPFTNIVLVV  230


>MBF0634602.1 zinc-ribbon domain-containing protein [Nitrospinae bacterium]
Length=51

 Score = 41.3 bits (93),  Expect = 0.028, Method: Composition-based stats.
 Identities = 8/44 (18%), Positives = 17/44 (39%), Gaps = 1/44 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           M  ++CP C +      + +  + + ARC +C           +
Sbjct  1   MV-IKCPRCSSRYKIDGAGVADEGTYARCRKCENVFFVRKRSHE  43


>MBI5681715.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=353

 Score = 45.9 bits (105),  Expect = 0.028, Method: Composition-based stats.
 Identities = 11/89 (12%), Positives = 18/89 (20%), Gaps = 1/89 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V+C  C  +     S +  K    RC +C       P                    
Sbjct  1   MI-VQCDVCQTKFRIDDSMVTEKGVRVRCTKCKNIFFVKPQTFDVPLPVPEPPKEEQTIS  59

Query  61  QRRIPSDRLEIQSKTVNCRRCNRSFCLQP  89
                   ++   K         +     
Sbjct  60  FNIDTKPDIQPTDKKEMPDDEKLAADWSM  88


>HIP24123.1 hypothetical protein [Rhodobacteraceae bacterium]
Length=285

 Score = 45.5 bits (104),  Expect = 0.028, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 13/38 (34%), Gaps = 0/38 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
            + CP C A      + +P +     C  C    +  P
Sbjct  2   RITCPKCKATYEVGEADIPDEGIEVECSACLNRWMQMP  39


>HHO51993.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=743

 Score = 46.3 bits (106),  Expect = 0.028, Method: Composition-based stats.
 Identities = 13/45 (29%), Positives = 18/45 (40%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M  V CP C  +     SK+  + +   CP C    +    ES R
Sbjct  1   MI-VTCPACSQQYKLRESKIQGRGAKVTCPRCGHRFVIYRTESDR  44


>PIP53302.1 hypothetical protein COX08_01740, partial [Candidatus Beckwithbacteria 
bacterium CG23_combo_of_CG06-09_8_20_14_all_34_8]
Length=91

 Score = 42.4 bits (96),  Expect = 0.028, Method: Composition-based stats.
 Identities = 13/72 (18%), Positives = 30/72 (42%), Gaps = 2/72 (3%)

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL--VISLTLSFLTARIPYVGEAAN  293
             Y++AD  +    A++ S  +  G  W + G   + +  +I+  ++FL   I  +     
Sbjct  1    PYLIADRKLKVNDAMKLSATMTKGVKWQLIGYGFVQIGLIIAGAIAFLIGLIAVLPTLWL  60

Query  294  LAFSLLLTPFSF  305
             + ++     S 
Sbjct  61   ASLAIYQQLLSC  72


>WP_125119873.1 DUF975 family protein [Intestinibaculum porci]BBH27114.1 hypothetical 
protein SG0102_20480 [Intestinibaculum porci]
Length=254

 Score = 45.5 bits (104),  Expect = 0.028, Method: Composition-based stats.
 Identities = 21/148 (14%), Positives = 47/148 (32%), Gaps = 14/148 (9%)

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
             L   + + + +  S   L    ++ +G   +  ++ G+          Y+L D  +   
Sbjct  107  TLIPVIAMIIVYTMSGGFLTSNNMISMGLLGVAAVVLGIYVAFRLLLTPYLLEDYGMKKS  166

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA--------------N  293
            +A+  SR L+ GH +      +  L   +  + +T  + YV                   
Sbjct  167  EAINASRELMRGHVFDYVKLVLSFLPFIIIQALITYGLTYVLMLGLPAFVVEIIVSVVSL  226

Query  294  LAFSLLLTPFSFLYYYLIYSDLKANYRG  321
            +       P   +   + Y +L      
Sbjct  227  IIGVYTYLPLLTVSTAVFYKELAYRAYH  254


>HHA47197.1 hypothetical protein [Armatimonadetes bacterium]
Length=290

 Score = 45.5 bits (104),  Expect = 0.028, Method: Composition-based stats.
 Identities = 26/244 (11%), Positives = 68/244 (28%), Gaps = 27/244 (11%)

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI  167
                 L        +  +      A     S       T +    +      +   +  I
Sbjct  51   HMYLALLGFPLLLAMAAFTPVFTAAVIFAISDRYFNRPTTIEACYRRAFSGSIYWNMLGI  110

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
             +       +  +      + +   M +G R     +  +  +I  +  G+L        
Sbjct  111  GILQGVAVAACMMVAAAPAMVVVVVMMIGSRQPTFPSTAMAAMIAFMLVGALATAPLW--  168

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
              V        +  +  G   AL +S  L+ G   ++F   +++++  + ++ + + +  
Sbjct  169  --VRLMLAGPAMVVERQGASSALGRSWTLIKG---SVFRSTLVIVIAYVVVNTVPSVLGG  223

Query  288  VGEAANL--------------------AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
            +                             ++L P   +   L+Y DL+    G     +
Sbjct  224  LLAPLTGHANYYDVPVLYRVTEQGIVLLARVMLAPLLVIAETLLYFDLRVRKEGLDLQLM  283

Query  328  KRQW  331
              + 
Sbjct  284  AGEM  287


>PIE32553.1 hypothetical protein CSA56_15230 [candidate division KSB3 bacterium]
Length=448

 Score = 46.3 bits (106),  Expect = 0.029, Method: Composition-based stats.
 Identities = 10/74 (14%), Positives = 19/74 (26%), Gaps = 0/74 (0%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
               C HCG       +K+  + +  +C +C   +     E           T   C   +
Sbjct  189  KFCCDHCGEIYWVDDAKVSPQGAKTKCVKCQHIITVKRREKAEPVQPKAQQTAIPCPHCQ  248

Query  63   RIPSDRLEIQSKTV  76
                   +      
Sbjct  249  YENRAGAQFCVMCQ  262


>WP_095975748.1 zinc-ribbon domain-containing protein [Melittangium boletus]
Length=81

 Score = 42.1 bits (95),  Expect = 0.029, Method: Composition-based stats.
 Identities = 11/80 (14%), Positives = 17/80 (21%), Gaps = 0/80 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + C  C A        +  K   A+CP C    +    +S            P      
Sbjct  2   RISCQKCAAAYAIDDRVITPKGVRAQCPRCRHLQLVKREDSPAPAAPPLRRPNPRWPPNP  61

Query  63  RIPSDRLEIQSKTVNCRRCN  82
                       +    R  
Sbjct  62  PPRRPSPRPPRPSPPHWRAP  81


>MBI5527366.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=892

 Score = 46.3 bits (106),  Expect = 0.029, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V+C  C  +   P  K+    +  RC +C    
Sbjct  1   MI-VQCQKCKTKFKLPDEKIQPGGTKVRCGKCSTVF  35


>ELU13346.1 hypothetical protein CAPTEDRAFT_219079 [Capitella teleta]
Length=1222

 Score = 46.7 bits (107),  Expect = 0.029, Method: Composition-based stats.
 Identities = 38/551 (7%), Positives = 121/551 (22%), Gaps = 30/551 (5%)

Query  12    ERNTPSSKLPAKKSSARCPE------CCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
             E    ++        A C           +       +     + +      C       
Sbjct  673   EYRVYTANKKTAAKRASCRYFDENVETWTSGNDVCTVTNNLALSIDTHVDCECKHMSLYA  732

Query  66    SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIY  125
              +   ++ + V         C         +                         +   
Sbjct  733   VESNIVRPEIVGFDLWFFVCCFICMIGLALAIVCHHICLTYHTMFAASLHMHMCFAVLAA  792

Query  126   LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
              +  V+      + LL   A   N +            +      ++       + +   
Sbjct  793   EICYVIDVYLSPNHLLSIEADQNNYRCTVMGLIFHYFFLTQFSWMMTQAINFYKVLVLND  852

Query  186   DVGLFRSMKLGLRHVGSFTLLLILL--ILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
             +    + +       G   +  ++      +G    LL +  ++  V + +  ++  D  
Sbjct  853   EHTERKYVLYFFIGWGECEMSHMICAEPSPIGLPVCLLALFYIVTYVIYRYVSFLPED--  910

Query  244   IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS-FLTARIPYVGEAANLAFSLLLTP  302
                     +    V+ +    F       +  + L   L   +  +        +     
Sbjct  911   --------RIYSDVTQNEDICFVNNAFAALCGVILPCLLFLMVTAIAFIQAFQVTAQWQA  962

Query  303   FSFLYYYLIYSDLKA----NYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLS  358
             +  +Y      +        +       +          ++  +L     +       ++
Sbjct  963   YDDVYRGRYNINEIRTLLLFWAFIVITWLWAGLHMAYGQLWMMVLFCIFNVALGVYAFVA  1022

Query  359   AEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLG  418
                L        +   +   +           P   +         ++            
Sbjct  1023  YAVLRHPCLPCFRPEKSVAYEASGHADGSEVGPMYTAQPSLAATPLRRPPPPPSLISMPS  1082

Query  419   PVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSF  478
              +    + +   D++P               +  K S            + ++Y      
Sbjct  1083  HMNAIHEHWAPGDESPSQTYTGASIQPQRGGIRVKRSLAPPSSGPATGGSSNVYVIPAHH  1142

Query  479   EHPAFHWVGINQTDENDLFSGIRS-----IYLRQGTQAEQVHSILGKLELTLPLAIESLQ  533
                A +   +    ++ +    R      + L+ G+    +HS     +  +   +E + 
Sbjct  1143  HPHASYEPSVTSLPQHHVSDDSRDFDDLILALKTGSSH--LHSNYAPSDYAVDREVEEVH  1200

Query  534   LTRNDIGKTLQ  544
                    + L 
Sbjct  1201  FQNPAPSEYLD  1211


>MBC8334379.1 hypothetical protein [Anaerolineales bacterium]
Length=297

 Score = 45.5 bits (104),  Expect = 0.029, Method: Composition-based stats.
 Identities = 20/193 (10%), Positives = 52/193 (27%), Gaps = 5/193 (3%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
                F       +      +V+ F  + S +                  +          
Sbjct  49   QLHDFLNSLDQPVPPVGFFVVIGFIFLASIISALLTPLGWTAIAKGTVDLDTGKSGLTFN  108

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT-----LLLILLILVVGGGSLLLIIP  224
             L   T           + ++         +         L   L  L +    LL+   
Sbjct  109  SLLSDTMPFLGRAFGILLLIWFGYFFLFGGIMFLMAMFGILTAGLGFLCMLPLMLLIYPL  168

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
             L+     F     +  +++   + L+    L+  ++W +     +L +  + + F++A 
Sbjct  169  MLVIMAGVFMSIAAVPAEDLPFGETLQLVWDLLKSNFWPLMLMSFILYMFQMVVGFISAI  228

Query  285  IPYVGEAANLAFS  297
              ++G+       
Sbjct  229  PMFIGQFLFPFLL  241


>WP_116391669.1 zinc-ribbon domain-containing protein [Parvularcula sp. SM1705]RFB05038.1 
hypothetical protein DX908_06880 [Parvularcula 
sp. SM1705]
Length=391

 Score = 45.9 bits (105),  Expect = 0.029, Method: Composition-based stats.
 Identities = 8/35 (23%), Positives = 12/35 (34%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            + CP C      P+S +P +     C  C     
Sbjct  2   LISCPACPKMYRVPASAIPPEGREVSCSACGTVWY  36


>HHU06987.1 hypothetical protein [Clostridiaceae bacterium]
Length=243

 Score = 45.1 bits (103),  Expect = 0.029, Method: Composition-based stats.
 Identities = 27/217 (12%), Positives = 64/217 (29%), Gaps = 1/217 (0%)

Query  86   CLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPA  145
              +           L     + A    LF      LL                 L L+PA
Sbjct  1    MKRFSIFNAYKIYDLEPERHVQAWFLTLFACYSAFLLSFNGQDPFSKILSELLYLFLEPA  60

Query  146  TWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTL  205
              L   +  +   + +  +      +  ++ S+       +     ++   ++ +    +
Sbjct  61   APLPKISSKFLLYLGIYALISFTGFILSLSYSITYINKTINKETRSALPDVIKKIFPLFI  120

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI-  264
              ++   +    S+ L+IP  +F   F++    +         AL++S  L       I 
Sbjct  121  FAVISSGIYVISSVFLMIPYFIFMSMFYYTPVEIVLGRRSLPDALKQSWRLTKKRKVFIA  180

Query  265  FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
             G   ++ ++    S +         A++L       
Sbjct  181  AGVISIVFLLRYFASAIAGSFASSTWASSLIIGFSYA  217


>WP_187281031.1 hypothetical protein [Nonomuraea sp. C10]
Length=595

 Score = 46.3 bits (106),  Expect = 0.029, Method: Composition-based stats.
 Identities = 31/300 (10%), Positives = 67/300 (22%), Gaps = 18/300 (6%)

Query  320  RGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQ  379
                    +       A          +                  G + +     + + 
Sbjct  146  EYEDASYREDGPEYRDAWSREDGPEHEISWNREDEPEYRDAWSREDGPEHEISWNREDEP  205

Query  380  TPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLK  439
              +   S  +EP    +A+ +           E            D    +D        
Sbjct  206  EYEDAWSRDDEPNYEEAANREDEPEYDESWGREDEPEYEDAWSREDGPDHEDSWDRQGPA  265

Query  440  LELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSG  499
                    ++  Q      + + V      +        +  A +    N      L S 
Sbjct  266  DREPMIAEVADPQDQPEPSDAEPVGTQAENEAEKAARPEDRSANNAQATNNYVSVILPSY  325

Query  500  IRSIY--------LRQGTQAEQVHSILGK----LELTLPLAIESLQLTRNDIGKTLQIGG  547
                         L    Q     +  G+    L +  P A       ++  G+ ++   
Sbjct  326  GALFTPWLATVDQLNADRQNAFGQNAFGQTMEGLYVKGPDAEGQDVEAQDVEGQDVEAQD  385

Query  548  KQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGN  607
             +      G +A      G   + LNV   ++  + +       Q      +  Q  +G 
Sbjct  386  VE------GQDAEAQDVEGPDAEDLNVEGPDAEGQDVEAQDVEGQDVEGQDAEAQDVEGP  439


>HCB52327.1 hypothetical protein [Rhodobacter sp.]
Length=304

 Score = 45.5 bits (104),  Expect = 0.030, Method: Composition-based stats.
 Identities = 8/43 (19%), Positives = 14/43 (33%), Gaps = 0/43 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
            + CP+C A        +       +C  C  T +    E  +
Sbjct  2   QIVCPNCEAHYEVGYDSIGDAGRQVQCSNCGHTWLAMRQEEDQ  44


>HAO22669.1 hypothetical protein [Desulfobacteraceae bacterium]
Length=131

 Score = 43.6 bits (99),  Expect = 0.030, Method: Composition-based stats.
 Identities = 10/41 (24%), Positives = 13/41 (32%), Gaps = 1/41 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
           M  V+C  C    N     +    S+ RC  C       P 
Sbjct  1   MI-VKCEKCQTAYNLKDDFVRPGGSNLRCSTCRHVFKIYPP  40


>HIJ41512.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=66

 Score = 41.7 bits (94),  Expect = 0.030, Method: Composition-based stats.
 Identities = 10/54 (19%), Positives = 14/54 (26%), Gaps = 3/54 (6%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIAT  54
           M    CP CGA  N   S++P K          +       +            
Sbjct  1   MI---CPKCGAVYNVDDSEIPDKGVRKEEIGSWRNFKCFYLQFFLVYHGLCYYA  51


>ABB37917.1 MJ0042 family finger-like protein [Desulfovibrio alaskensis G20]MBG0772930.1 
zinc-ribbon domain-containing protein [Desulfovibrio 
alaskensis]
Length=471

 Score = 45.9 bits (105),  Expect = 0.031, Method: Composition-based stats.
 Identities = 16/64 (25%), Positives = 22/64 (34%), Gaps = 0/64 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            +RCP C   R+   SK+PA  + A CP+C     F     Q     +           R
Sbjct  2   QIRCPECQFTRHVDESKIPASAALATCPKCRHKFRFRDFSQQDEFVLEPDTETAGRHQDR  61

Query  63  RIPS  66
               
Sbjct  62  PPLH  65


>WP_165356614.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Sphingosinicella sp. BN140058]
Length=215

 Score = 44.8 bits (102),  Expect = 0.031, Method: Composition-based stats.
 Identities = 30/145 (21%), Positives = 54/145 (37%), Gaps = 2/145 (1%)

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +      Y     +   R             +L ++  L +  G +LL+IPGL     +F
Sbjct  55   LAAFAAQYYLTRRLIDQRWAAGERSGALPVFVLQLVSTLGILLGLVLLVIPGLYLLTRWF  114

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
                +L   N G   AL +S L   GH  A+FG  V++ +  L LS + A + +      
Sbjct  115  AAVPLLISRNQGTGDALRESWLSTEGHGLAVFGAMVIVYIP-LLLSLVVAFL-FDEALIG  172

Query  294  LAFSLLLTPFSFLYYYLIYSDLKAN  318
               ++ +     +   ++     A 
Sbjct  173  GFGAIGVATNFMISLAMVAGWCAAI  197


>QNN22952.1 hypothetical protein HED60_11965 [Planctomycetales bacterium 
zrk34]
Length=311

 Score = 45.5 bits (104),  Expect = 0.031, Method: Composition-based stats.
 Identities = 30/319 (9%), Positives = 65/319 (20%), Gaps = 17/319 (5%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDN------IAT  54
            MP + C  CG +               +C       I DP +                  
Sbjct  4    MPNIVCEGCGKKYRWKPEL---AGKKVKCKCGQVLEIPDPPQEDDPNDDSFGAIELADVG  60

Query  55   CPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELF  114
                         + +  +  +   +                     S +          
Sbjct  61   ESSHAKPAPPKPSKPDKPASPIGEVKIAGGDIEPVGLGGGEDDEADDSGNCPSCGQPLAI  120

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
                    G                         + +    +   +   +  I LG++ M
Sbjct  121  DAVLCIQCGFNRNTGKKIGTVGEMEGDEAGDGATSHKLHQLKQWKIPVAMVVIGLGVNLM  180

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
                       +V    ++   L       ++L++    +    L      +   +    
Sbjct  181  FVMSQGAEEGVEVNPAVTLIGMLIG-TGLGVVLMVGAAFLAAPLLDTTFGDIRTAILKLA  239

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
              YV     +  + A      L       I G  VL       L +L     +      +
Sbjct  240  AVYVTPGVLMSLVSAFMGDWQL-------ILGLPVLWGSYFGLLVWLFDFDFFEAAVFAI  292

Query  295  AFSLLLTPFSFLYYYLIYS  313
               L+          ++  
Sbjct  293  IGWLIKYWIIGFILAMLLY  311


>NLB29755.1 DUF975 family protein [Clostridiales bacterium]
Length=221

 Score = 44.8 bits (102),  Expect = 0.031, Method: Composition-based stats.
 Identities = 22/154 (14%), Positives = 55/154 (36%), Gaps = 7/154 (5%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +      +S           + ++   RS+  G         L I++ +++    LL 
Sbjct  57   IALRLCATVVSVGFAWYCFRAARGEMPETRSIFDGFGSFFKIIWLHIIMSVLIAVQLLLF  116

Query  222  IIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            I+PG++  + +    Y++ +       + +++S  ++ G+    F   +  L   L  + 
Sbjct  117  IVPGIIAAIRYSQATYIMYEHPEYSAWRCIKESGRIMRGYKKEYFFLLLSFLGWLLIGAM  176

Query  281  LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
            L+  IP           + +  +  L   L +  
Sbjct  177  LSLYIPVP------VLDIWINIYWGLAAALFHVH  204


>WP_154431035.1 GGDEF domain-containing protein [Roseburia sp. MUC/MUC-530-WT-4D]MST76078.1 
GGDEF domain-containing protein [Roseburia sp. 
MUC/MUC-530-WT-4D]
Length=713

 Score = 46.3 bits (106),  Expect = 0.032, Method: Composition-based stats.
 Identities = 33/391 (8%), Positives = 90/391 (23%), Gaps = 7/391 (2%)

Query  93   FRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQN  152
                      I  +   +     +     + ++L+  VL F    + +        N   
Sbjct  13   MDMFFCLFCLIMFVSIKANNPKQKSMRMFVRLFLIATVLFFGESLAYIFRGNLGSFNILV  72

Query  153  QNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL  212
                  ++ A    +         S+F+       G    +      +  F +++ L   
Sbjct  73   TRIANLMVFAMNIAMANTYVRYVSSVFVEKGAEVSGNSVKIANIFSCINIFIVVVNLFYP  132

Query  213  VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
             +                 +     V+    IG   A++  + L    + ++     + +
Sbjct  133  WMYYFDEANYYHRNNSWYVYTLISLVVIF--IGAGMAIKYRKYLEKRSFISMMLFSFIPI  190

Query  273  VISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWL  332
            + ++  SF+           NL   +        Y Y    +   +             +
Sbjct  191  IATVVQSFIYGF-----SITNLGLGIGSFVMFAAYMYDWSHNGDEHTNMINDSRFDAVIM  245

Query  333  PLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQ  392
             +   +   + I   +         ++E        +           P           
Sbjct  246  FIIMLLSMSVSIIACVNAIQQVTKENSEIQSRTIAQMVSAKIENEFIKPITVSQTISSDI  305

Query  393  RLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQ  452
             + +        +      +    L  +    D       +         +        +
Sbjct  306  DIRTYIEGKTREEADGVKDDITNRLVSIGNEFDYKMVFVVSDKTRAYYTYNGISRYLDVE  365

Query  453  KGSARIEIDKVLDDDARDLYDRQHSFEHPAF  483
              S  I     LD   R + +     ++   
Sbjct  366  NDSHDIWYKDYLDSGKRYIVNVDTDEDNNGN  396


>TDI61717.1 hypothetical protein E2O91_03075 [Alphaproteobacteria bacterium]
Length=251

 Score = 45.1 bits (103),  Expect = 0.032, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 11/38 (29%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  + CP C          +       RC  C +T   
Sbjct  1   MI-IGCPECETRFVVAPQAIGKDGRMVRCSRCSKTWFQ  37


>NVJ93742.1 zinc-ribbon domain-containing protein [Methylocystaceae bacterium]
Length=189

 Score = 44.4 bits (101),  Expect = 0.032, Method: Composition-based stats.
 Identities = 12/42 (29%), Positives = 18/42 (43%), Gaps = 1/42 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
           M  + CP+C A  + P + L     + RC +C       P E
Sbjct  1   MI-ISCPNCSAHYSVPIAALGENGRTLRCAKCSHAWDQMPYE  41


>RLN03787.1 hypothetical protein C2845_PM13G00480 [Panicum miliaceum]
Length=438

 Score = 45.9 bits (105),  Expect = 0.032, Method: Composition-based stats.
 Identities = 20/183 (11%), Positives = 45/183 (25%), Gaps = 10/183 (5%)

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
               L  L+     L  +   +   V +     V   ++  G  A+ KS+ L+ G      
Sbjct  256  GSGLAGLLAFLVLLAYLAGLVYLSVVWHLASVVSVLEDYKGFAAMRKSKDLIRGKLPTAT  315

Query  266  GRFVLLLVISLTLSFLTARIPY----------VGEAANLAFSLLLTPFSFLYYYLIYSDL  315
              FV L ++   +                   +     LA    +   + +   L+Y   
Sbjct  316  AIFVTLNLVFAFVELAFRAWVIKGGSSAPTRLILGVLALAALSCVVMLALVAQTLVYLVC  375

Query  316  KANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGT  375
            K+ +            L +    +  +    + +           +              
Sbjct  376  KSYHHESIDKAGISDHLEVYLGDYVPLKASDVQMEQFQVWFPLPCKSCPGFPLYTSTCDE  435

Query  376  QPQ  378
               
Sbjct  436  HSA  438


>TMQ51863.1 hypothetical protein E6K73_04420 [Candidatus Eisenbacteria bacterium]
Length=142

 Score = 43.6 bits (99),  Expect = 0.032, Method: Composition-based stats.
 Identities = 13/34 (38%), Positives = 14/34 (41%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            V CP CGA    P   L    +  RCP C Q  
Sbjct  29  EVSCPACGARYLLPGGLLGPWGAKVRCPGCGQRF  62


>WP_020887907.1 zinc-ribbon domain-containing protein [Desulfohalovibrio alkalitolerans]EPR31083.1 
Yip1 domain-containing protein [Desulfovibrio 
alkalitolerans DSM 16529]
Length=538

 Score = 46.3 bits (106),  Expect = 0.032, Method: Composition-based stats.
 Identities = 14/43 (33%), Positives = 19/43 (44%), Gaps = 0/43 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRT  46
           + CP CG  R+ P  K+PA    A CP+C     F     +  
Sbjct  3   ITCPECGFTRSVPDDKIPAGSVRATCPKCKTKFQFRSLPDEFP  45


>TMG17295.1 hypothetical protein E6H98_07720 [Chloroflexi bacterium]
Length=201

 Score = 44.8 bits (102),  Expect = 0.032, Method: Composition-based stats.
 Identities = 14/100 (14%), Positives = 29/100 (29%), Gaps = 1/100 (1%)

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
                +         +  +           +   +    R          L   V+ G   
Sbjct  95   FPIALLLYPFTAGALYRAATSLAAGNVETIGSVLGGTARRYFGIFGNRFLWGCVIAGSVF  154

Query  220  LLIIPGLLF-CVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
            ++ IP +++  V +      L  +  G +QAL +S  L  
Sbjct  155  IVTIPVVIWVLVRWAVSLPALFAEGAGPVQALGRSWNLTK  194


>GAK57210.1 hypothetical protein U27_04175 [Candidatus Vecturithrix granuli]
Length=328

 Score = 45.5 bits (104),  Expect = 0.032, Method: Composition-based stats.
 Identities = 15/70 (21%), Positives = 22/70 (31%), Gaps = 1/70 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M T+ CP C A+      K+  K S  RC +C           Q  + + +    P    
Sbjct  1   MATIICPACSAQYRISDEKISEK-SRLRCKKCGTVFRLHDTIKQADEFSPDHEAPPRQPS  59

Query  61  QRRIPSDRLE  70
                    E
Sbjct  60  LSEPVQPPSE  69


>MBF0283138.1 zinc-ribbon domain-containing protein [Magnetococcales bacterium]
Length=82

 Score = 42.1 bits (95),  Expect = 0.032, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 15/34 (44%), Gaps = 1/34 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M  ++C HC +      +K    K   +C +C Q
Sbjct  1   MV-IQCEHCASRFKVDETKYKPGKRRFKCAKCQQ  33


>QDS92289.1 hypothetical protein FF011L_10310 [Planctomycetes bacterium FF011L]
Length=367

 Score = 45.9 bits (105),  Expect = 0.032, Method: Composition-based stats.
 Identities = 38/344 (11%), Positives = 79/344 (23%), Gaps = 40/344 (12%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                CP C A    P   +    S ARCP C    +   +E+          +       
Sbjct  4    IQQVCPRCSATLAVPEDSV---GSQARCPSCNHLFVITASETPAPAPLYAPESTAPAHST  60

Query  62   -----------RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADS  110
                             R     +     +   +   Q   +   + + +          
Sbjct  61   ADPVLSAADTVPPPNDRRTNPDLENPYAPKGTTTHSPQRASQPFIATTVVADQYISATIY  120

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
                  +    +G  LL I +    +   L +                I +      L  
Sbjct  121  LFQQSWQPLVGVGAILLVINILNQVLSQVLSMAFQESGQWVFMLVNIVIQVLLSLAYLYV  180

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLL-----------------------  207
            +  +    F       +   +  ++     G   L                         
Sbjct  181  MIGLKQMGFDLARGRPIRFSQGFEVPPILFGQTILGYLVLGIPFLLLAAPFALLVLGVDV  240

Query  208  --ILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
               L  +   GG  + +    L  ++ +  QY+L D  +   +A +K+  + S +    F
Sbjct  241  PEALAWVAFLGGGFIALAAMTLMYLFLWPWQYLLIDRRLSLREAFQKAYEIASINKVNAF  300

Query  266  GRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
               +L + +          I            +           
Sbjct  301  LLLLLGMGLGT-AGMCMCCIGQAATLPATVLLICTAVLQMSGQA  343


>MBF6559520.1 zinc-ribbon domain-containing protein [Candidatus Binataceae 
bacterium]
Length=445

 Score = 45.9 bits (105),  Expect = 0.033, Method: Composition-based stats.
 Identities = 8/64 (13%), Positives = 16/64 (25%), Gaps = 0/64 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             ++C  C          LP    + +C  C      +P  ++  Q +            
Sbjct  3   IEIQCTSCHTRYRIDEQVLPDGTPTFKCSRCGHVFTLEPRAARDAQPSTPRGEAVRPTPA  62

Query  62  RRIP  65
               
Sbjct  63  PPAQ  66


>WP_131155613.1 hypothetical protein [Egibacter rhizosphaerae]QBI20618.1 hypothetical 
protein ER308_14315 [Egibacter rhizosphaerae]
Length=439

 Score = 45.9 bits (105),  Expect = 0.033, Method: Composition-based stats.
 Identities = 20/140 (14%), Positives = 49/140 (35%), Gaps = 8/140 (6%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
            + + +    +L  + +  +         V L R+++  +RH+G      ++   V     
Sbjct  240  LFVVSYLLAILAQAVLVVAFTDRFDGRPVDLSRAVRQVVRHLGPLARWALVDAAVGFVLD  299

Query  219  LLLIIPG--------LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
            ++             L +         ++A + I   +A+ +SR L+   W         
Sbjct  300  MVSERVNRTVGFAGKLAWHTLTVLVVPMIAVEGIPVREAIRRSRWLIGRVWSRSLVSGFG  359

Query  271  LLVISLTLSFLTARIPYVGE  290
            L ++   L+F+T     +  
Sbjct  360  LGLLVTVLAFVTVLTTVLIA  379


>OGW98200.1 hypothetical protein A2Z81_02485 [Omnitrophica WOR_2 bacterium 
GWA2_45_18]HBR14096.1 hypothetical protein [Candidatus Omnitrophica 
bacterium]
Length=252

 Score = 45.1 bits (103),  Expect = 0.033, Method: Composition-based stats.
 Identities = 28/175 (16%), Positives = 60/175 (34%), Gaps = 6/175 (3%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            + L+ +       + L+  V                       N++   L+ +   +   
Sbjct  16   FRLYKKHWGKWFRLALIYFVPYTLLELFFWENASLARKIVLKYNFKVYHLINSGTILSAV  75

Query  171  LSWMTGS------MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
            + ++         +       + G+  S +   R   S+  + I+ ++ VG G+LLLI+P
Sbjct  76   MFFLLFLSALFKSIQAADEGKNWGIRSSYREAYRVFKSYLWVKIMYVVKVGLGTLLLIVP  135

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            G +  + + F    L  D   G  A   S+ ++ G         +   +I   L 
Sbjct  136  GFMRLIQYSFSGVALLLDGKIGEDAFVWSKKIIQGQLNKYLDYVLFFFIIVFGLF  190


>NOR60995.1 cell envelope integrity protein TolA [Rhodobacteraceae bacterium]
Length=546

 Score = 45.9 bits (105),  Expect = 0.033, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 13/38 (34%), Gaps = 0/38 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
            + CP C A      + +P +     C  C    +  P
Sbjct  2   RITCPKCKATYEVGEADIPEEGIEVECSACLNRWMQMP  39


>TMB13093.1 hypothetical protein E6J66_03980 [Deltaproteobacteria bacterium]
Length=286

 Score = 45.5 bits (104),  Expect = 0.033, Method: Composition-based stats.
 Identities = 9/33 (27%), Positives = 14/33 (42%), Gaps = 1/33 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECC  33
           M  V+CP C ++      K+  +    RC  C 
Sbjct  1   MV-VQCPTCQSKFRIADEKVTDRGVRVRCTSCK  32


>WP_144971118.1 hypothetical protein [Bremerella volcania]
Length=220

 Score = 44.8 bits (102),  Expect = 0.033, Method: Composition-based stats.
 Identities = 37/191 (19%), Positives = 64/191 (34%), Gaps = 2/191 (1%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W+L  R  W  LG  L+ + +         +   A    P                    
Sbjct  8    WDLMARWFWPFLGFSLVCMFIRIPADHLENIASSAGDDVPAGLWMISQGYEI--FVATPI  65

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
               +T             L        R+        ++  ++VG G LLLI+PGL   V
Sbjct  66   SMGLTWVFLKAARSEHFELSDIFGAFSRNYLHAVGAGVVQTILVGLGLLLLIVPGLYLIV  125

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
             F F +Y++ D  +G ++A+++S  +  G    +F    + ++I L    L         
Sbjct  126  KFSFVEYLIVDRKMGMIEAMKESWHMTDGREGTLFALMAMSVLIGLGGLLLCGVGIIPAL  185

Query  291  AANLAFSLLLT  301
                A   +L 
Sbjct  186  IWLSATYAVLY  196


>WP_124330109.1 zinc-ribbon domain-containing protein [Desulfonema ishimotonii]GBC62987.1 
hypothetical protein DENIS_3976 [Desulfonema ishimotonii]
Length=1801

 Score = 46.3 bits (106),  Expect = 0.033, Method: Composition-based stats.
 Identities = 11/63 (17%), Positives = 17/63 (27%), Gaps = 1/63 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + C +C A  N     +    S  RC  C +  +  P +       D          
Sbjct  1   MI-ITCKNCNARFNLDEKVIKPAGSKVRCSNCKEVFVVHPPKPPEELPPDLHPPKTSAVP  59

Query  61  QRR  63
              
Sbjct  60  VPE  62


>HAG52565.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=222

 Score = 44.8 bits (102),  Expect = 0.033, Method: Composition-based stats.
 Identities = 13/63 (21%), Positives = 18/63 (29%), Gaps = 1/63 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + CP C       S KL       RC +C      +P    +    +     P   L
Sbjct  1   MI-ITCPECKTRYMIKSEKLGMDPKDVRCAKCKHQWTVNPNPLAQDLIKEKPPLPPIARL  59

Query  61  QRR  63
              
Sbjct  60  DSP  62


>PHS00622.1 hypothetical protein COA78_23775 [Blastopirellula sp.]
Length=335

 Score = 45.5 bits (104),  Expect = 0.035, Method: Composition-based stats.
 Identities = 36/300 (12%), Positives = 88/300 (29%), Gaps = 5/300 (2%)

Query  6    CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
            CP CG+ +  P     +        E    +     +S+    +    T       R  P
Sbjct  26   CPKCGSSQKVPEDLYASDSELNEEEEEELEVAISATDSENPYRSPQHYTSSKTQRTRIEP  85

Query  66   SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIY  125
                     T +     ++  +            L  I+ +            +    + 
Sbjct  86   DKTGLDFVVTYSYECWMKNLPVLILIWAVVIIINLVVITLIRMTFTAAATELLYSNPPLA  145

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
             L  +       S  L           Q  +    + +  + +    +   ++++     
Sbjct  146  YLVEIAGSVVYLSVELQLGLGQTAILLQFARNENAVVSDLFQVRKAYFPLLAVYVMGYGL  205

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG  245
             +GL                LL    + +    L  +I      + ++   Y+LAD+ + 
Sbjct  206  FIGLMIVTISTALLYAEPPYLL----IGLAIAFLFPLIVAARVLITYWPVYYLLADEKVS  261

Query  246  GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSF  305
              Q+ ++++ + S +   IF R  +  + +L L  +   +  V     +A   ++    +
Sbjct  262  FTQSFQEAKEIGSQNKMVIF-RLFVYSLGTLLLGAMFFLVGIVAALPLVALYWVVAYLYY  320


>RKY07547.1 hypothetical protein DRP56_05520 [Planctomycetes bacterium]
Length=114

 Score = 42.8 bits (97),  Expect = 0.035, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 9/34 (26%), Gaps = 3/34 (9%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           M    C  CG     P           RC EC  
Sbjct  42  MIKFDCSKCGHSYRVPDR---YAGKRVRCKECST  72


>MBF0295426.1 zinc-ribbon domain-containing protein [Magnetococcales bacterium]
Length=187

 Score = 44.4 bits (101),  Expect = 0.035, Method: Composition-based stats.
 Identities = 10/74 (14%), Positives = 25/74 (34%), Gaps = 1/74 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V+C  C ++ +   + L       +C +C       P + + ++ T   A       
Sbjct  1   MI-VQCESCRSQFDVDDAILRPSGRKLKCSQCSAVFFQPPPKGEASRPTPVSAPASTPAQ  59

Query  61  QRRIPSDRLEIQSK  74
           +      ++  +  
Sbjct  60  ESPAVLKKIPDEEF  73


>WP_187139776.1 hypothetical protein [Sphingopyxis sp. OPL5]QNO26582.1 hypothetical 
protein EEB18_017815 [Sphingopyxis sp. OPL5]
Length=291

 Score = 45.5 bits (104),  Expect = 0.035, Method: Composition-based stats.
 Identities = 28/232 (12%), Positives = 64/232 (28%), Gaps = 6/232 (3%)

Query  71   IQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIV  130
               + +                   +      ++Q    +             I    + 
Sbjct  20   NWMQMLLWLGGAVVIACLLGWLLLRNAMMTMMMAQGDPSAAFGAFGSIILFGFIAGTIVY  79

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
             A   I+   L+      +        A L+     + + L  +   +   +     G+F
Sbjct  80   AASLLIWRTGLVGGEPASDIGWGLGAGAALMLANFVVQIALMIVFYIVLFIVGLLAFGIF  139

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
             +  + L    +      L++  V     L++     F         +  + +     A 
Sbjct  140  GASGMSLESFATGGASAGLILFGVIFYVALIVFFLWFFGRLTAAGPVMAVNRSSNPFSAF  199

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             +S  L S   W I G   L++++ L   F+ +             S ++TP
Sbjct  200  AESWRLTSASQWTIVGFNFLMILLFLVFFFIVSM------VFGGVASAMMTP  245


>TMB08409.1 hypothetical protein E6J64_01945 [Deltaproteobacteria bacterium]
Length=594

 Score = 45.9 bits (105),  Expect = 0.036, Method: Composition-based stats.
 Identities = 9/33 (27%), Positives = 15/33 (45%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           VRC  C A      +++  +  + RC +C  T 
Sbjct  3   VRCDKCQARYRIDDARVGPQGLAMRCGKCGNTF  35


>WP_189575605.1 hypothetical protein [Parvularcula lutaonensis]
Length=445

 Score = 45.9 bits (105),  Expect = 0.036, Method: Composition-based stats.
 Identities = 32/301 (11%), Positives = 77/301 (26%), Gaps = 21/301 (7%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
               +F     G++ ++L    L         +             +++            
Sbjct  125  QMMIFGGFSAGIILMFLAFFALTPVVTVIYEVSAGRRSWPSGFLYFRFGGRELRTIAGSY  184

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             L  +  +  + I    +            +G  T+  ++ +  +  G L  I+  +   
Sbjct  185  ILGVLIQAFVLGIAGPII--------LFVLLGVSTMNEVVAVTTIVLGYLTFILATIWVS  236

Query  230  VWFFFCQYVLADDN-IGGLQALEKSRLLVSGHWWAIFGRFVLL----LVISLTLSFLTAR  284
            +        +A +N +  ++A +       G+ W IF   V+      ++          
Sbjct  237  LRTIAIVPAIAAENRLIFIEAFKA----TRGNAWRIFFSLVVFSVLLALLYGVFMIAFVA  292

Query  285  IPYVGEAA----NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFG  340
            +  +G        + F +LL   + + Y +I S   A                       
Sbjct  293  VSALGAVLPEGGQVVFGILLAGLAAVAYVIILSMELAFGGLIWRAVRGNVMEDDDLPPAE  352

Query  341  WMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYK  400
              +   L           A +   A       +  +P           +  +R       
Sbjct  353  AAVADALDEFEEQEFGGEAVEEAPAEPAELDEVAEEPSAEAPDYDYAEQTARRDFVDRAP  412

Query  401  L  401
             
Sbjct  413  P  413


>PSQ08636.1 hypothetical protein BRC93_14835 [Halobacteriales archaeon QS_5_70_15]
Length=154

 Score = 43.6 bits (99),  Expect = 0.036, Method: Composition-based stats.
 Identities = 23/113 (20%), Positives = 45/113 (40%), Gaps = 5/113 (4%)

Query  202  SFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW  261
            +  +  ++  + V  G +LL+IPGL   V   F +  +A +    ++    S  L  G  
Sbjct  23   NLLVGGVVFGIAVAIGFVLLVIPGLFLLVSLLFWEVFVAVEGDNFVEGFRHSWDLTGGRR  82

Query  262  WAIFGRFVLLLVISLTLSFLTARIPYVGEAANL-----AFSLLLTPFSFLYYY  309
              +F   V +++++L +S + A    V  +          S L+  F      
Sbjct  83   LRLFALGVAVVLLALVVSVVFAIPGVVLPSVLGFPIEQVGSALVGVFVLATIA  135


>MBD5405312.1 hypothetical protein [bacterium]
Length=55

 Score = 41.3 bits (93),  Expect = 0.036, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 15/36 (42%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  ++CP C  E     + +  +K   +C EC    
Sbjct  1   MI-IKCPKCNTEFEVEDTLINGRKMKFQCAECSFVW  35


>MBI5543635.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=448

 Score = 45.9 bits (105),  Expect = 0.036, Method: Composition-based stats.
 Identities = 16/36 (44%), Positives = 21/36 (58%), Gaps = 0/36 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           MPT+RCP C  ER+ PS +LP  +   RC +C    
Sbjct  1   MPTIRCPSCQVERDIPSERLPPGRLRVRCKQCGAAF  36


>MSQ40701.1 hypothetical protein [Dehalococcoidia bacterium]
Length=239

 Score = 44.8 bits (102),  Expect = 0.036, Method: Composition-based stats.
 Identities = 31/218 (14%), Positives = 70/218 (32%), Gaps = 23/218 (11%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             + +LG  +       +        L   +   Q  +++  +A   + ++ +        
Sbjct  12   LLGVLGWQVPTFLHPLSGQGFSQGDLFSVSSWGQGFLVVVALALASVLVACLYLGALAQG  71

Query  183  CKTDVGLFRSMKLGLRHVG-----------------------SFTLLLILLILVVGGGSL  219
             +      R +                                + LL  +  ++      
Sbjct  72   AQEQPWQPRLLLSQAPRYWGRLTAYLALLTGVVLVAVPVIGSVWILLQAVAPVLGSLFLA  131

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            L +   L+  +  +  +  +  + +GGL AL +S  L   H+W++ G F+LL +ISL + 
Sbjct  132  LALSGWLVVSISLYLTKPAIFLEGLGGLAALRRSIALARRHFWSVLGLFLLLNLISLGVG  191

Query  280  FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKA  317
             L  R+  +     LA        + L   ++      
Sbjct  192  LLLVRVATLPWGLLLAIMGNAYLATGLSAAVMVYYRHR  229


>MBD3286106.1 hypothetical protein [candidate division WOR-3 bacterium]MBD3364810.1 
hypothetical protein [candidate division WOR-3 bacterium]
Length=247

 Score = 45.1 bits (103),  Expect = 0.037, Method: Composition-based stats.
 Identities = 16/110 (15%), Positives = 40/110 (36%), Gaps = 0/110 (0%)

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
                L   + L L  +    LL +     +    ++ I  G      F    + +  + +
Sbjct  138  GKALLAHIIWLLLLALVFLLLLGVRTWEGLIIVIIVGISLGFFISTRFQLALFAMIVEGL  197

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
               +A +++ LL+ G++W      + + + SL        +  +G + + 
Sbjct  198  SFNKAFKRNTLLLKGNFWRSAAVILRVCLPSLGFLAFLTILMVIGLSISA  247


>WP_007741265.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Halococcus thailandensis]EMA51762.1 hypothetical 
protein C451_13441 [Halococcus thailandensis JCM 13552]
Length=215

 Score = 44.8 bits (102),  Expect = 0.037, Method: Composition-based stats.
 Identities = 27/166 (16%), Positives = 46/166 (28%), Gaps = 14/166 (8%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV----  214
            +         L L  +   +                L L      T+   L + V     
Sbjct  44   LPPFIGVAARLLLILVGVVVAYRALGGRTRTDSPFLLRLFMAFLATVASYLSVFVGGMAL  103

Query  215  -------GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI-FG  266
                     G  +  +PGL      F     +  D  G  +AL  S  L++G   A  F 
Sbjct  104  AVPDTLGWVGFFVFALPGLYLYFRLFLSTPAVMIDGYGPAEALTTSWRLMNGSVLATAFA  163

Query  267  RFVLLLVISLTLSFLTARI--PYVGEAANLAFSLLLTPFSFLYYYL  310
              ++ +   + L  L   +   +V E   +            + YL
Sbjct  164  VTLVFVCGLVVLVSLFGIVQSSFVAEIGGVLIMDSFLAGMQAFLYL  209


>MBF0150403.1 zinc-ribbon domain-containing protein [Magnetococcales bacterium]
Length=193

 Score = 44.4 bits (101),  Expect = 0.037, Method: Composition-based stats.
 Identities = 12/55 (22%), Positives = 18/55 (33%), Gaps = 1/55 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATC  55
           M  V+C HC    +   S L       +C  C +     P E    +  +   T 
Sbjct  1   MI-VQCTHCTTRFDVDESMLLPTGRKLKCSNCKEVFFQKPPERDPDEQPEEDNTP  54


>NIA19101.1 hypothetical protein [Xanthomonadaceae bacterium]
Length=235

 Score = 44.8 bits (102),  Expect = 0.037, Method: Composition-based stats.
 Identities = 12/35 (34%), Positives = 15/35 (43%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            V C +C A      SK+PA     RCP C +   
Sbjct  2   KVYCEYCQARCRIDESKIPAAGGFVRCPACQKIFF  36


>SDB61763.1 MJ0042 family finger-like domain-containing protein [Belnapia 
rosea]
Length=166

 Score = 44.0 bits (100),  Expect = 0.037, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 14/34 (41%), Gaps = 1/34 (3%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            ++CP C A    P + + A     RC +C    
Sbjct  37  RIQCPQCQAAYEVPDTLIGA-GRLLRCVKCRHEW  69


>WP_157758574.1 zinc-ribbon domain-containing protein [Cystobacter fuscus]
Length=244

 Score = 44.8 bits (102),  Expect = 0.038, Method: Composition-based stats.
 Identities = 7/34 (21%), Positives = 10/34 (29%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP C  +       LP      +C  C    
Sbjct  2   EIACPQCSMKYALDPRLLPPGGVPVQCTRCSHVF  35


>TET10455.1 hypothetical protein E3J86_05635, partial [Candidatus Thorarchaeota 
archaeon]
Length=287

 Score = 45.1 bits (103),  Expect = 0.038, Method: Composition-based stats.
 Identities = 29/227 (13%), Positives = 59/227 (26%), Gaps = 10/227 (4%)

Query  95   ASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQN  154
             +     +  +    +   + +   G +G   L  +     I    L+  A         
Sbjct  43   MAWLANPTDPEAFGLAIAAYFQGLSGFMGSNPLAGLFGGGAIGLLFLIPFAAIATWMFGA  102

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
                     +       S         +     G+  S+ L +           L     
Sbjct  103  IFGMSNEIILTGGTHAESAFGYLRKNLVSYLGAGVLWSLVLFVPLWIIGLGATALTGFTS  162

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLAD-----DNIGGLQALEKSRLLVSGHWWAIFGRFV  269
                    I  + F   +    +++       D +G L +L+ S  L   +   +FG + 
Sbjct  163  LPAGWGWPIGIITFLYVYIVGGFLMLHLPASTDGLGALDSLKTSFRLTKENLVRVFGAWT  222

Query  270  LLLVISLTLSFLTAR----IPYVGEA-ANLAFSLLLTPFSFLYYYLI  311
            + +V+ +      A     I   G   A +      T F      L 
Sbjct  223  IFIVLIVVFFAPIALYSIYIGTPGLMDAGMIIVTAWTVFGVFALILF  269


>HBT83852.1 hypothetical protein [Desulfuromonas sp.]
Length=153

 Score = 43.6 bits (99),  Expect = 0.038, Method: Composition-based stats.
 Identities = 9/58 (16%), Positives = 15/58 (26%), Gaps = 0/58 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCG  59
             + C HC         KL  + +  RC +C +     P  +                
Sbjct  10  VVIECAHCQTRFKLADDKLRPEGTKVRCSKCKEIFTVFPPSTDDQPPAPAPPAXGTSC  67


>RMG19694.1 tetratricopeptide repeat protein, partial [Deltaproteobacteria 
bacterium]
Length=590

 Score = 45.9 bits (105),  Expect = 0.038, Method: Composition-based stats.
 Identities = 10/34 (29%), Positives = 13/34 (38%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + C  CGA    P + + A    A CP C    
Sbjct  2   RIDCERCGAAYRIPDAAVGAAGIRAECPRCGHQQ  35


>RKU36333.1 hypothetical protein C6496_13835 [Candidatus Poribacteria bacterium]
Length=310

 Score = 45.5 bits (104),  Expect = 0.038, Method: Composition-based stats.
 Identities = 26/242 (11%), Positives = 74/242 (31%), Gaps = 17/242 (7%)

Query  86   CLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPA  145
                      +   + +  +++ + + L+ +     +GI ++  ++         LL   
Sbjct  1    MSIENSTDTHTNLRIMNWMEIIDNVFHLYRKHFLLFIGISIIYFIVDSVEDKLFKLLWKN  60

Query  146  TWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT-  204
                   +   + +    +   ++  S +     I I  T           L +V  +  
Sbjct  61   NPYGLIEELISYLLSELVIVVFVVAASEIYFQRHITIRDTFQRFVNISPRYLVNVFIYLI  120

Query  205  --------------LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
                           +    ++++    LL  I    F + +     V+  +     + +
Sbjct  121  PLSILLLIGEIVSAGIRYFSMVILISLLLLTQIVRTYFLIIWQLYAPVIVVEGSMKPKPM  180

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
             +SR L+   WW +FG    L ++   +  +      +     +   +   P   +  Y+
Sbjct  181  RRSRALIRKSWWRVFGTIFTLKILLRAIRIIFVVSFVL--LLGMFGLMGDAPVWDIVEYV  238

Query  311  IY  312
            + 
Sbjct  239  LR  240


>RMG61088.1 DUF3426 domain-containing protein [Deltaproteobacteria bacterium]
Length=489

 Score = 45.9 bits (105),  Expect = 0.038, Method: Composition-based stats.
 Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M  ++C  C A+      K+  K +  RC +C   +I      + 
Sbjct  1   MI-IKCDSCNAKFKLNEDKIKGKGARVRCRKCGDYIIVMKPGYEH  44


>MBI2376544.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=739

 Score = 45.9 bits (105),  Expect = 0.038, Method: Composition-based stats.
 Identities = 8/33 (24%), Positives = 12/33 (36%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           VRC  CG E      ++     + +C  C    
Sbjct  3   VRCEKCGTEYEFDEDRIGPNGVTVKCTACGHVF  35


>MBE7200905.1 zinc-ribbon domain-containing protein [Parafilimonas terrae]
Length=126

 Score = 43.2 bits (98),  Expect = 0.038, Method: Composition-based stats.
 Identities = 10/42 (24%), Positives = 14/42 (33%), Gaps = 0/42 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
            + CPHC +    P+          RC EC  T   +     
Sbjct  2   QISCPHCSSRYEVPAELFAEDSRMVRCAECRDTWEVESPRRH  43


>MBA3247973.1 zinc-ribbon domain-containing protein [Pyrinomonadaceae bacterium]
Length=40

 Score = 40.5 bits (91),  Expect = 0.038, Method: Composition-based stats.
 Identities = 14/32 (44%), Positives = 19/32 (59%), Gaps = 0/32 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           T+ CP C A      SKLPA++ + RCP+C  
Sbjct  2   TINCPQCSARLVMEDSKLPARQFTVRCPKCQH  33


>MBE74794.1 hypothetical protein [Rhodopirellula sp.]
Length=301

 Score = 45.1 bits (103),  Expect = 0.038, Method: Composition-based stats.
 Identities = 32/294 (11%), Positives = 70/294 (24%), Gaps = 15/294 (5%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             TV CP C  +   P + L  K   ++C                +      A       Q
Sbjct  4    ITVVCPSCSHQMQVPENLLGKKGKCSKC---------------ASMILITNAAQQPFQQQ  48

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                  + +   +    +   +SF  QP ++        +   Q          +     
Sbjct  49   PPQQPFQQQPPQQPFQQQPPQQSFQQQPPQQSFQQQPPQQPFQQQPPQQGVRRNQLTGHF  108

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                         P +S  LL   +       +     +  T   +L+  + +   +F +
Sbjct  109  QQQPPQQPYQKVYPKYSESLLVGNSETFNMLWHVYAVSICLTXVGVLIIAAGVIPDVFXW  168

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +      L            S  +L+ +                     + F   +    
Sbjct  169  LSMILTTLGGLGLTISGIWMSINMLMAIYRYWTTLQGGNARTTPGKAIGFLFIPLFNFYW  228

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
                 +         ++ +   I        ++      L   +  +G    LA
Sbjct  229  GYQCFIGLRNDMNNYIANYRTDILPIRDGNYLVITLTLSLIPYVNLIGSILFLA  282


>RLC30587.1 hypothetical protein DRH32_05550, partial [Deltaproteobacteria 
bacterium]
Length=166

 Score = 44.0 bits (100),  Expect = 0.039, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + C  CG         L  + S  RC +C    
Sbjct  1   MI-ITCEKCGVSFKLNEKLLKPQGSKVRCSKCKHIF  35


>RKX61650.1 hypothetical protein DRP41_08160 [Thermodesulfobacteria bacterium]
Length=275

 Score = 45.1 bits (103),  Expect = 0.039, Method: Composition-based stats.
 Identities = 28/184 (15%), Positives = 66/184 (36%), Gaps = 10/184 (5%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
              +        +      K +   +      ++        +     + ++  +F+++  
Sbjct  97   VFVSAWTESMLLDVEKTGKTSIEGSFLRVRSKFWKEFFACIFTGFVSALLSIFIFLFVLT  156

Query  185  TDVGLFRSMKLGLR--HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
              V     +   +     G +      LI +      +LIIP  +  + F+F    +   
Sbjct  157  IAVAFGMGIFQEISTGRPGIYGFHAPALIGLGLLIIFILIIPFTILSISFWFSGTAIMKH  216

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            ++G   AL  S      ++W I   F+L++ +        + +PYVGE   L  + L  P
Sbjct  217  DVGLFSALRLSWRFTMRNFWRIVFLFLLIIGV--------SLVPYVGELLGLFLTPLWMP  268

Query  303  FSFL  306
            ++++
Sbjct  269  YAYM  272


>OPX25013.1 hypothetical protein B1H04_00955 [Planctomycetales bacterium 
4484_123]
Length=639

 Score = 45.9 bits (105),  Expect = 0.039, Method: Composition-based stats.
 Identities = 8/31 (26%), Positives = 10/31 (32%), Gaps = 0/31 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPE  31
           M    CPHCG       +    K     C +
Sbjct  2   MIRFTCPHCGRYVRVSDAHGGKKGQCPYCHQ  32


>MBC7288743.1 zinc ribbon domain-containing protein [Armatimonadetes bacterium]
Length=364

 Score = 45.5 bits (104),  Expect = 0.039, Method: Composition-based stats.
 Identities = 26/284 (9%), Positives = 67/284 (24%), Gaps = 19/284 (7%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             + CP CGA      ++     +                +    +   ++          
Sbjct  5    ELACPRCGAPIVETDTQCVNCGADLY-------RGRLVDQPSPQRPGLSVHHGIAEVQVG  57

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
             +    +   +           F                 +   L              L
Sbjct  58   DVEYRTVPDAAIDWGSLSGGGFFDALSRSWRFFVACLRMVVDYPLMLLPSFLTLLVAAAL  117

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
               +  I+                            + L   A +L  +      +  Y+
Sbjct  118  LGAVWAILHFAGLDDQVFASDDKNSHGLWFWVVSVPLTLLVYAVVLSFMGMTVHMVDAYL  177

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC------------V  230
                  L +++   + +  +   L ++ I++    S+        +             V
Sbjct  178  RGRRATLGQALADVMNNFSALFYLALVNIVISILLSMARGRGERSWRNRAVDAADRVRDV  237

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
                   V+  ++    +AL+++  L S     I    + LL++
Sbjct  238  ANHLVVPVIMIEDKRIGEALKRAYQLFSRSILDIVIAELGLLML  281


>NOZ83095.1 hypothetical protein [Euryarchaeota archaeon]
Length=269

 Score = 45.1 bits (103),  Expect = 0.039, Method: Composition-based stats.
 Identities = 22/207 (11%), Positives = 55/207 (27%), Gaps = 31/207 (15%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
            L   L ++ +    LL+  +                 +  I    +     + +   + +
Sbjct  20   LVPALTYSILSLPTLLQSFSAQASSGSLALEVAYAVFLWLIAPFFTAGIMGVALEARRGN  79

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL--------------------------  220
              +    + G     S  +  +++I+ V G  ++                          
Sbjct  80   ATISTFFQFGRSRYLSLLIATVVVIIAVLGVVMIALLGSILAYLAGRMVSEQVAVVAVVL  139

Query  221  ----LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
                  +  ++  V F F    +  ++   L A   S   V  +   +   FVL   + L
Sbjct  140  FLLLFGMLVIVAAVLFSFYDACVVVEDSDPLSAFTCSMSFVRSNLVHVIAFFVLAFAVFL  199

Query  277  TLSFLTARIPYVGEAANLAFSLLLTPF  303
              S +   +  +           +   
Sbjct  200  LFS-VPNAVAMLYYVLTSLPEFSMGML  225


>HBJ38725.1 hypothetical protein [Planctomycetaceae bacterium]
Length=543

 Score = 45.9 bits (105),  Expect = 0.039, Method: Composition-based stats.
 Identities = 15/40 (38%), Positives = 18/40 (45%), Gaps = 1/40 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M  VRCP CG     P   +PA  + A+CP C  T     
Sbjct  1   MSDVRCPSCGDSFRVPDFAVPA-GAMAKCPWCGDTFSMSM  39


>MBC7708400.1 hypothetical protein [Polaromonas sp.]
Length=366

 Score = 45.5 bits (104),  Expect = 0.039, Method: Composition-based stats.
 Identities = 30/246 (12%), Positives = 66/246 (27%), Gaps = 11/246 (4%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
               +      G+  ++     LAF    S      A  ++P      +A  + T   I  
Sbjct  98   YLGITYAWIIGINLVFFGIYFLAFKTGNSLESSTDANGMSPIAYVLAFAGYIITYFVINF  157

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV-----------GGGS  218
              + +  +M              + +     G   +  I+   V              G 
Sbjct  158  YATALAANMLDIFKGQKRSYTEYIGVARGKAGKILIFSIISATVGMFLQYVVERIRFIGW  217

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            +L  I G  + +   F   ++AD++     ++++S  L    W       + +      +
Sbjct  218  ILAYILGTAWNLGTMFVLPIIADEDASAPASIKRSIALFKQTWGESITAKITVNAPLFLI  277

Query  279  SFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAI  338
                    ++     +    L      +  Y+I     A      +  I        A+ 
Sbjct  278  QLALIVPFFLLLFGAIVVHSLPLVIFVIVLYVIAVLSLAVLGSFANSLINTALYYYAASG  337

Query  339  FGWMLI  344
                  
Sbjct  338  KIPAAF  343


>NCB49684.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=235

 Score = 44.8 bits (102),  Expect = 0.039, Method: Composition-based stats.
 Identities = 20/144 (14%), Positives = 43/144 (30%), Gaps = 1/144 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  + CPHC A+ + P++ L  K  + RC  C  T      ++ + +   +         
Sbjct  1    MI-INCPHCQAKYDVPTASLSKKIKAFRCSNCGYTWTVLIQDTHKKEKETSFMPSFVPKE  59

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                     ++++K +  +          +++ +   S  R    L       F    + 
Sbjct  60   VGDFFKKVSKLETKKIKKQENLEEPSKIEKKKSKKRFSFSRVKEPLKRVLRFCFVFGSFL  119

Query  121  LLGIYLLGIVLAFAPIFSALLLKP  144
                Y                L+ 
Sbjct  120  FFMTYFYPDFERNVLRPITFPLEF  143


>PKI55621.1 hypothetical protein CRG98_024014 [Punica granatum]
Length=213

 Score = 44.4 bits (101),  Expect = 0.039, Method: Composition-based stats.
 Identities = 24/171 (14%), Positives = 59/171 (35%), Gaps = 6/171 (4%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            F          + +            L L     +    +     ++          +  
Sbjct  10   FLGVCGMFGKSFRVISTRKKLLAQITLALIFPVGIFFLAEPKFKKVMRVVPKVWTRLIIT  69

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
                  IY       ++ S+ +GL  +    +  ++ I++     +  I+  +   V   
Sbjct  70   FLLIFVIYF------VYTSIFVGLLLLCLVFVAHLVGIILAVIPIVASIVGIVYLTVVSA  123

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
                V   +++ G +A+ KS+ L+ G  W++ G FVLL++  + + ++   
Sbjct  124  LACVVSVLEDVYGRKAMSKSKALLKGKLWSVVGCFVLLILSYVLVEYVIYF  174


>NCO33277.1 hypothetical protein [Armatimonadetes bacterium]NCO90329.1 hypothetical 
protein [Armatimonadetes bacterium]OIP05463.1 hypothetical 
protein AUJ96_10820 [Armatimonadetes bacterium CG2_30_66_41]PIU92977.1 
hypothetical protein COS65_15190 [Armatimonadetes 
bacterium CG06_land_8_20_14_3_00_66_21]
Length=255

 Score = 44.8 bits (102),  Expect = 0.040, Method: Composition-based stats.
 Identities = 13/92 (14%), Positives = 23/92 (25%), Gaps = 3/92 (3%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             V CP CG E   P S++      A+C      ++  P  +       +     +    
Sbjct  7   IAVTCPKCGKEYQIPPSRI---GQKAKCSCGNVMVVQKPPTAAEMTACASCGAQVYATEA  63

Query  62  RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREF  93
                 R   + K       +           
Sbjct  64  TCPACGRSTREGKPEKGPPASHPQDDSQHPPP  95


>NIS75466.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=165

 Score = 44.0 bits (100),  Expect = 0.040, Method: Composition-based stats.
 Identities = 10/45 (22%), Positives = 19/45 (42%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M  + C +C A+      K+  K +  +C +C   +I    E + 
Sbjct  1   MI-ILCDNCKAKYRLSEDKIAGKGARVKCRKCSDYIIVMKPEYEH  44


>NCD24537.1 DUF3426 domain-containing protein [Deltaproteobacteria bacterium]
Length=503

 Score = 45.9 bits (105),  Expect = 0.040, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 14/34 (41%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            V+CP C  + N    ++  + S  RC  C    
Sbjct  2   LVQCPECSTKYNLNEKQIAPEGSKVRCSRCKNVF  35


>OLB24717.1 hypothetical protein AUH95_02275 [Nitrospirae bacterium 13_2_20CM_2_63_8]
Length=142

 Score = 43.2 bits (98),  Expect = 0.040, Method: Composition-based stats.
 Identities = 22/76 (29%), Positives = 35/76 (46%), Gaps = 1/76 (1%)

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA  283
            PG+   V + F   ++ D  +G  QALE SR +V+ HWW  FG  +++L++      + A
Sbjct  4    PGIYLLVAYTFSYLLIVDRRMGVWQALEGSRRVVNKHWWGGFGLTLVMLLLVGIGGMIGA  63

Query  284  RIPYVGEAANLAFSLL  299
             I   G          
Sbjct  64   VI-LGGPIGYGLSGWF  78


>GDX64267.1 hypothetical protein LBMAG35_11050 [Chlorobi bacterium]
Length=243

 Score = 44.8 bits (102),  Expect = 0.041, Method: Composition-based stats.
 Identities = 19/175 (11%), Positives = 50/175 (29%), Gaps = 0/175 (0%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
             +    G  +     L    A      +++      ++              +  IL   
Sbjct  14   HILKYAGIPICIASFLATTPASPLKEPSIITLAQFPIDDHIWLVSGIAFFTFLGLILAIS  73

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
            +    ++   +     G     +       +   +  LL L    G ++ + PG+     
Sbjct  74   TASAFTLHCLLHGERPGDGMLRETIQSSFVNVCTVGFLLALTSIIGIIVCVAPGIYIMSA  133

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
                  +L  ++   LQA++    L+  H    F   + + +++         + 
Sbjct  134  GTVMIPLLLHEHRSPLQAMQSGIQLIRNHALKPFAFSLFVFIVASICITAIEYVM  188


>XP_001024639.2 Ibr domain protein [Tetrahymena thermophila SB210]EAS04394.2 
Ibr domain protein [Tetrahymena thermophila SB210]
Length=369

 Score = 45.5 bits (104),  Expect = 0.041, Method: Composition-based stats.
 Identities = 19/237 (8%), Positives = 49/237 (21%), Gaps = 13/237 (5%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            TV CP C        +    +     C    Q    D ++ + +  T          + +
Sbjct  121  TVSCPKCNTYFIGTDAFCKGEYKCLEC----QFQWIDKSKIEVSLFTQRFYQEVFTYIFQ  176

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
             + +        ++           +               +                  
Sbjct  177  IMFTQYCPKCGISIQKNGGCLHMTCKKCDFEFCWLCKQNYNTH---------EDLRCVAY  227

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
               +  + +        +      + +                  +L   ++  + + YI
Sbjct  228  IFTMKSLTIYVFFNILVIFNSEMIFYSSIFWIISAFFKFIYYNLFILIAWFILSTFYSYI  287

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                  L   +            L  ++I+ +     L             FC Y  
Sbjct  288  KMLQPSLKIGIYQKNEKKRKCLALCFIMIISMVVFLGLTYFLEHSLTNVLTFCFYEC  344


>PID97948.1 hypothetical protein CSA83_02005 [Actinomycetales bacterium]
Length=377

 Score = 45.5 bits (104),  Expect = 0.042, Method: Composition-based stats.
 Identities = 22/189 (12%), Positives = 58/189 (31%), Gaps = 2/189 (1%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            + V     ++    + F V + +   +LA +    +++L +S  + S  +W   G   + 
Sbjct  188  IRVYSFIFVVFFILMYFAVRWVYVGQLLALEKASAVESLRRSWGMTSRDFWRTLGYLFVG  247

Query  272  LVISLTLSFLTARIPYVGEAANL--AFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
             +I+L +      +      A+     + +    S     L+ + L + +       I  
Sbjct  248  YLIALMVYTPATMLGAGTMLASSDQLIAAIEPFRSDEVMGLLIAILVSVFTVIAPQAILT  307

Query  330  QWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPE  389
              +    +    + +  +      R  LS   +         +  +Q     +     P 
Sbjct  308  VLVSACISTCFLVYVAAMYRDQRYRDFLSERGIDPKNVTPVWQGPSQHHYPAEQPSLPPH  367

Query  390  EPQRLSSAD  398
                 +   
Sbjct  368  PSSGHAPGA  376


>PJA23892.1 hypothetical protein COX57_11340 [Alphaproteobacteria bacterium 
CG_4_10_14_0_2_um_filter_63_37]
Length=441

 Score = 45.5 bits (104),  Expect = 0.042, Method: Composition-based stats.
 Identities = 11/62 (18%), Positives = 16/62 (26%), Gaps = 0/62 (0%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            + C  C    + P   +P +    RC       IF       +            G Q  
Sbjct  46   ITCTQCQTTFSVPDGLIPPEGRRVRCANPECRTIFQAYPVAVSSPKPLEPEPLFAGPQEP  105

Query  64   IP  65
             P
Sbjct  106  EP  107


>WP_092870459.1 DUF975 family protein [Acetitomaculum ruminis]SFA83043.1 Uncharacterized 
membrane protein [Acetitomaculum ruminis DSM 5522]
Length=247

 Score = 44.8 bits (102),  Expect = 0.042, Method: Composition-based stats.
 Identities = 24/198 (12%), Positives = 56/198 (28%), Gaps = 3/198 (2%)

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI  167
                  F    + +  +  + I      + + +       +             +    +
Sbjct  35   CCLSSPFYFTVYNVNNVASIVIFYVATFMINIISRLFFIGILYILMQITRHEKTSFANLL  94

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
                      +   I    +G+   + L +    S  +  +           L +I  L+
Sbjct  95   YAFTHKPDRFIKAIIFLLLIGIVVGLPLDILDYFSNKISFMNNDFSHIIILFLHMIIRLI  154

Query  228  FCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL--TLSFLTAR  284
              + F     ++ D+  I G++A++KS  L+ GH   +F   +      L   +SF  A 
Sbjct  155  IFLLFMPVYLLMIDNQEIKGMEAVKKSFFLMKGHRKNLFLLVMSFFGWLLLGIISFGFAF  214

Query  285  IPYVGEAANLAFSLLLTP  302
            +                 
Sbjct  215  LWVTPYFFTSVIYYYYYI  232


>MBI5870759.1 hypothetical protein [Actinobacteria bacterium]
Length=341

 Score = 45.5 bits (104),  Expect = 0.042, Method: Composition-based stats.
 Identities = 19/271 (7%), Positives = 63/271 (23%), Gaps = 6/271 (2%)

Query  28   RCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCL  87
            R       LI+              ++           +   +  S             L
Sbjct  16   RLMGDTWRLIWRHKFLWFFGLFAGGSSSFGGSGNFNFDAGTGDGPSNGRGRADDTGQEFL  75

Query  88   QPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATW  147
                        L +          L+     G +   +  +       F +   +    
Sbjct  76   DWINAHLTLVLVLAAAVVTFFILIWLWSVVCRGAVIGSVRDVRQERNISFRSAFARGRES  135

Query  148  LNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLL  207
                     + +LL    ++++  + +     + +  T      S+        +   L 
Sbjct  136  FGRLLLFDLFLLLLGICLFVIVAATVLFILFLVMVAGTAGTAILSIIGLWLLALAVFGLG  195

Query  208  ILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
             L    +     +     + +          +  + +  + A  +   ++  +       
Sbjct  196  YLACCTIWFVPWVFFGIIVNYATR------AVVLEQMRPMAAFRRGWRVMMDNLGQTMLL  249

Query  268  FVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
            F++ L +S+           +    ++   +
Sbjct  250  FLINLGLSIGAGIAIVLSVGLSSVPSIIAWI  280


>RLC05449.1 hypothetical protein DRI57_27550, partial [Deltaproteobacteria 
bacterium]
Length=500

 Score = 45.5 bits (104),  Expect = 0.042, Method: Composition-based stats.
 Identities = 12/43 (28%), Positives = 15/43 (35%), Gaps = 1/43 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
           M  V C  C    N   S L  K S  +C +C    +  P   
Sbjct  1   MI-VTCEKCQTSFNLDESLLKPKGSKVKCSQCKNVFLVHPPAP  42


>XP_005789164.1 hypothetical protein EMIHUDRAFT_420859 [Emiliania huxleyi CCMP1516]EOD36735.1 
hypothetical protein EMIHUDRAFT_420859 [Emiliania 
huxleyi CCMP1516]
Length=228

 Score = 44.8 bits (102),  Expect = 0.042, Method: Composition-based stats.
 Identities = 11/62 (18%), Positives = 15/62 (24%), Gaps = 1/62 (2%)

Query  4    VRCPHCGAERNTPSSKLPAK-KSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + C +C A  +     L        RC  C        A   R      +   P     R
Sbjct  49   ITCSNCKASYSVEEDALGPGAGKRVRCSNCDHEWFQSVARLARLPEDMELVEYPQEMKDR  108

Query  63   RI  64
              
Sbjct  109  MA  110


>HHS83215.1 hypothetical protein [Devosia sp.]
Length=216

 Score = 44.4 bits (101),  Expect = 0.042, Method: Composition-based stats.
 Identities = 10/33 (30%), Positives = 14/33 (42%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           + CP C A+    +  L AK  + RC  C    
Sbjct  12  IVCPACKAQYKVAADTLGAKGRTVRCARCHTDW  44


>OQB43596.1 tRNA_anti-like protein [Candidatus Hydrogenedentes bacterium 
ADurb.Bin179]
Length=214

 Score = 44.4 bits (101),  Expect = 0.042, Method: Composition-based stats.
 Identities = 9/39 (23%), Positives = 12/39 (31%), Gaps = 0/39 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  V CPHCG E       +  +     C +        
Sbjct  1   MVKVACPHCGMELEIQEKYIGQQGRCKCCNQVFTAQPQP  39


>WP_171420597.1 zinc-ribbon domain-containing protein, partial [Corallococcus 
exercitus]NOK13426.1 hypothetical protein [Corallococcus exercitus]
Length=59

 Score = 40.9 bits (92),  Expect = 0.042, Method: Composition-based stats.
 Identities = 8/32 (25%), Positives = 11/32 (34%), Gaps = 0/32 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
            + C  C A        +  K   A+CP C  
Sbjct  2   RIVCQKCAAAYAIDDRLITTKGVRAQCPRCRH  33


>PMP79746.1 hypothetical protein C0184_09735 [Chloroflexus aggregans]
Length=183

 Score = 44.0 bits (100),  Expect = 0.043, Method: Composition-based stats.
 Identities = 14/84 (17%), Positives = 29/84 (35%), Gaps = 0/84 (0%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
            L   +L L V     L+I   +   +W  F       +  G +  +++   ++  +    
Sbjct  78   LSASMLGLGVLTILGLIICLAIPILLWSLFFAVFRPLEGQGPVAGIKRGWQVLRRNLGQA  137

Query  265  FGRFVLLLVISLTLSFLTARIPYV  288
               +++   I L  S     I  V
Sbjct  138  VVVWLVTFGIGLVYSLPVGAIGGV  161


>TKJ40567.1 hypothetical protein CEE36_09265 [candidate division TA06 bacterium 
B3_TA06]
Length=290

 Score = 45.1 bits (103),  Expect = 0.043, Method: Composition-based stats.
 Identities = 29/213 (14%), Positives = 52/213 (24%), Gaps = 8/213 (4%)

Query  80   RCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSA  139
                   L   R              +L+    +       LL +      L F      
Sbjct  22   FPAHIHLLDTTRRAWQILREEWWALIVLSSVPVIVKWTLRQLLDVARFPGSLFFYGYVDK  81

Query  140  LLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH  199
              +  A         W         A  LL    +TG     I   +      M L L  
Sbjct  82   QGILLAVRDLALTGAWIVVASWFAAALTLLVHRSLTGRDHRLITVLEPSWKPLMLLSLCG  141

Query  200  VGSFTLLLILLILVVGGG--------SLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALE  251
            +         L L    G          +      +  +      + +  +  G  +AL 
Sbjct  142  LIIGLAAWSALALTTITGSLVVYLSLDWIFQAVAFVVGIRLVLVPFAIVVEKKGLFEALR  201

Query  252  KSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
             S  L   +++ I    V +++  L    + + 
Sbjct  202  YSFGLTRYNFFRILMVEVAIVLPVLGFWSILSL  234


>WP_167050864.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Salinibacterium sp. ZJ77]
Length=516

 Score = 45.5 bits (104),  Expect = 0.043, Method: Composition-based stats.
 Identities = 27/252 (11%), Positives = 55/252 (22%), Gaps = 1/252 (0%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            +    G    +I   +          +   + +   +A+  S  L  G   A+     L 
Sbjct  159  IPGVIGYGAALIVLGVVSTRLALTLPLFVVEEVSAARAMALSWRLTRGRPDALLTVACLS  218

Query  272  LVIS-LTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
             ++    ++ L A +  V  A     +   + +       +         G     +   
Sbjct  219  GLLLGGVVAILLAIVAVVPTAVADLIAPEASIWVAAAGLGLAEVAAVAVGGFAVLLLASG  278

Query  331  WLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEE  390
             L L A       +      +  R         +          T          +   +
Sbjct  279  LLELLARGRPHPRVTPTAPATPPRAVWRTLLTPAIIIAAMIGGATAVNVPAMRVLASQPD  338

Query  391  PQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSL  450
                  AD            +  G  +      AD+                 D   L +
Sbjct  339  TLIGVEADASATAEDSIAAAAPEGADVTVRIASADQPSGHAVALLREAIESGDDPRRLIV  398

Query  451  AQKGSARIEIDK  462
            A      I    
Sbjct  399  ASPDERVIRDLS  410


>OUT67844.1 hypothetical protein CBB70_08095 [Planctomycetaceae bacterium 
TMED10]
Length=199

 Score = 44.4 bits (101),  Expect = 0.043, Method: Composition-based stats.
 Identities = 10/29 (34%), Positives = 11/29 (38%), Gaps = 0/29 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARC  29
           M  +RCPHC      P S    K     C
Sbjct  7   MIRIRCPHCREPVKAPDSLAGKKGRCPEC  35


>RLB96685.1 hypothetical protein DRH90_24405, partial [Deltaproteobacteria 
bacterium]
Length=520

 Score = 45.5 bits (104),  Expect = 0.043, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + C +C    N     L    S  RC +C    
Sbjct  1   MI-ITCENCSTSFNLDEKFLKPSGSKVRCSKCKHVF  35


>HBI52056.1 hypothetical protein [Ruminococcaceae bacterium]
Length=288

 Score = 45.1 bits (103),  Expect = 0.043, Method: Composition-based stats.
 Identities = 25/289 (9%), Positives = 71/289 (25%), Gaps = 13/289 (4%)

Query  5    RCPHCGAERNTPSSKL--PAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             C  CG + +    K+  P   +        +  +      +   T        +   + 
Sbjct  11   VCDKCGKDLDNGEEKVYCPVCGTPVHKSCWEEDPVCPN---ESRHTEGFDWEKENKRTEI  67

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                +  +                         SG  L       ++         +   
Sbjct  68   PRDPEPYKNGYVNPAAFENFTEMIENHPIVSPESGEELTCRGVKQSELVHFLGMNNFSTP  127

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
              + + + +A      +L L    ++   +   +       +      L+  +    +  
Sbjct  128  RFFTIFMNMANGGKVLSLNLSAWFFMPLYHYYRRMTGPAVILTLASFILTIPSLMYQVMF  187

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
             +        +         F  L+ +   ++    ++L++    F + +   + +   +
Sbjct  188  MREMDTAMPDL--------QFGALVSITSYIMILFRIVLLLFNDYFYMRWSVNKILSLRE  239

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
                  A E    L       +      + +I   +  L   I Y G  
Sbjct  240  QYKDASADEYYAALERKGNPKMMYVLGGISLILFLVYILNIFISYTGIL  288


>RHQ11090.1 DUF975 family protein [Lachnospiraceae bacterium AM48-27BH]
Length=269

 Score = 44.8 bits (102),  Expect = 0.043, Method: Composition-based stats.
 Identities = 20/104 (19%), Positives = 34/104 (33%), Gaps = 12/104 (12%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVL  270
            L +    L+L + G+   + F    YVL D   +   QAL +SR L+ G+   +    + 
Sbjct  169  LFMLVYGLVLTVVGIYLALTFGMFLYVLVDRPEMTLWQALGESRRLMKGNRIRLVMLQIS  228

Query  271  LLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
             +               +         L L  +        Y D
Sbjct  229  FIGW-----------GLISILTLGIGLLWLNGYILCTTAWFYKD  261


>WP_182096764.1 zinc-ribbon domain-containing protein [Enhydrobacter aerosaccus]
Length=177

 Score = 44.0 bits (100),  Expect = 0.043, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 11/34 (32%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            V CP+CGA        +       +C  C    
Sbjct  2   QVICPNCGARYAVDPLAIGPVGRIVQCARCSHRW  35


>TFH07157.1 hypothetical protein E4H08_09965 [Candidatus Atribacteria bacterium]
Length=277

 Score = 45.1 bits (103),  Expect = 0.043, Method: Composition-based stats.
 Identities = 26/194 (13%), Positives = 52/194 (27%), Gaps = 5/194 (3%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             L  I     +L    I   +L              +  +     +  +  L        
Sbjct  52   QLGAIVGAPALLWTQLIVLRVLAPTEPKRPRMAGKLKDWLSAFAASAAVPTLFRTATWWI  111

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                    GL   +          +  + L        ++L    G +     FF     
Sbjct  112  WAAVPLTGGLMYGLYAWGFGNMFASGRMDL-----FISAVLFSNAGFMISCGLFFAPLCA  166

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
              D+ G   A+ +S  + SGH   I     +   + +TL      +  +  AA +   L 
Sbjct  167  MQDSKGPFDAIRRSWRMASGHRLKILAIAAICFWLPVTLFLAAYFLSLLRHAAAVFEGLP  226

Query  300  LTPFSFLYYYLIYS  313
               ++     ++  
Sbjct  227  AMLWTISAVVVVLF  240


>MBI4817803.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Deltaproteobacteria bacterium]
Length=319

 Score = 45.1 bits (103),  Expect = 0.044, Method: Composition-based stats.
 Identities = 28/199 (14%), Positives = 55/199 (28%), Gaps = 3/199 (2%)

Query  108  ADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI  167
                    R    L   +     L F  +   L  +         Q       L  + + 
Sbjct  21   HRDVWRLMRVHPWLFVGFPAVAHLPFDVLSELLASEAGEDTLRAVQIGMRFQGLVDLVWG  80

Query  168  LLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
               ++ +   + I        L+ +M       G   +   ++  +    + + ++P L 
Sbjct  81   TFVVATLLQGLTIVGEGRTPTLWEAMGRARETWGRVVVTTFVVRTLTVLSAFIFVVPALY  140

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI---SLTLSFLTAR  284
                +       A D +    A +KS  LV          +   ++I   S  L+     
Sbjct  141  LASRWMLAIPAAAIDGMTSSAARKKSWELVRNRGALRVLIYGASMLIAYSSFCLAAAVVL  200

Query  285  IPYVGEAANLAFSLLLTPF  303
                G A  L  +L   P 
Sbjct  201  PSVEGFAGVLLEALKQAPI  219


>OLS24189.1 hypothetical protein HeimC3_20880 [Candidatus Heimdallarchaeota 
archaeon LC_3]
Length=420

 Score = 45.5 bits (104),  Expect = 0.044, Method: Composition-based stats.
 Identities = 20/174 (11%), Positives = 52/174 (30%), Gaps = 0/174 (0%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
                     ++ I  + + +                 +   Q +   +    +  ++   
Sbjct  173  FNIFEWLNSIMLIVGISLAILIFEKSIYETNIVKVQYSGLIQRFVNRLPKILIFSLIYAF  232

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
              +          T +    + +  L+ +  F        + +   S++     LLF + 
Sbjct  233  FQIFFIKSPDFLLTLMARDLADQSSLQLLSIFLSFSAYAYITITISSVVYFPLWLLFNIL  292

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            FF     +   +      ++ + +LV G W  IFG     L++   +  +   I
Sbjct  293  FFLALPSILLGDGEKAGGIKYAEVLVRGRWKKIFGFLFTRLLLFSVVVIILTYI  346


>RLC31491.1 hypothetical protein DRH37_02940 [Deltaproteobacteria bacterium]
Length=320

 Score = 45.1 bits (103),  Expect = 0.044, Method: Composition-based stats.
 Identities = 8/43 (19%), Positives = 17/43 (40%), Gaps = 1/43 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
           M  ++C  C  + +   +++    +  RC  C    +  P E 
Sbjct  1   MV-IQCDKCHTKFHLDETRIGTGGTRVRCSRCRHVFVAHPPEP  42


>PPR31196.1 hypothetical protein CFH36_02276 [Alphaproteobacteria bacterium 
MarineAlpha9_Bin6]
Length=263

 Score = 44.8 bits (102),  Expect = 0.044, Method: Composition-based stats.
 Identities = 10/45 (22%), Positives = 17/45 (38%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M  V CP+C    +  +  L       RC +C +  +  P    +
Sbjct  1   MI-VACPNCITRFSVKAGTLGEAGRKVRCSKCGKEWLQRPVIPDQ  44


>WP_145211101.1 hypothetical protein [Gimesia alba]QDT40707.1 hypothetical protein 
Pan241w_07650 [Gimesia alba]
Length=454

 Score = 45.5 bits (104),  Expect = 0.044, Method: Composition-based stats.
 Identities = 10/63 (16%), Positives = 15/63 (24%), Gaps = 0/63 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M   +CPHC      P      K +   C    Q         +   ++      P    
Sbjct  1   MINFKCPHCNFAIRVPDGAAGKKGTCKFCNRVVQVPTIQNETKESLTSSTPSNDSPIFFD  60

Query  61  QRR  63
              
Sbjct  61  CDD  63


>MBI3760084.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=265

 Score = 44.8 bits (102),  Expect = 0.044, Method: Composition-based stats.
 Identities = 11/39 (28%), Positives = 14/39 (36%), Gaps = 0/39 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
             V+C  C          LPA   + +C  C     FDP
Sbjct  3   IEVQCTSCHTRYRIDEQILPAGLPTFKCSRCGHVFNFDP  41


>XP_030940958.1 uncharacterized protein LOC115965797 [Quercus lobata]
Length=154

 Score = 43.6 bits (99),  Expect = 0.044, Method: Composition-based stats.
 Identities = 20/128 (16%), Positives = 38/128 (30%), Gaps = 11/128 (9%)

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
                 +     V   + I G+QA++KS+ L+ G        F+LL    + +  +   I 
Sbjct  16   CCSKVWQLASVVSVLEAIRGIQAMKKSKTLIKGKMGVAVAFFILLYTCFVGIDLVFGYIV  75

Query  287  -----------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLT  335
                        V    +  F   L   + +   ++Y   K+ +            L   
Sbjct  76   VGENVPDIAIRIVVGILSFLFLFKLILIALVVQTVVYFVCKSYHHQTIDKSALADHLEAV  135

Query  336  AAIFGWML  343
                   L
Sbjct  136  YLGDYVPL  143


>NJD92159.1 hypothetical protein [Geobacter sp.]
Length=288

 Score = 45.1 bits (103),  Expect = 0.044, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 15/36 (42%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  ++C HC A      +KL       RC +C +  
Sbjct  1   MI-IQCDHCSARFRMDDAKLANGPVKVRCAKCKEVF  35


>BBK39348.1 MJ0042 family finger-like domain protein [Stella sp. ATCC 35155]
Length=216

 Score = 44.4 bits (101),  Expect = 0.045, Method: Composition-based stats.
 Identities = 11/62 (18%), Positives = 15/62 (24%), Gaps = 1/62 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    C  C          L ++    RC  C  +    P      +        P    
Sbjct  1   MIL-TCSACDTRYLIDPVTLGSEGRVVRCANCGHSWHQMPPADSPRRIDLLAPPDPEQPA  59

Query  61  QR  62
           QR
Sbjct  60  QR  61


>ALP05310.1 Glycerophosphoryl diester phosphodiesterase [Clostridioides difficile]
Length=491

 Score = 45.5 bits (104),  Expect = 0.045, Method: Composition-based stats.
 Identities = 14/206 (7%), Positives = 64/206 (31%), Gaps = 13/206 (6%)

Query  87   LQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPAT  146
                         L +   ++     +     +       + I    +  +  + L    
Sbjct  62   HIVYLNMDNFIKVLTNPMSIVLIILSVIILAFYAFFEFTSVIICFNKSIKYEKIGLFELF  121

Query  147  WLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL  206
             ++ +N           +   ++ +  +  ++ I      + +   +   ++      ++
Sbjct  122  KISFKNSIKLLYPKNILLFIFVILIIPLVNTVLISGFIGKIQIPEYILDYIKSDIILNII  181

Query  207  LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG  266
             ++ +L++            +  + + F  + +  +     +A ++S  L+ G       
Sbjct  182  YMIFLLIIY-----------ISVIRWIFSIHEITLNTENFKRARKESVNLIKGKIIRTII  230

Query  267  RFVLLLVISLTLSFLT--ARIPYVGE  290
              ++L ++   + +    + I ++G 
Sbjct  231  YSLILSLVVAIIGYAIYHSGIIFIGI  256


>HBI44919.1 hypothetical protein [Planctomycetales bacterium]
Length=218

 Score = 44.4 bits (101),  Expect = 0.045, Method: Composition-based stats.
 Identities = 22/193 (11%), Positives = 43/193 (22%), Gaps = 8/193 (4%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             T+ CP C  +   P+  +       RC  C          ++           P     
Sbjct  5    ITIVCPECEKKIAVPAEAV---GKKVRCKGCQHVFAIKAPAAKPAGGKAAPIKKPPAAKP  61

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELF-----CR  116
            +  P+D  E     +     + +           S   +  ++        L        
Sbjct  62   KPKPTDDGEEDDAPIGVTALDTAPRCPNCANEMESEDAVICLTCGYNTRTRLQAATLAIE  121

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                +   + L   +  A  F  LL     +    +                    W   
Sbjct  122  DTTKMTWFWWLLPGILCALFFIILLTFDILYQIKIDNYVDKDNDWYAFVAAGFFKIWFIW  181

Query  177  SMFIYICKTDVGL  189
            +   +I    V  
Sbjct  182  APTTFINVGLVTF  194


>WP_194538646.1 hypothetical protein [Thermogemmata fonticola]MBA2227016.1 hypothetical 
protein [Thermogemmata fonticola]
Length=179

 Score = 44.0 bits (100),  Expect = 0.045, Method: Composition-based stats.
 Identities = 11/35 (31%), Positives = 16/35 (46%), Gaps = 3/35 (9%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
             V CP CGA    P +   A   + +CP+C   +
Sbjct  3   ILVACPSCGARLKVPDN---AAGKTVQCPKCGTNM  34


>MBC7863755.1 hypothetical protein [Bacteroidia bacterium]
Length=336

 Score = 45.1 bits (103),  Expect = 0.045, Method: Composition-based stats.
 Identities = 15/126 (12%), Positives = 44/126 (35%), Gaps = 3/126 (2%)

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
             WM  +  I        +   + +    + +  +  I+  + +    L+   P L     
Sbjct  137  VWMVFTGMIVFSLVSFVVLAIVAIPFFLLCTSGVGGIIFAIFLLFIILIAFGPQL--GYI  194

Query  232  FFFCQYVLADDNI-GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            F +    +   +      A++++   + G++W ++   + + +I    S +T     + +
Sbjct  195  FSYAPPFVMVRDKVFVFTAIKRTIANIKGNFWWLWVLMICMGLIIGIFSMITNLPATIYQ  254

Query  291  AANLAF  296
              N   
Sbjct  255  MGNTFG  260


>MBE6369220.1 DUF975 family protein [Lentisphaerae bacterium]
Length=248

 Score = 44.8 bits (102),  Expect = 0.046, Method: Composition-based stats.
 Identities = 31/229 (14%), Positives = 56/229 (24%), Gaps = 40/229 (17%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
                    +    L  +       +         +          +      + L+ L  
Sbjct  19   IWSAELMNISFGALFKLANQKLSRNYWRSLLVVLITAGIIQGVNQLSFGLGTFFLMPLVA  78

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL-------------  220
                 F+ I +        +     +         L         LL             
Sbjct  79   GELWYFLLIVRGGDPEIDVIFNPFSNYWPLVGANFLASFFSMLPMLLAVPLGIAAVLLGI  138

Query  221  ----------------LIIPGLLFCVWFFFCQYVLADDNIGGL-QALEKSRLLVSGHWWA  263
                              IP ++  + FF   Y+L D     +  AL +SR L+  H   
Sbjct  139  TQYYIAAAVLAVAALLCWIPCIVISLSFFAVNYILLDSPSTPMMDALSRSRALMKNHKLE  198

Query  264  IFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
            +FG  +LL++I + + F                 L   PF   +    Y
Sbjct  199  LFGFNLLLMLIGIPVVF----------LTCGLGLLWYLPFCSYFSAAYY  237


>MBI4723487.1 zinc-ribbon domain-containing protein [Rhodomicrobium sp.]
Length=309

 Score = 45.1 bits (103),  Expect = 0.046, Method: Composition-based stats.
 Identities = 10/40 (25%), Positives = 15/40 (38%), Gaps = 2/40 (5%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M  + CP C    +   + LP +    RC +C       P
Sbjct  1   MI-IECPACTTRYDIK-AALPPEGRKVRCAKCKTEWRAMP  38


>WP_097115037.1 hypothetical protein [Alysiella filiformis]QMT31950.1 hypothetical 
protein H3L97_03475 [Alysiella filiformis]SOD70703.1 hypothetical 
protein SAMN02746062_02084 [Alysiella filiformis 
DSM 16848]
Length=285

 Score = 44.8 bits (102),  Expect = 0.046, Method: Composition-based stats.
 Identities = 25/233 (11%), Positives = 67/233 (29%), Gaps = 33/233 (14%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
             + +  W L        +L    +   + +         +      +       + +  +
Sbjct  27   FWLKHAWHLTKARFWRWMLISFIMNLCITVVSTALTFSSDSLILTNVAQILSGMLNVLFT  86

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG-------  225
                     + + D      +    +H     L+LIL  + +    +LL+          
Sbjct  87   GGLLLGMAALAEGDELEVNHLFAAFQHKWQDLLILILCFIPIVLLYILLMGGAFFALLGG  146

Query  226  -------------LLFCVWF-----------FFCQYVLADDNIGGLQALEKSRLLVSGHW  261
                         ++F ++F           FF   ++   ++  L A++ S      + 
Sbjct  147  GNIMSNDINTNSLIIFIIFFVLSYFFMMMVGFFSIPLVVLHDVKPLTAIKMSFSGSLKNI  206

Query  262  WAIFGRFVLLLVISLTLSFLTARIPYVGEAAN--LAFSLLLTPFSFLYYYLIY  312
              +    +   VI   L+F+   +  +   A       +  T  + +  ++I 
Sbjct  207  APMIALNLGFFVIGFGLAFILGILFSLLSFALNDSLAGITATILTMVPLFIIM  259


>NWG23342.1 hypothetical protein [Pseudorhodoplanes sp.]
Length=288

 Score = 45.1 bits (103),  Expect = 0.046, Method: Composition-based stats.
 Identities = 16/123 (13%), Positives = 39/123 (32%), Gaps = 7/123 (6%)

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
                 F  +    V L  ++      +   +         V     +++       V   
Sbjct  101  YYYLGFGRVELLFVLLPVAIGAIFLALSFLSAESDEESPFVLAAFAVIVFAFFYVMVRLS  160

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL----LLVISLTLSFLTARIPYVG  289
                ++  +         ++  L  G++W +F  +++    LL+I + L+ L   +  VG
Sbjct  161  LIFPIVVVEGRYDFG---QAWALTRGNFWRLFATWLVAVIPLLLIFMLLASLFGGLAGVG  217

Query  290  EAA  292
               
Sbjct  218  GMF  220


>MBC6406805.1 zinc-ribbon domain-containing protein [Rhodobacteraceae bacterium]
Length=229

 Score = 44.4 bits (101),  Expect = 0.046, Method: Composition-based stats.
 Identities = 11/39 (28%), Positives = 15/39 (38%), Gaps = 0/39 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
            +RCPHC  E N   + +P      +C  C  T      
Sbjct  2   QLRCPHCKTEYNIAENSIPETGRVIQCALCDNTWYRTHP  40


>NNM31535.1 hypothetical protein [Gemmatimonadetes bacterium]
Length=123

 Score = 42.8 bits (97),  Expect = 0.047, Method: Composition-based stats.
 Identities = 13/41 (32%), Positives = 19/41 (46%), Gaps = 0/41 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
            TV CP C +E    S+K+P     A+C  C  T   +  +
Sbjct  3   FTVSCPDCSSEFPVDSAKVPEGGVLAQCSACPGTFFVEQPQ  43


>QKF93524.1 chitin synthase [Fadolivirus 1]
Length=890

 Score = 45.9 bits (105),  Expect = 0.048, Method: Composition-based stats.
 Identities = 22/277 (8%), Positives = 63/277 (23%), Gaps = 23/277 (8%)

Query  57   HCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCR  116
                        L                 +               +S +     +    
Sbjct  588  WINGSVAGYIYLLFQNFSDFRQWDAPFYRKVYVWILLMCQFLIYCMVSVVPGIMLKTLYY  647

Query  117  RGWGLLGIYLLGIVL----AFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
                  G Y +   +     F  I++  +         +       +L+       L   
Sbjct  648  GLAYFFGYYGIKSDIGLISTFIVIWALYICHVFIHHKTKFNYIIMYLLVFLSLVTSLVSF  707

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
                          V    +    + ++G     L  ++ +   G     +  +   + +
Sbjct  708  GSLFHYAFIDTDQTVSDIMTSGNPIIYMGIAVFFLPFILALCLSGRGHSFMYMIKSFIQY  767

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGH------------WWAIFGR-----FVLLLVIS  275
                 +L      G  A  ++  L  G+               I         + ++++ 
Sbjct  768  TLFIPLLI--GWFGSYAYARTWDLTWGNRPANELNDITEEQKKIMITKFKEKNIRIIMVL  825

Query  276  LTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
            + L+     IP  G+   +     +  +   + ++  
Sbjct  826  IALNIAIFFIPLQGQFVLMGLFFAIALYQMFFSFVFC  862


>RII26264.1 hypothetical protein CXR31_11245 [Geobacter sp.]
Length=306

 Score = 45.1 bits (103),  Expect = 0.048, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V C  CG+      S++P     ARC  C    
Sbjct  1   MV-VNCEKCGSTLRVDESRIPDAGVIARCTICSHRF  35


>MBJ7471536.1 hypothetical protein [Solirubrobacteraceae bacterium]
Length=283

 Score = 44.8 bits (102),  Expect = 0.048, Method: Composition-based stats.
 Identities = 26/112 (23%), Positives = 43/112 (38%), Gaps = 1/112 (1%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
             ++    L  +    ++       +V    +++LG     +     +   L    G L L
Sbjct  81   VSLIGASLVTAAHARAVVALAAGENVRTGSALRLGGGAFLTVLGAALCYTLATIAGMLAL  140

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGH-WWAIFGRFVLLL  272
            I+PG+   V   F   + A  N G L +L  S  LV    WW  FG  +L+ 
Sbjct  141  IVPGIYLSVALLFGPQIAALTNTGPLDSLSSSYRLVRAAGWWRTFGYSILIF  192


>TAL03411.1 DUF3426 domain-containing protein [Rhodospirillaceae bacterium]
Length=835

 Score = 45.9 bits (105),  Expect = 0.048, Method: Composition-based stats.
 Identities = 11/39 (28%), Positives = 16/39 (41%), Gaps = 0/39 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
            + CP+C A    P+  L  K  S +C  C  +    P 
Sbjct  2   RITCPNCTASFEIPTELLGKKGRSLKCASCGHSWYQTPH  40


>ETR71671.1 fimbriae-associated protein [Candidatus Magnetoglobus multicellularis 
str. Araruama]
Length=972

 Score = 45.9 bits (105),  Expect = 0.048, Method: Composition-based stats.
 Identities = 13/93 (14%), Positives = 24/93 (26%), Gaps = 0/93 (0%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            V CP+C ++    +S +    S  RC  C       PA+   T   D+        +   
Sbjct  11   VTCPNCQSQFELDNSSIKKSGSKVRCSNCKHVFKVFPAKENDTPAQDSGTPPNEGTIACP  70

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRAS  96
                   +    +  +                 
Sbjct  71   NCESSYALSPDQLGEKGKKLKCTNCSHVFRAMP  103


>WP_191282923.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Lysinimonas yzui]GHF15727.1 hypothetical 
protein GCM10011600_15970 [Lysinimonas yzui]
Length=378

 Score = 45.1 bits (103),  Expect = 0.049, Method: Composition-based stats.
 Identities = 19/127 (15%), Positives = 42/127 (33%), Gaps = 1/127 (1%)

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
               +     +    T   L  +  L L    S +        V    ++ +++ G     
Sbjct  171  TGRLVAWTVVVFLLTAGTLVLASLLPLTLAVSSSAGAGFAFAVGFLEAIAILLVGGYLAG  230

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT-ARIPYVG  289
               F  + +A + +G   AL +S  L     W +FG  +L+ ++    + +    I +  
Sbjct  231  RVGFTSHAIALEGLGVAGALARSWRLTRRAGWRLFGAQLLIWIVVGLAAAVLTQPISWAL  290

Query  290  EAANLAF  296
            +      
Sbjct  291  DLGVGLV  297


>HDH90011.1 hypothetical protein [Candidatus Bathyarchaeota archaeon]
Length=319

 Score = 45.1 bits (103),  Expect = 0.049, Method: Composition-based stats.
 Identities = 14/105 (13%), Positives = 36/105 (34%), Gaps = 1/105 (1%)

Query  208  ILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
             +LI+ +    +   +   L     FF    +  +  G   ++++S  L+  H+  I   
Sbjct  140  PVLIVALIFFKIASALAYGLAQFVTFFALPAMIIEGKGFKDSIKRSWDLIKNHFIDIIIY  199

Query  268  FVLLLVISLTLSFLTARIPYV-GEAANLAFSLLLTPFSFLYYYLI  311
               +  + +   F+   +  V G    +   + +   S     + 
Sbjct  200  VSGVSGVYMFFIFIMILLYGVSGYLVGMFILVPILGISEDLMVVA  244


>QHQ37057.1 hypothetical protein GO499_18660 [Rhodobacteraceae bacterium 
9Alg 56]
Length=280

 Score = 44.8 bits (102),  Expect = 0.049, Method: Composition-based stats.
 Identities = 7/43 (16%), Positives = 16/43 (37%), Gaps = 0/43 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
            + CP+C A+         ++  + +C +C         E + 
Sbjct  2   RIVCPNCSAQYEVDGRLFTSEGRAVQCAQCNTRWTQQKVEDEP  44


>HAC91574.1 hypothetical protein [Planctomycetaceae bacterium]
Length=325

 Score = 45.1 bits (103),  Expect = 0.050, Method: Composition-based stats.
 Identities = 24/274 (9%), Positives = 57/274 (21%), Gaps = 22/274 (8%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                CP C       +          +C            ++     +   +      ++
Sbjct  3    IQFNCPSCQHVLTVGNQLAGKAGKCHKCGAKTPVPGETTEDAASEGKSKPSSAGSASKVK  62

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                       +  +       +      +                     ++       
Sbjct  63   AGTRKPASAAPAGNLANVMDELTESDFNRQSPFKQ----------------VYSPPKPDT  106

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
             G   L  V+           K               I  +  A  L+ + W  G     
Sbjct  107  SGDARLRRVVEMEKKDKKGKGKQEAEFGWNVLMGVHNIFESLAAVALIAIVWGWGWPEWR  166

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                +                 T++  L  L++  G ++LI     + V      +    
Sbjct  167  SSYREYIPLIDWGGQYDQGRGLTVVGSLAFLMMICGVVMLIHHPWFYIVALASYVFFCEL  226

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
                 +        L   +   I G +++L +  
Sbjct  227  HIFNVIA------RLGDRNNSIIAGIWLVLALFL  254


>MBF0263282.1 zinc-ribbon domain-containing protein [Magnetococcales bacterium]
Length=261

 Score = 44.8 bits (102),  Expect = 0.051, Method: Composition-based stats.
 Identities = 10/57 (18%), Positives = 20/57 (35%), Gaps = 1/57 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPH  57
           M  V+C  C ++ +     L       +C +C       P +S+  +      + P 
Sbjct  1   MI-VQCESCRSQFDVDDEILRPSGRKLKCSQCQAVFFQPPPKSRDAKGDMVRGSDPR  56


>OLA82105.1 hypothetical protein BHW58_02080 [Azospirillum sp. 51_20]
Length=262

 Score = 44.8 bits (102),  Expect = 0.051, Method: Composition-based stats.
 Identities = 10/38 (26%), Positives = 13/38 (34%), Gaps = 0/38 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
            ++CP C       S KLP      RC +C        
Sbjct  2   LIKCPQCQTVYRLESEKLPENGLKMRCAKCRCVWRAYN  39


>MBI5528023.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=726

 Score = 45.5 bits (104),  Expect = 0.051, Method: Composition-based stats.
 Identities = 8/55 (15%), Positives = 16/55 (29%), Gaps = 0/55 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP  56
             ++C  CG E      ++P +  + +C  C          S       +     
Sbjct  3   VDIKCAKCGTEYELEEDRIPPQGLTVKCTSCGFIFKAAKPVSGAPGMHPDHKKTM  57


>NOR25893.1 hypothetical protein [Desulforhopalus sp.]
Length=233

 Score = 44.4 bits (101),  Expect = 0.052, Method: Composition-based stats.
 Identities = 23/208 (11%), Positives = 63/208 (30%), Gaps = 19/208 (9%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
               + ++     + +      A     +        L+  +    + LS +   M     
Sbjct  16   FKYVALLSVPFTLLAFPSYFIAQTAENRIGYITVIGLIVYLVGFAMYLSSLIFFMSQAYQ  75

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII-------------PGLLFCV  230
                 +  ++  GL +     + L++    +  G +++                G+   +
Sbjct  76   GKLQSIKSNLINGLVYTPLLMVTLLVANSPLIAGFVIMFTSTPFQFLTLPLLVLGIYVSL  135

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL------TAR  284
               F  + L  +    L A++ S     G    I    ++  + +  +  +         
Sbjct  136  RSTFAPFHLILEGHKPLGAIKSSFSSTKGQVSKIVKVVMMFYIATTVVEVVSTSNANIEL  195

Query  285  IPYVGEAANLAFSLLLTPFSFLYYYLIY  312
               +     +A +LL+T F  +  + +Y
Sbjct  196  FNILMFFLGVAVTLLMTAFQQIVVFKMY  223


>NOU26329.1 hypothetical protein [Polyangiaceae bacterium]
Length=47

 Score = 40.5 bits (91),  Expect = 0.052, Method: Composition-based stats.
 Identities = 6/33 (18%), Positives = 15/33 (45%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           V+C  C  E +   + +  + ++ +C +C    
Sbjct  3   VQCDRCKTEYDFDDALVSTRGTTVKCTQCGHQF  35


>TNF35770.1 hypothetical protein EP329_05890 [Deltaproteobacteria bacterium]
Length=432

 Score = 45.1 bits (103),  Expect = 0.052, Method: Composition-based stats.
 Identities = 12/102 (12%), Positives = 21/102 (21%), Gaps = 1/102 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  V CP C  +      KL  + +   CP+C    +        +    N         
Sbjct  1    MI-VACPSCEKKYKFSEEKLGKRGAKITCPKCRHVFVVYKDREIESLGRKNADGTIEADD  59

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRS  102
            +                     R          +   +    
Sbjct  60   RATNQPAPPRDPWPRDFPTWSFRELGATWRVRKQQGLTYDFY  101


>MAQ02903.1 hypothetical protein [Rhodospirillaceae bacterium]
Length=421

 Score = 45.1 bits (103),  Expect = 0.053, Method: Composition-based stats.
 Identities = 8/35 (23%), Positives = 14/35 (40%), Gaps = 0/35 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           V CP+C        +++ A+  + RC  C      
Sbjct  3   VVCPNCQTSYILDGAQIGAEGKAMRCSNCGTAWRQ  37


>CDB39928.1 mJ0042 finger-like region protein [Azospirillum sp. CAG:260]
Length=276

 Score = 44.8 bits (102),  Expect = 0.053, Method: Composition-based stats.
 Identities = 7/33 (21%), Positives = 10/33 (30%), Gaps = 0/33 (0%)

Query  6   CPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           CP C          +P      RC +C +    
Sbjct  5   CPKCKTGYAVEPELIPENGKKMRCAKCGEIWFC  37


>TMA15853.1 hypothetical protein E6J85_19310, partial [Deltaproteobacteria 
bacterium]
Length=546

 Score = 45.5 bits (104),  Expect = 0.053, Method: Composition-based stats.
 Identities = 13/54 (24%), Positives = 20/54 (37%), Gaps = 0/54 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP  56
            ++CPHC A       ++P    S +CP+C    +    E          A  P
Sbjct  2   RIQCPHCPAVYELDDGRVPPAGLSIKCPKCKAAFVVHRPEDGSKMAPTTAAKIP  55


>QKK10258.1 hypothetical protein HND59_00175 [Proteobacteria bacterium]
Length=237

 Score = 44.4 bits (101),  Expect = 0.053, Method: Composition-based stats.
 Identities = 10/40 (25%), Positives = 15/40 (38%), Gaps = 0/40 (0%)

Query  5   RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           +CPHC        ++L A    ARC  C +         +
Sbjct  9   QCPHCDTRFRITEAQLDAAGGKARCSRCHRVFNARAYLQE  48


>NIY18218.1 hypothetical protein [Nitrospinaceae bacterium]
Length=192

 Score = 44.0 bits (100),  Expect = 0.053, Method: Composition-based stats.
 Identities = 29/188 (15%), Positives = 47/188 (25%), Gaps = 11/188 (6%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  + CP+CG  +  P  K P K   A CP+C                 + +        
Sbjct  3    MVKISCPYCGLTKTIPRDKAPTKPVKANCPKCKHQF---------PINPEKLQPAEPETA  53

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
            Q R                        +        G    SI  L   ++ +F  R + 
Sbjct  54   QPRAGQPPAPEVQTQQAPPPRPPLPATEEIDPHAGLGGVPESIGNLFTQTFGIFAGRFFT  113

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L G+YLL  ++   P                     W I         + +++       
Sbjct  114  LFGLYLLMFIMILVPGGIFGAGTVFLVPMMPG--LGWLITPLGGLITAIVVTYFLFWGXX  171

Query  181  YICKTDVG  188
                    
Sbjct  172  AAVSVKRS  179


>HBT83656.1 hypothetical protein [Desulfuromonas sp.]
Length=84

 Score = 41.7 bits (94),  Expect = 0.053, Method: Composition-based stats.
 Identities = 19/79 (24%), Positives = 29/79 (37%), Gaps = 0/79 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
            T+ CPHC   R+ P  K+P   +SA CPEC ++ +  P  +            P   L 
Sbjct  1   VTIACPHCHFARSLPLEKVPRSIASATCPECNRSFMPQPQVTITCPHCHFSQELPLDKLP  60

Query  62  RRIPSDRLEIQSKTVNCRR  80
            +  +       K      
Sbjct  61  TKAVNATCPECRKQFPLPM  79


>WP_171982531.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Sphingomonas sp. LM7]
Length=271

 Score = 44.8 bits (102),  Expect = 0.054, Method: Composition-based stats.
 Identities = 22/171 (13%), Positives = 48/171 (28%), Gaps = 3/171 (2%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
            C           +  ++   P+   L       L      +            +  +   
Sbjct  56   CMTFHTNALFRRIAALVRAHPLPGLLWFTALVALGTAVDVFGDGDPRIQFVITIPEIFAH  115

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
                   +   ++            V S+  L I+  L +  G  LLI+PGL     +  
Sbjct  116  FAITGALLRGEEL---HRAWAMPGRVASYIGLGIVTGLAIMIGLFLLILPGLYLYARWVV  172

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
               ++  +     +A+ +S         +I    VL+ +  +   F    +
Sbjct  173  ATPLVIGEGASMSEAMSRSWRRTRPAATSITAALVLINLPYVAGLFAILYL  223


>OZB73339.1 thiol reductase thioredoxin, partial [Thiomonas sp. 14-64-326]
Length=28

 Score = 39.8 bits (89),  Expect = 0.054, Method: Composition-based stats.
 Identities = 7/27 (26%), Positives = 10/27 (37%), Gaps = 0/27 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARC  29
            + CPHC      P  +L       +C
Sbjct  2   LIVCPHCATLNRVPDDRLKDAPVCGQC  28


>KAA3653043.1 DUF3426 domain-containing protein, partial [Proteobacteria bacterium]
Length=91

 Score = 41.7 bits (94),  Expect = 0.055, Method: Composition-based stats.
 Identities = 12/37 (32%), Positives = 15/37 (41%), Gaps = 0/37 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M   RCP C       S +L A++   RC  C  T  
Sbjct  1   MMRTRCPACQTLFRISSEQLRARQGMVRCGHCRHTFN  37


>MBA2603838.1 hypothetical protein [Acidobacteria bacterium]
Length=144

 Score = 42.8 bits (97),  Expect = 0.055, Method: Composition-based stats.
 Identities = 16/88 (18%), Positives = 33/88 (38%), Gaps = 7/88 (8%)

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA---IFGRFVLLLVISLTLSFL----T  282
             ++   Q V+A +++G  +A+ ++   V G       IFG  +LL  I+   S +     
Sbjct  20   FFYLITQMVMAVEDLGVRRAIGRAAEFVRGSLREVAGIFGIVLLLAAIATVASVVATAGF  79

Query  283  ARIPYVGEAANLAFSLLLTPFSFLYYYL  310
              I  +         L +  +    +  
Sbjct  80   GLINLIPILGLAVLPLQIAAWLVRGFVF  107


>WP_053550135.1 zinc-ribbon domain-containing protein [Desulfuromonas soudanensis]ALC15981.1 
hypothetical protein DSOUD_1200 [Desulfuromonas 
soudanensis]
Length=174

 Score = 43.6 bits (99),  Expect = 0.055, Method: Composition-based stats.
 Identities = 12/37 (32%), Positives = 17/37 (46%), Gaps = 0/37 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           T+ CP CGA      +K+    +  RCP+C      D
Sbjct  2   TIECPGCGARYRMDPAKVSGTTARVRCPKCSVPFQID  38


>HGM97971.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=180

 Score = 43.6 bits (99),  Expect = 0.055, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 15/36 (42%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V C  CGA+      K+  K +  RC +C    
Sbjct  1   MI-VNCDKCGAKFQLADDKITDKGTKVRCSKCKAIF  35


>WP_054966758.1 zinc-ribbon domain-containing protein [Thiohalorhabdus denitrificans]KPV39740.1 
hypothetical protein AN478_11630 [Thiohalorhabdus 
denitrificans]SCX91453.1 MJ0042 family finger-like 
domain-containing protein [Thiohalorhabdus denitrificans]
Length=272

 Score = 44.8 bits (102),  Expect = 0.055, Method: Composition-based stats.
 Identities = 8/35 (23%), Positives = 14/35 (40%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            +RCP+C       S ++  + +  RC  C     
Sbjct  2   EIRCPNCDTRFRVRSGQIREEGTKVRCSVCTFRFW  36


>MBC8145691.1 hypothetical protein [bacterium]
Length=184

 Score = 43.6 bits (99),  Expect = 0.055, Method: Composition-based stats.
 Identities = 21/126 (17%), Positives = 46/126 (37%), Gaps = 19/126 (15%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            +    +L  +P +     F     +  ++ IG  +A+ +   L+ G WW  F   V++  
Sbjct  28   LVVLIILGFVPTIYIYNCFTILIAMRLEEEIGLFEAVRRCFSLMKGRWWFSFWITVVMAF  87

Query  274  ISLTLSFLTAR-------------------IPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
            + + +S +                      I ++G A +   S L+     L   ++Y +
Sbjct  88   VVMFVSMIFQLPLQVSMGVATYAGSDGTGWILFIGVALSTIGSYLMYSILILSCSVLYYN  147

Query  315  LKANYR  320
            L  +  
Sbjct  148  LVEHKE  153


>MBI3467178.1 hypothetical protein [Planctomycetes bacterium]
Length=215

 Score = 44.0 bits (100),  Expect = 0.056, Method: Composition-based stats.
 Identities = 12/137 (9%), Positives = 27/137 (20%), Gaps = 3/137 (2%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M    CP CG +   P            C  C   +       +       +   P    
Sbjct  1    MIRFLCPKCGKKLEVPD---GTAGQKGNCSGCGAMVEVPGTVKEVGPEAFALDDVPPADD  57

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                     + +              +   R +        + +      +     +   
Sbjct  58   VGHGAHQSAQPEPHPPPAGSEQGPRPVLKRRIWTVQPKEPVAETARSRFEFPRRAVQFVQ  117

Query  121  LLGIYLLGIVLAFAPIF  137
               +    + L +   F
Sbjct  118  WGTVIAAVLALLWTLPF  134


>RLS58209.1 hypothetical protein DWH91_02825 [Planctomycetes bacterium]
Length=307

 Score = 44.8 bits (102),  Expect = 0.056, Method: Composition-based stats.
 Identities = 35/318 (11%), Positives = 77/318 (24%), Gaps = 19/318 (6%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             TV C  C          +       +C  C  +L  +          +   + P     
Sbjct  3    ITVTCEECSEIHRVRDDAV---GKRLKCKGCGISLKVEAPAPSEDDFANLDVSEPDDDEI  59

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                      + K+   RR +             S + +    QL+   + LF    +  
Sbjct  60   DPSRLKPALRKVKSAAGRRKSSKSGTGKSAPVPLSETKVPLGIQLVYYGFMLFLFAMFVT  119

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
             GI              +  +     L                + +      M  +    
Sbjct  120  FGIAWTFRNTPRGIPPISAPVLYGLGLL-----------YFASSIVTTVGKLMCVTAPPQ  168

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF-----FCQ  236
            +    V L            +   L ++L   +     L+ + G++  VWF      F +
Sbjct  169  MSGKGVILAAVAFDLFAQAITVAKLFMVLPPPLVASINLVSVAGMVCFVWFLQHLGRFLK  228

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
                 +   GL AL    + +      +    +  ++  +        +        +  
Sbjct  229  EQDISERASGLLALGFGIVAMWLAMIVLSALAMARVLPVMVGGLGGVLLSLALLIVGIIG  288

Query  297  SLLLTPFSFLYYYLIYSD  314
             +          Y +  +
Sbjct  289  VIRYVGLLHSCLYTMSYE  306


>TNE52213.1 hypothetical protein EP343_02090 [Deltaproteobacteria bacterium]
Length=402

 Score = 45.1 bits (103),  Expect = 0.056, Method: Composition-based stats.
 Identities = 12/72 (17%), Positives = 18/72 (25%), Gaps = 0/72 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           + C  C      P  K+P   +  RC  C  T   +      +         P      +
Sbjct  3   IECQFCQTRFRVPQEKIPPAGTLVRCSSCRATFWVNAPSHDASSAPPPAPATPPPSSLPK  62

Query  64  IPSDRLEIQSKT  75
             SD        
Sbjct  63  SNSDSWWEMPPM  74


>OGP84803.1 hypothetical protein A2Y95_07645 [Deltaproteobacteria bacterium 
RBG_13_65_10]
Length=429

 Score = 45.1 bits (103),  Expect = 0.056, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 11/36 (31%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V C  C         ++       RC +C  T 
Sbjct  1   MI-VCCEKCQTRFKLNEHRIKPPGVRLRCSKCQHTF  35


>RME19960.1 hypothetical protein D6806_17360, partial [Deltaproteobacteria 
bacterium]
Length=327

 Score = 44.8 bits (102),  Expect = 0.056, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 11/34 (32%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            V CP C         K+    +  +CP C    
Sbjct  2   RVVCPGCSTGYRVADEKVSGDGARIKCPRCGTVF  35


>SEH05063.1 Uncharacterised protein [Thiotrichales bacterium HS_08]
Length=125

 Score = 42.4 bits (96),  Expect = 0.057, Method: Composition-based stats.
 Identities = 10/33 (30%), Positives = 19/33 (58%), Gaps = 0/33 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQT  35
            V+CP C A+ +   +++PA K+  +C +C   
Sbjct  2   QVQCPDCQAQYSIDPAQIPAGKTGFKCKQCGGF  34


>MXV76647.1 hypothetical protein [Candidatus Poribacteria bacterium]
Length=216

 Score = 44.0 bits (100),  Expect = 0.057, Method: Composition-based stats.
 Identities = 21/216 (10%), Positives = 63/216 (29%), Gaps = 16/216 (7%)

Query  81   CNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSAL  140
             +           + +      +  +L  +  L+       L I  + IV         +
Sbjct  1    MSNPNDTSKTDNRQMTTLQPMELGGILDTALSLYRNNFRSFLAIISVYIVWIGLQEALVV  60

Query  141  LLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHV  200
             L   +  +  +        L      +  +     +         + +   ++      
Sbjct  61   WLLEISNTSRLDNLISDVEDLLDNLVYMFCVGVFVIASSAIYLGKPITVRVVLQQFRSQF  120

Query  201  GSFTLLLILLILVVGGGSLLLIIPGLL----------------FCVWFFFCQYVLADDNI  244
              +    +L ++     +L  I   ++                F + + F  + +  +  
Sbjct  121  PIYFGSSLLFLIPYMILTLESIETSIITLLSPLSLFSLLFLCVFYISWIFYGHAVLLEGF  180

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
              ++A  +SR+LV G W       + +L++ + + +
Sbjct  181  IAMEAFGRSRILVRGTWMRGCSITLAILLLQIGIYY  216


>MBU48998.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=334

 Score = 44.8 bits (102),  Expect = 0.057, Method: Composition-based stats.
 Identities = 31/329 (9%), Positives = 67/329 (20%), Gaps = 38/329 (12%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQT----------------------LIFDP  40
             + CP CG+     S         +      Q                        I  P
Sbjct  2    LIECPSCGSHFVVDSEGQHPCPVCSHMCGGQQNAAPSTPQEPPPSRPPESAWDVGPIAPP  61

Query  41   AESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGL  100
              S                    I + +      T              + +   +    
Sbjct  62   MPSSPQTIHGPPPVSQGTHSMCAIHAQKEAEGVCTRCGNFYCDECRGMIDGQPFCAPCYY  121

Query  101  RSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAIL  160
            +   +     WE                  + F+P      +                + 
Sbjct  122  KMEGEARYIVWEDASSSMSFYNKFMATVKEVMFSPSTFFDSMPLKGGYGKPILFAMICMG  181

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
            L ++   +L +  +           + G    M      V   ++ + +++ V  GG + 
Sbjct  182  LGSIGIAILQVGMLVLGASSGGRSLNPGFLIVMGFMYVIVMLVSIPMSVVMAVFVGGGIY  241

Query  221  LIIPGL----------------LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
                 L                     +      L    IG    +              
Sbjct  242  HGCLKLFGGGEKGYEATVRVLCYATAPYILGLIPLCGMYIGAFWQMGLQIYGAKRAHDMT  301

Query  265  FGRFVLLLVISLTLSFLTARIPYVGEAAN  293
             GR ++ + +   +  +   I  +   A 
Sbjct  302  LGRVLMAVFVPFFVVMIIVAIFVLAVIAL  330


>MBF0308179.1 zinc-ribbon domain-containing protein [Magnetococcales bacterium]
Length=673

 Score = 45.5 bits (104),  Expect = 0.057, Method: Composition-based stats.
 Identities = 13/31 (42%), Positives = 15/31 (48%), Gaps = 0/31 (0%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPEC  32
              V CP CG +    +S LPAK    RC  C
Sbjct  632  IQVACPGCGKKFRLQASLLPAKGRKLRCSAC  662


>NUO54404.1 hypothetical protein [Polyangiaceae bacterium]
Length=295

 Score = 44.8 bits (102),  Expect = 0.058, Method: Composition-based stats.
 Identities = 8/47 (17%), Positives = 14/47 (30%), Gaps = 0/47 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTD  50
           V C  C  E     + +  + +S RC +C             +    
Sbjct  3   VTCEKCQTEYEFDDALVSEQGTSVRCTQCGHRFKVRRPTGSGSPEVW  49


>PYP89631.1 hypothetical protein DMF61_02885 [Blastocatellia bacterium AA13]
Length=288

 Score = 44.8 bits (102),  Expect = 0.058, Method: Composition-based stats.
 Identities = 29/315 (9%), Positives = 72/315 (23%), Gaps = 36/315 (11%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             ++CP+CG      +             E   T    P                      
Sbjct  1    MIQCPNCGTGATDSTKFCRQCGWKMPQGEDAATWRLPPKTEPAPAPGTEPVGNTQTVEPP  60

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                  +                   P R   A G  L +  ++ +++W L         
Sbjct  61   GTSPAYMPPGGFYEPAPAVPYQPETAPPRGNIALGDWLSNGWRVYSENWSLMSVAAMIGG  120

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             + L  + +   P+   +       +  +                      +  ++   +
Sbjct  121  LLSLCSVGILAGPLLMGMYRMAFKTMRGERPQLADMFNFEGRFLQAFLAFLIYAAIQFGL  180

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                                            G  ++L  +   +  +   F   ++ + 
Sbjct  181  P--------------------------GGGKGGLFAMLSFVISPMMTMLLAFVLPLILER  214

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
             +  +QA+ +   L+    W          ++   +  + A I            L+  P
Sbjct  215  KVDVVQAVNQVFKLIFSREW----------LMWWIVGVVFAAIVTTSPILCGVGVLVAGP  264

Query  303  FSFLYYYLIYSDLKA  317
            +      + Y D+  
Sbjct  265  WVISSAAVAYRDVFG  279


>EYU18031.1 hypothetical protein MIMGU_mgv1a013845mg [Erythranthe guttata]
Length=209

 Score = 44.0 bits (100),  Expect = 0.058, Method: Composition-based stats.
 Identities = 22/151 (15%), Positives = 43/151 (28%), Gaps = 10/151 (7%)

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
             +  I  +  V   S+   +  L   + +     V   +   GL A+ KSR L+ G    
Sbjct  58   HMGYIGGMAFVLVLSIPYFMGLLYITMIWHLASVVSVLEESYGLNAMRKSRGLIKGKMGI  117

Query  264  IFGRFVLLLVISLTLSFLTARIPYVG----------EAANLAFSLLLTPFSFLYYYLIYS  313
                FV+L +    +  +       G              L    +   F  +   ++Y 
Sbjct  118  SAAVFVVLGLSFFGIQHMFKIHVVFGHEGIVKRIVYGIICLVLLSISMLFQLIMETIVYF  177

Query  314  DLKANYRGPQHPPIKRQWLPLTAAIFGWMLI  344
              K+ +            + L         +
Sbjct  178  VCKSYHHENIDKSSSGSIVRLLNGSNLIGFV  208


>TNE54541.1 hypothetical protein EP338_06910 [Bacteroidetes bacterium]
Length=245

 Score = 44.4 bits (101),  Expect = 0.059, Method: Composition-based stats.
 Identities = 18/131 (14%), Positives = 38/131 (29%), Gaps = 2/131 (2%)

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
                     F+ I    +     + + L        +  L   + G  ++  +I  L+  
Sbjct  105  LTGAKFSGGFVRIITDLLRAVFILVVALFLNFMIMGIWWLFAWITGFHAIDSLIYWLINS  164

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
             +  F  Y  + +  G  +A+  S     G    +     L  +I L         P++ 
Sbjct  165  FFLGFAFYDYSLERYG--EAIFSSFGFGFGRMLHVLLTGALFNLIYLIPYAGILLAPFLV  222

Query  290  EAANLAFSLLL  300
               +    L  
Sbjct  223  TMVSTVVYLKT  233


>MBA4077435.1 hypothetical protein [Cyanobacteria bacterium PR.023]
Length=252

 Score = 44.4 bits (101),  Expect = 0.059, Method: Composition-based stats.
 Identities = 31/151 (21%), Positives = 52/151 (34%), Gaps = 0/151 (0%)

Query  129  IVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG  188
                 A       +  A           ++  +   A+I  G+  +    ++        
Sbjct  55   QWPFVAICMVGSNMSQAHLFADIGLRQSFSFGIVAAAFISAGILPIIKLFYVSYLAAPKY  114

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
              R   +GL  V  + L   L+  + G G +  I+PGLL  V         A +      
Sbjct  115  PGRVAMIGLWTVLLYVLGEWLISALGGLGMMFFILPGLLVMVRACLFLPAYALEGHHPFA  174

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            A E+S  L SG +W +     L  ++ L LS
Sbjct  175  AFERSWALTSGKYWLVSRYLGLPTILFLLLS  205


>CDO74879.1 hypothetical protein BN946_scf185004.g29 [Trametes cinnabarina]
Length=846

 Score = 45.5 bits (104),  Expect = 0.059, Method: Composition-based stats.
 Identities = 25/241 (10%), Positives = 55/241 (23%), Gaps = 8/241 (3%)

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             M   +    +  F  + L L           L         L ++         +    
Sbjct  436  LMLNTVALVSIYFFDWLLLPLAQEQQKWFHRNLGWFYQVLWLLPVVGISFYLNSSWCTLI  495

Query  237  YVLADDNIGGLQALEKSRLLVSG--HWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
                     G +A+ +     SG  +  A      +++  S+ LSF  A IPY+G  A  
Sbjct  496  AKRTFTLQHGSRAVAQPPSTYSGMLNALATSAYRAVMVFTSVVLSFALAYIPYIGPVAAF  555

Query  295  AFSLLLTPFSFLYY------YLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLL  348
             F   +  +    +      Y +   ++       +           +++    L     
Sbjct  556  VFLCWVDAYYCFEFIWIARGYSLARRVRHLEERWAYYFAFAALCMWGSSLANVALFALFF  615

Query  349  LVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRK  408
               +     +    +                    +      P           +     
Sbjct  616  PAYIIMAMYARPLPIDPYNPALSVGSFAGTSHDSDDAVRYPSPLVPIRVPVFAPVIFLND  675

Query  409  T  409
             
Sbjct  676  W  676


>TLM67218.1 hypothetical protein FDZ69_04835 [Deltaproteobacteria bacterium]
Length=213

 Score = 44.0 bits (100),  Expect = 0.059, Method: Composition-based stats.
 Identities = 12/33 (36%), Positives = 18/33 (55%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           ++CP CG       S +PA+  + +CPEC Q  
Sbjct  3   IKCPKCGKSGRIADSSIPAEGRNLKCPECAQIF  35


>KRP05744.1 hypothetical protein ABS25_02925 [Cryomorphaceae bacterium BACL18 
MAG-120507-bin74]
Length=309

 Score = 44.8 bits (102),  Expect = 0.060, Method: Composition-based stats.
 Identities = 30/180 (17%), Positives = 56/180 (31%), Gaps = 1/180 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L  + +L + L         L          +   + A L   +  + L LS + G    
Sbjct  122  LFVLPMLYVGLGTLAEAEPGLRFTQALRLALSAWPRTAALGGMMFILFLFLSTLLGISLS  181

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +   ++ +            S  LL  +       G   L++  L      +   + L 
Sbjct  182  ALYGDELMVLAQTHESDPAAFSQALLQEINWSASAFGLAALVLLTLGLIALVWMAPFALV  241

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
              N+  LQAL  S  +   H+ A+F   +L ++ S     L   +  +       F    
Sbjct  242  YHNMNVLQALRWSVQMTRPHFAALFPALLLWVLFSGLSGALVG-LNIILWPFTALFHFYC  300


>TVW80792.1 glycerophosphodiester phosphodiesterase, partial [Streptococcus 
pneumoniae]
Length=309

 Score = 44.8 bits (102),  Expect = 0.060, Method: Composition-based stats.
 Identities = 34/288 (12%), Positives = 74/288 (26%), Gaps = 16/288 (6%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             +     +         LL      L   ++       L      +  ++++   + +  
Sbjct  20   LLLAYYQIGLLFIGARHLLYHEKRTLLEYSRKVFRQSFLFMKQVTISKMAFIFFYVMMLF  79

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                  L       +         +     +VG    +L +      V   F    L  +
Sbjct  80   PLIRKILKIYYLNKIVIPEFIVAYVEDKYWLVGLVITVLALLLFYISVRLMFALPQLLFE  139

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG------------E  290
                 +A+E S        W      + ++V +    F+   IP +              
Sbjct  140  KKTVKEAVEYSLEKTKRQSWFYIWNLLWIIVKTYLF-FILLLIPILASQVLMDGLTHKES  198

Query  291  AANLAFSLLLT---PFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGL  347
                  + +L     +  L Y+LI           +  P +++   +   + G   I   
Sbjct  199  LVLGIINFVLIKNFHYMALTYFLIKFVSFLTGEELEITPRRKKDHWMRWGVMGCAGIFFA  258

Query  348  LLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLS  395
            L   +  +   A + L                   L R+   EP  + 
Sbjct  259  LEGYVYLEAPVAHRPLIISHRGVSDKNGVQNTVQSLERTAQLEPDFVE  306


>NXV60213.1 KRA43 protein [Molothrus ater]
Length=205

 Score = 44.0 bits (100),  Expect = 0.060, Method: Composition-based stats.
 Identities = 8/139 (6%), Positives = 15/139 (11%), Gaps = 2/139 (1%)

Query  24   KSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNR  83
                    C            +                        +         +C R
Sbjct  47   PQCCWPQCCWPYCCRPQCCRPQCCWPQCCWPQCCRPQCCWPQCCWPQCCWPQCCRPQCCR  106

Query  84   SFCLQPEREFRASGSGLRSI--SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALL  141
              C +P+                      W   C      L             +     
Sbjct  107  PQCCRPQCCRPQCCRPQWCWPQCCRPQCCWPQCCWPYCCWLQCCWPYCCFCGPALPQCCR  166

Query  142  LKPATWLNPQNQNWQWAIL  160
                     +         
Sbjct  167  PYCCWPQCCRPYCCWPYCC  185


>WP_089966368.1 hypothetical protein [Lihuaxuella thermophila]SEM99807.1 hypothetical 
protein SAMN05444955_104177 [Lihuaxuella thermophila]
Length=359

 Score = 44.8 bits (102),  Expect = 0.060, Method: Composition-based stats.
 Identities = 28/284 (10%), Positives = 67/284 (24%), Gaps = 28/284 (10%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            L I +L ++     +         +              + T        +      F+ 
Sbjct  79   LSILILFVLFFILILLPTAFQTAGSIAVAVEAAAFNRSDVGTFFSKGFKYTGKMFLFFLL  138

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                 + +F    +      +     I+L +++   +++L +   L          +L  
Sbjct  139  SSLLYLAVFVVGGIAFPFFATDNGGGIILGVLLTLAAIVLAVLLGLAL---MHAPVILIA  195

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI---SLTLSFLTARIP------------  286
            ++    QA+ KS  L    +  +FG      +I      + F+   I             
Sbjct  196  EHTKVTQAISKSFYLFKKAFGRVFGSAFYAFLITAGLFVIYFIVLLIFGSVFAGFSAAGT  255

Query  287  ----------YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTA  336
                       +G    +A S ++ P       L+       Y      P          
Sbjct  256  GGEEAAAIFSLIGNLVGMALSWVVGPVFTTITSLLIIHRYFKYLRHWINPHVPGGESAPG  315

Query  337  AIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQT  380
                         +    +         + +          +  
Sbjct  316  GNHPNTDFKPSFSIKTDAEPSQHSNRSDSDETFNWDTDGPEKNQ  359


>HCC69002.1 hypothetical protein [Nitrospiraceae bacterium]
Length=186

 Score = 43.6 bits (99),  Expect = 0.061, Method: Composition-based stats.
 Identities = 20/161 (12%), Positives = 51/161 (32%), Gaps = 1/161 (1%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
            +               +    G + ++      G + S  +         + ++      
Sbjct  2    FFQEGKKLFFPIAGFAIVASAGLVVVFFILGVFGGYGSSIISAYGEKETFIAVLTGTFFA  61

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
                +  ++  +    + F+    L  + IG L+A +K   L+       F  + +L++ 
Sbjct  62   LLLIVCSLVIAIGALAFVFYSVIALVVERIGPLKAFKKGFALIKE-EPKAFIFYAILILG  120

Query  275  SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
             ++ +FL   + Y      +   ++  PF    Y L     
Sbjct  121  YMSANFLLVLLVYPLSLIPVIGPIISFPFHLASYVLQSYLW  161


>MBC7793367.1 zinc-ribbon domain-containing protein [Clostridia bacterium]
Length=161

 Score = 43.2 bits (98),  Expect = 0.061, Method: Composition-based stats.
 Identities = 14/63 (22%), Positives = 19/63 (30%), Gaps = 0/63 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            V CPHCGA      S +P     A+C  C       P+ +         +         
Sbjct  4   QVTCPHCGARYQFERSLIPPGGYDAQCANCSGIFPVAPSPAPEEMAIALPSMSMVDTWLT  63

Query  63  RIP  65
             P
Sbjct  64  APP  66


>NIR99057.1 hypothetical protein [Gammaproteobacteria bacterium]
Length=83

 Score = 41.3 bits (93),  Expect = 0.061, Method: Composition-based stats.
 Identities = 8/37 (22%), Positives = 12/37 (32%), Gaps = 0/37 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
             V CP C A     +  +   K   +C +C      
Sbjct  13  VLVVCPSCDARYRIDADTVARPKVKFKCTQCGHLFPM  49


>HIF96739.1 hypothetical protein [Myxococcales bacterium]
Length=306

 Score = 44.8 bits (102),  Expect = 0.061, Method: Composition-based stats.
 Identities = 10/33 (30%), Positives = 15/33 (45%), Gaps = 1/33 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECC  33
           M  V C +C A      S++P + +  RC  C 
Sbjct  1   MI-VTCLNCDARFQLDESRVPEQGTHVRCSACK  32


>XP_029967131.1 basic proline-rich protein-like [Salarias fasciatus]
Length=872

 Score = 45.5 bits (104),  Expect = 0.061, Method: Composition-based stats.
 Identities = 10/211 (5%), Positives = 31/211 (15%), Gaps = 4/211 (2%)

Query  302  PFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQ  361
            P             +   +      + R                       +     + +
Sbjct  488  PEVCPSSVQFMLWGEPEEQKVWAVLVGRSPRSPQEPPGPPRTPQEPQKPPGAPGAPKSPR  547

Query  362  LLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVT  421
                 +D                        +          + +   +S+         
Sbjct  548  SSQGPQDPPGAPAAPRSPRSSQEPPGAPRSPQEPQDPPGAPAAPRSPRSSQEPPGAPRSP  607

Query  422  LFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHP  481
                      ++     +   +        +   A   +    +        ++      
Sbjct  608  QEPQEPPGASRSLQEPQEPPGASRSPQEPQEPPGAPRSLQDPQEPPGAPRSPQEPPGAPG  667

Query  482  AFHWVGINQTDENDLFSGIRSIYLRQGTQAE  512
            A       Q          RS     G    
Sbjct  668  APRSPPEPQEPPG----ASRSPQEPPGASRS  694


>KUK51213.1 75k gamma secalin [candidate division TA06 bacterium 32_111]
Length=393

 Score = 45.1 bits (103),  Expect = 0.061, Method: Composition-based stats.
 Identities = 10/47 (21%), Positives = 13/47 (28%), Gaps = 1/47 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQ  47
           M  V CP C  +       +P   +  RCP C               
Sbjct  1   MI-VECPKCHKKYEIEDMYIPIGGAPIRCPNCKNIFGIYVEPIDIPM  46


>MBO87689.1 hypothetical protein [Deltaproteobacteria bacterium]HCH63681.1 
hypothetical protein [Deltaproteobacteria bacterium]
Length=521

 Score = 45.1 bits (103),  Expect = 0.061, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V C +C A      +++  + +   CP C    
Sbjct  1   MV-VTCENCSARYKLDDNRISGRGAKITCPRCRHVF  35


>KAA0205950.1 hypothetical protein EDM68_03715 [Candidatus Uhrbacteria bacterium]
Length=302

 Score = 44.8 bits (102),  Expect = 0.062, Method: Composition-based stats.
 Identities = 32/213 (15%), Positives = 66/213 (31%), Gaps = 31/213 (15%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W+   +         LL ++  +  + +  L                   +A +  ++  
Sbjct  22   WKPTLKYTVWFFLAPLLFVIFTWISLLATGLG--GAQPGMGLFGLFILGYIALILGLVWA  79

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP------  224
               +   +  +    D+G  +  K  L ++ S   + IL  L      ++  +P      
Sbjct  80   SIALMQHVIGHAQGHDMGKRKPEKPALSYLPSLLWVTILATLPAVAAWIIAYLPLFFARD  139

Query  225  ------------------GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG  266
                               +   V F     ++  D   G+ A+++S  LV G WW    
Sbjct  140  SNAAGILLAVLFLAALVFTIWIGVSFSQSYLLVLTDEARGVAAIKRSYALVRGRWWKTLW  199

Query  267  RFVLL-----LVISLTLSFLTARIPYVGEAANL  294
            R ++      L++   LS     I  +G A   
Sbjct  200  RILVPNLAFQLIVWTILSLFYGAIFMIGFALFG  232


>QNR22563.1 hypothetical protein H4K34_09190 [Cryomorphaceae bacterium A20-9]
Length=310

 Score = 44.8 bits (102),  Expect = 0.062, Method: Composition-based stats.
 Identities = 25/171 (15%), Positives = 52/171 (30%), Gaps = 13/171 (8%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM-----  174
              + I+ L          +   +   ++       +          +  LG+        
Sbjct  46   YFISIFFLQEEAQRILTGNRTFVSFDSFGIFGQVIFALFFYAIFTLFSQLGIYAWVRKAY  105

Query  175  -------TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLL  227
                      ++  + +  + L     + L        ++ LL  +      L  IP   
Sbjct  106  EEDRAPRLIEVWRLVRRGILPLIGIAFILLIAGIITIAIISLLGFMQPLLIFLFFIPFFY  165

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
              V        +  +    + +L +S  LV  +WWA FG F L+ +I+  L
Sbjct  166  LLVRISLFPVAIVMEGRS-IDSLARSWRLVDQNWWATFGFFFLIGLITSIL  215


>TMB08116.1 tetratricopeptide repeat protein [Deltaproteobacteria bacterium]
Length=1050

 Score = 45.5 bits (104),  Expect = 0.062, Method: Composition-based stats.
 Identities = 9/41 (22%), Positives = 17/41 (41%), Gaps = 0/41 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAES  43
            ++CP+C A       ++P    S +CP+C         + 
Sbjct  2   RIQCPNCPAAYELDDGRVPPAGLSIKCPKCKTPFTVHRPKP  42


>NOG92060.1 hypothetical protein [Armatimonadetes bacterium]
Length=345

 Score = 44.8 bits (102),  Expect = 0.062, Method: Composition-based stats.
 Identities = 25/197 (13%), Positives = 57/197 (29%), Gaps = 30/197 (15%)

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGL--RHVGSFTLLLILLILVVGGGSLLLII  223
             +    + M            VG+   +   +     G   ++  +  L+   G  + I+
Sbjct  139  LLPKLTALMLVFFMYSFAGILVGMVFLLASAIITDQYGGAQVVAPIASLLGILGVSMGIV  198

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLV--SGHW--WAIFGRFVLLLVISLTLS  279
                    +      + ++N+G   AL ++R LV   G      I+G   ++  +   L 
Sbjct  199  ILPFAFARYALAPVAMVNENLGVKDALRRARELVTSKGAPSSMTIWGLLFVIFSLQYLLW  258

Query  280  FLTARIP------------------------YVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
               +                           ++  +      LL+ P  F     +Y D 
Sbjct  259  GGLSLPALGVLDFLQLQGVASSSLSLKALYYFLSVSGGYLAFLLVHPVLFSGLTYVYFDR  318

Query  316  KANYRGPQHPPIKRQWL  332
            +    G     + ++  
Sbjct  319  RVRIEGYDIELLAQEIW  335


>NHZ47757.1 hypothetical protein [Desulfovibrio sp. XJ01]
Length=229

 Score = 44.0 bits (100),  Expect = 0.062, Method: Composition-based stats.
 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  +RCP C  ER+  SSK+P+  + A CP+C     F 
Sbjct  1   MI-IRCPECQFERSIDSSKIPSSAAIATCPKCRHRFRFR  38


>HBP93529.1 hypothetical protein [Alcanivorax sp.]
Length=75

 Score = 40.9 bits (92),  Expect = 0.062, Method: Composition-based stats.
 Identities = 13/66 (20%), Positives = 18/66 (27%), Gaps = 0/66 (0%)

Query  5   RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
           RCPHCGA+       L   + + RC  C Q           T    +             
Sbjct  8   RCPHCGAQFKISDEHLGQARGAVRCGSCLQIFQATDHFVGETPARPHADDNDTTPTGDNA  67

Query  65  PSDRLE  70
              +  
Sbjct  68  GQWQYA  73


>WP_084235725.1 DUF975 family protein [Papillibacter cinnamivorans]SMC92047.1 
Protein of unknown function [Papillibacter cinnamivorans DSM 
12816]
Length=299

 Score = 44.8 bits (102),  Expect = 0.063, Method: Composition-based stats.
 Identities = 23/203 (11%), Positives = 56/203 (28%), Gaps = 14/203 (7%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
                + L  + +  +        S +    A+ L P  ++          +   +GL ++
Sbjct  95   QMILFFLSQVLIFAVTAPVILAGSEIYYALASGLKPGLKDIFKWYGDLRYSGRAMGLRFL  154

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
             G +   +     G+  +    +             IL+     L+L+   L   + + F
Sbjct  155  LGLIQWGLLAAFAGIPITFLWWVSLSDGAAPGSAAGILLRALLVLMLLGMVLATMISWSF  214

Query  235  CQYVLADDN---IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
                        +   +A+ ++   + G     F   +   +  L               
Sbjct  215  LPAHYILTTDPAVSVREAISRTNRFMRGRMVRFFLFRLSFALWYLLF-----------LV  263

Query  292  ANLAFSLLLTPFSFLYYYLIYSD  314
             + A    + P+  L       D
Sbjct  264  TSGAAIFYVYPYLELASAGFVRD  286


>MBJ26091.1 hypothetical protein [Alphaproteobacteria bacterium]MBR72952.1 
hypothetical protein [Rhodospirillaceae bacterium]
Length=221

 Score = 44.0 bits (100),  Expect = 0.063, Method: Composition-based stats.
 Identities = 8/40 (20%), Positives = 14/40 (35%), Gaps = 1/40 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M  V C +C         ++  +  + RC +C       P
Sbjct  1   MI-VNCLNCNTSYFVDPLEIGNQGRTVRCMKCSNIWHQQP  39


>CDQ74224.1 unnamed protein product [Oncorhynchus mykiss]
Length=3677

 Score = 45.5 bits (104),  Expect = 0.064, Method: Composition-based stats.
 Identities = 28/294 (10%), Positives = 56/294 (19%), Gaps = 8/294 (3%)

Query  320   RGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQ  379
             +           + +        ++   +  +   Q   +           Q   +    
Sbjct  928   QQDSPMQHSLLTVQVPLPQHSQPMVKPTVQQAPVVQRYQSPAHTVQQSSTLQSYPSSVPL  987

Query  380   TPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLK  439
                 N +    P            S           +       A          +    
Sbjct  988   MGQQNSASQSYPHTAPHIQQAASQSYPPSDPHTQQAASQSYPPSAPHTQQAASQSYPP--  1045

Query  440   LELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSG  499
                +  P   +    S +  +       A+          +P    V             
Sbjct  1046  ---TATPAAMVQGYTSLQPSVMGQAYPSAQAPQQAADPQTYPVAPIVQQQTGTTAQHSQ-  1101

Query  500   IRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLIL-QRLGSN  558
             I+   L  G Q +Q   +   +  + P    ++  +    G           L Q L  N
Sbjct  1102  IQVAPLPPGQQIKQQTQLPDPVHPSQPTH-PAVFSSPIHSGPDRGTVTAAAQLDQNLSQN  1160

Query  559   AVTLRFLGDRTDLLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMFDGNIESIT  612
                 + LG         AS          G                   + S  
Sbjct  1161  LSQRQALGQNQSQPQAPASQPPGSQTAGPGPASTSLTQQNQASASTGTTVTSAQ  1214


>WP_072697496.1 zinc-ribbon domain-containing protein [Desulfovibrio litoralis]SHN68061.1 
MJ0042 family finger-like domain-containing protein 
[Desulfovibrio litoralis DSM 11393]
Length=492

 Score = 45.1 bits (103),  Expect = 0.064, Method: Composition-based stats.
 Identities = 11/47 (23%), Positives = 19/47 (40%), Gaps = 2/47 (4%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQ  47
           M  V CP+C  +   P  ++    + ARC  C +    +P  +    
Sbjct  1   MI-VACPNCSTKYKLPDEQVRP-GAKARCSVCSEVFSIEPDYNTPPP  45


>MBI5895757.1 zinc-ribbon domain-containing protein [Desulfobacterales bacterium]
Length=72

 Score = 40.9 bits (92),  Expect = 0.064, Method: Composition-based stats.
 Identities = 9/36 (25%), Positives = 12/36 (33%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  + C  C          L +  S  RC +C  T 
Sbjct  1   MF-ITCQECNTTFRLDERLLKSTGSKVRCSQCRYTD  35


>RLA88716.1 hypothetical protein DRG20_05730 [Deltaproteobacteria bacterium]
Length=109

 Score = 42.1 bits (95),  Expect = 0.065, Method: Composition-based stats.
 Identities = 12/45 (27%), Positives = 15/45 (33%), Gaps = 1/45 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           M  V CP C  +      KL    + ARC  C    +     S  
Sbjct  1   MV-VECPKCKTKFMLDEKKLKHFYTKARCSICGHIFVIQRIPSTF  44


>WP_149285000.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Halomonas sp. Y2R2]QEM81990.1 glycerophosphodiester 
phosphodiesterase [Halomonas sp. Y2R2]
Length=624

 Score = 45.1 bits (103),  Expect = 0.065, Method: Composition-based stats.
 Identities = 23/187 (12%), Positives = 52/187 (28%), Gaps = 4/187 (2%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W          L       ++  A          A     Q  +   A+L   +  +   
Sbjct  71   WIYASVVAVLTLLYLQQAGMMLMAFDRRGSHAHQAMVAFWQTLHRLPALLSLAMIQVACH  130

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            L+ +   +          L       +R V      L +   +   G+   +       +
Sbjct  131  LAMIVVVLLCLDWLYQGMLGHLDAYYVRRVRPMEFWLFIASCLPVIGTWAALAGR--QLL  188

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
             ++     L  + +    AL++S  L   +   +    + +L I L L +  +     G 
Sbjct  189  HWWLALPCLILEKLSAPAALKRSHALTHDNLSGMAAAVISMLAIILGLPWAISF--AFGS  246

Query  291  AANLAFS  297
              +   +
Sbjct  247  MLSPLLA  253


>HGZ68260.1 hypothetical protein [Deltaproteobacteria bacterium]HHS47610.1 
hypothetical protein [Deltaproteobacteria bacterium]
Length=106

 Score = 42.1 bits (95),  Expect = 0.065, Method: Composition-based stats.
 Identities = 12/42 (29%), Positives = 15/42 (36%), Gaps = 1/42 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
           M   +C  CG +     S L    +  RC  C  T    P E
Sbjct  1   MIL-QCQACGTKYRLEDSLLKPSGTKVRCSRCGFTWRVYPQE  41


>EKK01429.1 hypothetical protein RBSH_03254 [Rhodopirellula baltica SH28]
Length=385

 Score = 44.8 bits (102),  Expect = 0.065, Method: Composition-based stats.
 Identities = 25/194 (13%), Positives = 49/194 (25%), Gaps = 8/194 (4%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M    CP C  +   P   + A   + +C  C   +    A+ + T       T      
Sbjct  1    MIQFTCPTCDKQLRAP---VAAAGKTGKCNGCSTPVKVPRAKEKATTPHPAAVTKTQPKP  57

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                P      Q  T + +              +   + + S          +     + 
Sbjct  58   ASPKPLHESPGQQTTSSNKPPAPKRPAGKGNPQQEILALIASELDGKVPRSPMNIPYQFT  117

Query  121  LLGIYLLG-----IVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
            +L + ++      +      +    +    T + P           A  A +L     + 
Sbjct  118  MLLVAMVMLLMPLLYCCLIGVACYGMYWYFTEILPVAMENLPRGRAAIFAILLYSAPVVA  177

Query  176  GSMFIYICKTDVGL  189
            G M I      V  
Sbjct  178  GCMMILFMVKPVFF  191


>MBA4391882.1 hypothetical protein [Syntrophus sp. (in: Bacteria)]
Length=44

 Score = 40.1 bits (90),  Expect = 0.066, Method: Composition-based stats.
 Identities = 8/37 (22%), Positives = 14/37 (38%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  V C +CG +     + +  +    RC +C     
Sbjct  1   MI-VHCKNCGTKFRFDETLIEGEGIWVRCNQCKNLFF  36


>WP_144067417.1 zinc-ribbon domain-containing protein [Ferrovibrio terrae]QDO96436.1 
hypothetical protein FNB15_03735 [Ferrovibrio terrae]
Length=322

 Score = 44.8 bits (102),  Expect = 0.066, Method: Composition-based stats.
 Identities = 7/39 (18%), Positives = 9/39 (23%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M    CP C        +         RC  C      +
Sbjct  1   MIL-NCPACATRFQVDPNAFGDAPRKVRCSVCRNVWRQE  38


>WP_073709516.1 hypothetical protein [Actinomyces liubingyangii]OKL47284.1 hypothetical 
protein BSR29_06615 [Actinomyces liubingyangii]
Length=338

 Score = 44.8 bits (102),  Expect = 0.067, Method: Composition-based stats.
 Identities = 13/134 (10%), Positives = 42/134 (31%), Gaps = 18/134 (13%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG----HWWAIFGRFV  269
            V       +I  +   +   F       +    ++A+++S  L  G    + +      V
Sbjct  205  VSLALFAAMIIIVYIELKLLFAPAAAVLEGSKPIEAIKRSWDLSRGLNLINLFTTIVTAV  264

Query  270  LLLVISLTLSFLTARI-PYVGEAAN-------------LAFSLLLTPFSFLYYYLIYSDL  315
            +  + +  ++ +   I   + +  +             L    + +P   + + L+Y + 
Sbjct  265  MATLAAFLVAVVLTLIQNLITQMISNPALASSLEILNELLLQAVFSPLVPITWCLLYVNA  324

Query  316  KANYRGPQHPPIKR  329
            +           ++
Sbjct  325  RFIKEDFWRQLPQK  338


>MBI2551191.1 hypothetical protein [Candidatus Uhrbacteria bacterium]
Length=258

 Score = 44.4 bits (101),  Expect = 0.067, Method: Composition-based stats.
 Identities = 31/203 (15%), Positives = 62/203 (31%), Gaps = 2/203 (1%)

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            +      + G             L                +L   V         +  + 
Sbjct  15   YRYNRATIFGFAGWLIIPVLCSFLVLLFAPPAWQIPLGIMVLGMDVFAHAWSGIHIGSAG  74

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV--GGGSLLLIIPGLLFCVWFFFCQ  236
             + +              L    +     IL +L++   GG +L ++PG L+   F F  
Sbjct  75   ALLLRGVPPASVALFSAALVRNRTKAAGFILALLLIGTAGGGILFLVPGALYLSLFSFAI  134

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
            ++   +   G  A+E SR+L  G +W      +   V+ +   FL   +P      ++  
Sbjct  135  WITLLEGKPGFDAMEASRILFQGRFWYTLWLTIGSPVVLIGSYFLFLLLPATWIIQSVFP  194

Query  297  SLLLTPFSFLYYYLIYSDLKANY  319
               L   S  ++ L+  +     
Sbjct  195  FTELEANSPGFFTLVVVNNIGER  217


>WP_185884924.1 zinc-ribbon domain-containing protein [Croceicoccus marinus]QNE05874.1 
zinc-ribbon domain-containing protein [Croceicoccus 
marinus]
Length=154

 Score = 42.8 bits (97),  Expect = 0.068, Method: Composition-based stats.
 Identities = 12/42 (29%), Positives = 17/42 (40%), Gaps = 0/42 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQR  45
           V C HC ++ + PS  L  +    RC EC         +S  
Sbjct  3   VTCDHCDSDYSVPSQLLGGEGRPLRCGECGSRWWQQGEQSTM  44


>MBE0564325.1 zinc-ribbon domain-containing protein [Krumholzibacteria bacterium]
Length=76

 Score = 40.9 bits (92),  Expect = 0.068, Method: Composition-based stats.
 Identities = 11/36 (31%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M T  C  C A       K+P +    RCP C    
Sbjct  1   MIT-TCTVCQARYQLEDDKVPRRVIRVRCPACSGVF  35


>MBI3099836.1 zinc-ribbon domain-containing protein [Planctomycetes bacterium]
Length=183

 Score = 43.6 bits (99),  Expect = 0.069, Method: Composition-based stats.
 Identities = 14/36 (39%), Positives = 17/36 (47%), Gaps = 3/36 (8%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           MPTV CP+C    + P S   A     RCP+C    
Sbjct  1   MPTVACPNCQMPLSVPES---AAGRKVRCPQCQTVF  33


>WP_189585541.1 hypothetical protein [Litorimonas cladophorae]GGX71005.1 hypothetical 
protein GCM10011309_21450 [Litorimonas cladophorae]
Length=284

 Score = 44.4 bits (101),  Expect = 0.069, Method: Composition-based stats.
 Identities = 26/184 (14%), Positives = 59/184 (32%), Gaps = 1/184 (1%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +    ++                 L      W  A+ L  +   +     +  + +  
Sbjct  53   FTVASAILMTTQLSEIVTSSSPEEFILEGAYWGWTAAVSLFGLLLTVWFQLVVIQTSYAS  112

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            I  TDV       L LR      +  ++  +V   G++ L+I  +     +     ++  
Sbjct  113  ITGTDVPANTH-SLALRLTIPMFVTALIYTIVCYIGTIPLLIGFIFVWPGWALAGPMMVH  171

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            +N G   +L  +     G+   I    +++ +I +T+  +   I  V    N+       
Sbjct  172  ENKGIFSSLGAAWTFAKGNKRYIILLLLVITLIGVTVYSVALGIGMVLTGVNVMGGDPTA  231

Query  302  PFSF  305
             F+ 
Sbjct  232  AFNM  235


>WP_008319499.1 hypothetical protein [Haloferax mucosum]ELZ95956.1 hypothetical 
protein C440_06687 [Haloferax mucosum ATCC BAA-1512]
Length=313

 Score = 44.4 bits (101),  Expect = 0.069, Method: Composition-based stats.
 Identities = 30/165 (18%), Positives = 52/165 (32%), Gaps = 7/165 (4%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              V   +    W+T +  +   +        + + +  +G   LL  L   V      LL
Sbjct  154  LLVVVAVGTAGWLTITRALDSERHSDSFTWYLGIFVGGIGISQLLTSLNFTVETLLFGLL  213

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            +   L   +   F          G   AL +S     GH W IF  F +  V +  L  +
Sbjct  214  LFVVLFLSLVRLFLFPGFLAAGYGPTTALRESVRQSRGHGWTIFWLFTMFAVSAWGLGHV  273

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPP  326
                 ++  A        + P   +   +I    +A    P+  P
Sbjct  274  AVAGGFLTTAL-------VAPVQAVSLAVIVQRCEARDEFPEKRP  311


>TMQ32887.1 hypothetical protein E6K70_16135, partial [Planctomycetes bacterium]
Length=223

 Score = 44.0 bits (100),  Expect = 0.069, Method: Composition-based stats.
 Identities = 8/28 (29%), Positives = 12/28 (43%), Gaps = 0/28 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARC  29
             + CP C A  N P  +L  +    +C
Sbjct  3   IQIVCPRCQATYNVPEDQLGKRARCKKC  30


>HGM99368.1 response regulator [Deltaproteobacteria bacterium]
Length=371

 Score = 44.8 bits (102),  Expect = 0.069, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 16/36 (44%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M    CP C A      +K+  +  + +CP+C  T 
Sbjct  1   MIA-GCPTCKARYQIDDAKVGPQGLNLKCPKCQTTF  35


>MST98034.1 hypothetical protein [Victivallaceae bacterium BBE-744-WT-12]
Length=331

 Score = 44.8 bits (102),  Expect = 0.070, Method: Composition-based stats.
 Identities = 30/329 (9%), Positives = 76/329 (23%), Gaps = 28/329 (9%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
               RCPHC A+          + +  +C +    +     E  + Q T   +        
Sbjct  3    FHFRCPHCNAKLEAEDDWNGMEAACPKCSQTITIVPETTPEKPKIQLTPITSPTLTEAQH  62

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSW------ELFC  115
                           N        C        +  +           S       +   
Sbjct  63   PVSTQSHPPKNPSASNKFPFICPSCGTLTDLDSSLQNQEYECPACCEKSIAVPATEKPCP  122

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQ---------------WAIL  160
              G  +     +      +   +      +   N                      +   
Sbjct  123  HCGEMIKFQAKICRFCKGSVEPNKSFSPSSVQRNSSLPQLMPATSNTIVIKELDTLFFWW  182

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
              ++A  +        +  ++ C      +  ++     +     +  L I V G   + 
Sbjct  183  WLSLALAIPTFGIGGIASAVFFCMLLYKYWNLVQTEPSSMTPGKAVGFLFIPVFGLYWMF  242

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
            + I GL               + +  +  +        G    I    +++   ++ +  
Sbjct  243  VSIWGLGKAFNALTDLGKQKLETMPLIACIC-------GATAPIAWGLMIIGANTIFIIG  295

Query  281  LTARIPYVGEAANLAFSLLLTPFSFLYYY  309
            +   + Y+G         ++T  +F    
Sbjct  296  VFFTLVYLGAIIGGLVFYIITMINFQNAA  324


>MBI2117027.1 zinc-ribbon domain-containing protein [candidate division NC10 
bacterium]MBI2456721.1 zinc-ribbon domain-containing protein 
[candidate division NC10 bacterium]
Length=275

 Score = 44.4 bits (101),  Expect = 0.070, Method: Composition-based stats.
 Identities = 11/37 (30%), Positives = 16/37 (43%), Gaps = 0/37 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
            T+ CP CG         +P +    +CP+C  T  F
Sbjct  4   ITITCPACGRSGAVDERAVPDRPMRLKCPQCQGTFTF  40


>WP_060929981.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Granulicoccus phenolivorans]
Length=313

 Score = 44.4 bits (101),  Expect = 0.071, Method: Composition-based stats.
 Identities = 21/98 (21%), Positives = 39/98 (40%), Gaps = 0/98 (0%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            L++     +L+I    F   F F   VL  + +    A+++S LL  G  W+  G  + +
Sbjct  152  LLILAWWTVLLILNYYFGTRFAFVPTVLTLERLPLPAAIKQSWLLTRGKVWSTLGHQLAM  211

Query  272  LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
              I  TL  +   +  +   + L  +   T F   +  
Sbjct  212  AAICGTLVGIGWALVALIWISTLTAAESTTGFGSTFLV  249


>WP_103017879.1 hypothetical protein [Salinibacter ruber]
Length=298

 Score = 44.4 bits (101),  Expect = 0.071, Method: Composition-based stats.
 Identities = 18/119 (15%), Positives = 35/119 (29%), Gaps = 14/119 (12%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
                   +    G LLL++PG +          ++ D   G  +A+ ++  L  GH W I
Sbjct  171  GAYAAASVATLAGLLLLVVPGAVVAAGLAPLMPLIVDTGQGPRRAIRRAWRLTEGHRWQI  230

Query  265  FGRFVLLLVISLT--------------LSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
            F  ++L+ + +                L                      +  +    Y
Sbjct  231  FNLYLLVWIAATVAAGMAAVVAFGTESLGGPAGLAGVGAGLLFGTGVGAWSLLARCCLY  289


>TND08756.1 hypothetical protein FD123_1972 [Bacteroidetes bacterium]
Length=321

 Score = 44.4 bits (101),  Expect = 0.071, Method: Composition-based stats.
 Identities = 24/177 (14%), Positives = 59/177 (33%), Gaps = 0/177 (0%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            L +    +V+    + + +++        +    +            L  S +   +   
Sbjct  75   LNLLWSYVVIIITSVVANMMVVGVVSYYFRLYREKGPGNFTVGDLAKLVFSRLPALLGTT  134

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                 + L  ++ LGL   G  +L        V    +  ++       +F+    V A 
Sbjct  135  ALMLLLVLVVALLLGLIIGGMASLGAGAAFFFVLIFFIGFLLICFPMWFYFYSIYIVRAT  194

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
            + +G  +A+ + R  +SG++W+ +    +  +    + F  A    +      A S 
Sbjct  195  ERVGIFEAMSRVRWAMSGNYWSTWLVMFVFYLCLGLIGFSVAMPQQIVFWILSASSA  251


>WP_149108714.1 hypothetical protein [Limnoglobus roseus]QEL13773.1 hypothetical 
protein PX52LOC_00631 [Limnoglobus roseus]
Length=596

 Score = 45.1 bits (103),  Expect = 0.071, Method: Composition-based stats.
 Identities = 11/38 (29%), Positives = 13/38 (34%), Gaps = 3/38 (8%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
             V CP+C A    P  K       ARC +C       
Sbjct  3   IEVACPNCQARLKAPDEK---AGKKARCKKCQHAFRLP  37


>WP_052608971.1 DUF975 family protein [Candidatus Izimaplasma sp. HR1]AIO18003.1 
hypothetical protein KQ51_00099 [Candidatus Izimaplasma 
sp. HR1]
Length=275

 Score = 44.4 bits (101),  Expect = 0.072, Method: Composition-based stats.
 Identities = 22/199 (11%), Positives = 64/199 (32%), Gaps = 18/199 (9%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +  + + LA     +      +   +  N  +   I+      I L ++ +    +I 
Sbjct  65   FMVAKMSLNLAHGAKNAGFDESLSPGESLLNVMYYVIIMQVISLLITLPINKIFFDAWIV  124

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
               T   L ++ +     +  F    +  +++V    + +    +          +++ D
Sbjct  125  SEDTFSNLIQNFQGSSPELLGFIWGYVGSMVLVVILLMFITYKIIYVA-------FIIID  177

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
              +G  +A  KS +   G+++ I G  +  +              ++           + 
Sbjct  178  QKVGVKEAFTKSFIYTKGNFFRIIGMNLFFIGWY-----------FLSVPTCGLLLFYVI  226

Query  302  PFSFLYYYLIYSDLKANYR  320
            P+  L    +Y +++    
Sbjct  227  PYETLSRTNLYLEIRIENG  245


>WP_012470329.1 zinc-ribbon domain-containing protein [Geobacter lovleyi]ACD95995.1 
MJ0042 family finger-like protein [Geobacter lovleyi 
SZ]
Length=294

 Score = 44.4 bits (101),  Expect = 0.072, Method: Composition-based stats.
 Identities = 23/284 (8%), Positives = 61/284 (21%), Gaps = 4/284 (1%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIA-TCPHCGLQ  61
             + CP+C A       ++P       CP C ++   +    + T                
Sbjct  2    RIECPNCKASGTINDLEIPDDGMMLACPRCKESFRVEKPRKKATSAFATNTCPSCGYSTF  61

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                 D        V      +      ++E       +R  + +               
Sbjct  62   CEEVFDECPHCGLDVKTVIERKRKEDVQKQELEMRNRNIRPDTVVALPQPSGIITPAAPA  121

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
             G      +  FA  F  +                               + ++      
Sbjct  122  AGEKPAISLAGFANGFDPVAAVGWGVAVWAAVFLLLGGWGVIDYLGTDLQAQLSEQSIEP  181

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +    V         ++ +     L      +      +  +  ++         Y    
Sbjct  182  VSAWQVFWGYGFLPWIKLLYGLAALSAAFGFLQRAAWGMQGVQQVVMASLVLVPMYETGQ  241

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
              +  ++++         +        ++  +    L FL   +
Sbjct  242  YVVWVVKSIAPPWW---AYLVEGVSAMLVSALWMAPLYFLLLYL  282


>NCB30924.1 hypothetical protein [Clostridia bacterium]
Length=48

 Score = 40.1 bits (90),  Expect = 0.072, Method: Composition-based stats.
 Identities = 11/40 (28%), Positives = 12/40 (30%), Gaps = 0/40 (0%)

Query  5   RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQ  44
           RCPHC          L A +   RC  C Q          
Sbjct  7   RCPHCQTSFRVRDEHLSAARGMVRCGSCLQVFKAAEHFID  46


>TMI14652.1 DUF3426 domain-containing protein, partial [Betaproteobacteria 
bacterium]
Length=73

 Score = 40.9 bits (92),  Expect = 0.072, Method: Composition-based stats.
 Identities = 10/32 (31%), Positives = 15/32 (47%), Gaps = 0/32 (0%)

Query  5   RCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           RCP CG       ++L A+    RC +C +  
Sbjct  6   RCPVCGTAFRVQRAQLAARGGRVRCGKCGEVF  37


>HDM25517.1 hypothetical protein [Thermoplasmatales archaeon]
Length=196

 Score = 43.6 bits (99),  Expect = 0.073, Method: Composition-based stats.
 Identities = 28/184 (15%), Positives = 66/184 (36%), Gaps = 10/184 (5%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
              +        +      K +   +      ++        +     + ++  +F+++  
Sbjct  18   VFVSAWTESMLLDVEKTGKTSIEGSFLRVRSKFWKEFFACIFTGFVSALLSIFIFLFVLT  77

Query  185  TDVGLFRSMKLGLR--HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
              V     +   +     G +      LI +      +LIIP  +  + F+F    +   
Sbjct  78   IAVAFGMGIFQEISTGRPGIYGFHAPALIGLGLLIIFILIIPFTILSISFWFSGTAIMKH  137

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            ++G   AL  S      ++W I   F+L++ +        + +PYVGE   L  + L  P
Sbjct  138  DVGLFSALRLSWRFTMRNFWRIVFLFLLIIGV--------SLVPYVGELLGLFLTPLWMP  189

Query  303  FSFL  306
            ++++
Sbjct  190  YAYM  193


>HBN26829.1 hypothetical protein [Desulfobacteraceae bacterium]
Length=481

 Score = 44.8 bits (102),  Expect = 0.073, Method: Composition-based stats.
 Identities = 11/41 (27%), Positives = 16/41 (39%), Gaps = 1/41 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
           M  + C  CG   N   + L A  +  RC  C +   + P 
Sbjct  3   MI-ITCNECGTSFNVDETLLDAAGNKFRCSVCKEIFFYYPP  42


>HDM09293.1 hypothetical protein [Desulfobacteraceae bacterium]
Length=36

 Score = 39.8 bits (89),  Expect = 0.074, Method: Composition-based stats.
 Identities = 14/36 (39%), Positives = 18/36 (50%), Gaps = 1/36 (3%)

Query  1   MP-TVRCPHCGAERNTPSSKLPAKKSSARCPECCQT  35
           M   +RCP+CG     P  K+P+    A CP C Q 
Sbjct  1   MIVEIRCPYCGYSGEVPKEKIPSNAKWAVCPRCRQR  36


>HIM68373.1 hypothetical protein [Verrucomicrobia bacterium]
Length=775

 Score = 45.1 bits (103),  Expect = 0.074, Method: Composition-based stats.
 Identities = 47/488 (10%), Positives = 118/488 (24%), Gaps = 13/488 (3%)

Query  85   FCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKP  144
            +          S + L   S ++   W     R              +         L  
Sbjct  223  YWSWLGGGLAVSAALLALASWIIPYRWRETKLRQQTPRLAKRPTGDASRPSAVQQAALLD  282

Query  145  ATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT  204
            A          +   +   +   L+         ++ +    +G    +   L  V SF 
Sbjct  283  ANPAGWLVWRMRTFRVSRRLLMFLMITVGALLLGYMALGGVVIGESWWIMGSLIWVASFW  342

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAI  264
            + L +    V           L   +              G   A+ +  +       ++
Sbjct  343  IRLEVARHAVTTIHEAKASGALEQILV---TPIDERHFRRGHFAAMVRFWMWPVIVLASL  399

Query  265  FGRFVLLLVIS-------LTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKA  317
                +LL ++S         + F    +  +   A     L    +S  ++ L  ++  A
Sbjct  400  PLAAILLSIVSAGWGSNEALIGFSIMGMMGILFVAVFFGDLFALYYSGCWFALRSNNYSA  459

Query  318  NYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQP  377
             +            +      +    +  +  +      L++ Q         +      
Sbjct  460  AFWKTFGFVYLLPTIGSVFLCWLGQFVWLVTDIIFIVWPLTSLQGNFRRVVSGEYGLPYV  519

Query  378  QQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLW  437
            +     +    E  +       ++   +           +            + +     
Sbjct  520  KPASMGHSESNETNEPEDLRRRQVEQQRDILPPPPKLSEVKVKLEEEKNKPTEFKQMQGE  579

Query  438  LKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLF  497
            L+ E  +  +       S    I+    D    + +  +S E PA   +      E D+ 
Sbjct  580  LERERFEIGDALDEDFVSTEQVIETPESDPGVGVVEAVYSLEAPAPVSLEDTGEQEGDVE  639

Query  498  SGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGS  557
              I           +   ++    E  +   + +    +       Q  G+ +    L  
Sbjct  640  EAIEQSSPEDEPTDDGSITVEFFCETCMGEGVVTFDTEQIFKD---QFEGQSVAWNGLLK  696

Query  558  NAVTLRFL  565
            +   L + 
Sbjct  697  SIDRLSYD  704


>MBI5695645.1 zinc-ribbon domain-containing protein [Nitrospirae bacterium]
Length=736

 Score = 45.1 bits (103),  Expect = 0.074, Method: Composition-based stats.
 Identities = 12/35 (34%), Positives = 16/35 (46%), Gaps = 0/35 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
             V CPHCG  R+   +++PA      CP C    
Sbjct  4   VRVTCPHCGYHRDVSETRVPATPVRVTCPACKNVF  38


>WP_082025357.1 zinc-ribbon domain-containing protein [Methyloceanibacter caenitepidi]
Length=371

 Score = 44.8 bits (102),  Expect = 0.075, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 12/34 (35%), Gaps = 1/34 (3%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP CG       +  P +    RC +C    
Sbjct  4   LIICPACGTRYQIK-AAFPPEGRKVRCAKCSHVW  36


>MBI4774594.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=246

 Score = 44.0 bits (100),  Expect = 0.075, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V CP C  + N    ++P   +  RC  C    
Sbjct  1   MI-VTCPECATKFNLDPKRVPGDTAKVRCSRCKFVF  35


>WP_197528571.1 hypothetical protein [Aeoliella mucimassa]QDU58229.1 hypothetical 
protein Pan181_44620 [Aeoliella mucimassa]
Length=298

 Score = 44.4 bits (101),  Expect = 0.075, Method: Composition-based stats.
 Identities = 47/311 (15%), Positives = 87/311 (28%), Gaps = 24/311 (8%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            V+CP CG + + P S   A     +C  C  T +    E   +     ++          
Sbjct  5    VKCP-CGKQYSVPES---AAGKKGKCNSCGLTFVIPSPELSASNVAPAVSGQSEPVTASA  60

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
               ++L     T N                 +    +   S  L+ +           L 
Sbjct  61   EQGNKLAPHDDTENYNPFAAPTTSSAPDIPSSGVGTIDIRSAGLSWAI---------YLV  111

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
             Y   I L    +    L+    + +     +    L   +   LL  +     +     
Sbjct  112  FYGAVITLVSPVLLVTSLVISLVFRSDAIWIFIVGALGLVLLGYLLNTAGRVLCLVRAPS  171

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
              + GL   +   +    S   L +L +  +    +  +   L    W  F  Y+ A   
Sbjct  172  LGNAGL---LVASIACDLSSIALTVLSVADITDHQITSLAGLLALASWLLFAFYLKAIAK  228

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
            + G  +L             I G  V L +++  L  L          A    S  +  +
Sbjct  229  LIGDASLVDEAR----RLTLILGVAVALPIVNSLLVLLFGS----AMIAFGLGSFAIMLY  280

Query  304  SFLYYYLIYSD  314
              + Y L+   
Sbjct  281  GLVMYMLLIRH  291


>HAC90890.1 hypothetical protein [Planctomycetaceae bacterium]
Length=550

 Score = 44.8 bits (102),  Expect = 0.077, Method: Composition-based stats.
 Identities = 27/299 (9%), Positives = 65/299 (22%), Gaps = 19/299 (6%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             V CP+CG            + +  +CP+C       P  +    +  +I          
Sbjct  195  RVACPNCGTLIYVNQ---GDEGTKTKCPDCFTFFKVPPPPANWKPSHVSIRHNNFDTSAP  251

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                D +    +    R        + E E               A   +         +
Sbjct  252  LTGDDAIREIDRRRQLRTQQMLELAESELEDERYNQRDFGQDFDTATFVQGTFGFFKDTM  311

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             I  +        +  A +       +         ++       +              
Sbjct  312  AIGFMLGYSLMFAVVFAGIHHGINDPSKIFLVLAAVLVGLLTVLPMFSTVMALL--ESAA  369

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP--------------GLLF  228
             +              +     ++   ++     G L+ +                 +L 
Sbjct  370  NRQLRVSQWPGFNVYEYAADIFVIAGAVMASALPGYLVGLWLGGELDGSGRIQINGAMLS  429

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
                F    +   DN    Q +    +         +  + L  +I+   + L   +  
Sbjct  430  TFLLFPVILISMLDNGSVFQPISWDVIRSFKLAAEAWAGYYLKTLIAFATTMLFWYLLL  488


>MBA2279251.1 hypothetical protein [Candidatus Saccharibacteria bacterium]
Length=317

 Score = 44.4 bits (101),  Expect = 0.077, Method: Composition-based stats.
 Identities = 29/184 (16%), Positives = 58/184 (32%), Gaps = 5/184 (3%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
            I    +     W   Q         + +  Y  +  +     ++  I    +     + +
Sbjct  132  ILVMFVSLAVIWALRQTYEQHQDRTIKSSFYKGMYPAVPYILVWFVIILQMIPALIGLSI  191

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
                  +   +     ++     LL +   +       F  Y++   ++  ++AL  +R 
Sbjct  192  YGLVQSNGLAVSGFEQILWLFVLLLGLSISIYLTSSSIFASYIVTLPDMTPMRALRSARK  251

Query  256  LVSGHWWA-----IFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
            LV    +      IF   VL L+  +    L    P   E   L FSL +      Y Y 
Sbjct  252  LVKFRRFTILRKVIFLPAVLSLLAGIIFFPLVLFAPVAAEVLFLLFSLAVIIIGHSYLYQ  311

Query  311  IYSD  314
            +Y +
Sbjct  312  LYRE  315


>MBE6381682.1 hypothetical protein [Lentisphaerae bacterium]
Length=297

 Score = 44.4 bits (101),  Expect = 0.077, Method: Composition-based stats.
 Identities = 22/237 (9%), Positives = 57/237 (24%), Gaps = 7/237 (3%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            + CPHCG E +    +         C  C +  +    +++      +  T     +   
Sbjct  3    INCPHCGTEYDIEQREF---GKYVTCQVCGKGFVAGARQTRAQMVRPDNETSEQVDVPPE  59

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
                +    +             +         G     I       W          + 
Sbjct  60   SACLK----AWAKYWFARLLIVFVVSFVLGLLGGFVGSLIGCAPFADWPKDFPERMAHIV  115

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            +Y   I   +     A        +         + + ++    +     ++  M + + 
Sbjct  116  VYAAWIGGIYMSWRLAWWGYKRFAVGNLLPELSASNMRSSWMIPIFVNIALSLLMPMGLQ  175

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
               +G++  +            L+   I V       +    +   + F     V+ 
Sbjct  176  IAALGVYGYIVWAFTIWIVVDYLMFRFISVNLLCGKEIDTQWICPAILFCIFIVVMH  232


>MBA04561.1 hypothetical protein [Gammaproteobacteria bacterium]HAO55256.1 
hypothetical protein [Gammaproteobacteria bacterium]
Length=245

 Score = 44.0 bits (100),  Expect = 0.077, Method: Composition-based stats.
 Identities = 25/208 (12%), Positives = 57/208 (27%), Gaps = 20/208 (10%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI---LLGLSWMTGSMF  179
             I  +            +       +               +      LL  +       
Sbjct  29   AIMYVISQRNLFLKALLVPGCVLIGIAVLTDTLATQQSPLLLVLFTANLLVQTIFAVISH  88

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG------------GGSLLLIIPGLL  227
                     +            S  LL +L    +              G  + I+  ++
Sbjct  89   RLTLIGSTAVGSWFSFDWSRRESLFLLRVLACAFLTGIPAAFALSIPNVGIPIAILLAII  148

Query  228  FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL-SFLTARIP  286
                        A ++     ++ ++  +   H   +FG  VL  +I + L ++L +++P
Sbjct  149  LITRLSLIFPATAMNHQT---SISQAWQMSRRHQIPLFGIVVLFPLIFVLLPAWLLSQLP  205

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSD  314
            Y     +   S++ T F+     L Y +
Sbjct  206  Y-ATILSSLLSMVGTIFTLAAVSLAYRE  232


>PCJ64762.1 hypothetical protein COA61_18855 [Zetaproteobacteria bacterium]
Length=256

 Score = 44.0 bits (100),  Expect = 0.078, Method: Composition-based stats.
 Identities = 9/70 (13%), Positives = 22/70 (31%), Gaps = 0/70 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  ++C HC  +         A     +C +C + +     +S +       A+      
Sbjct  1   MEYIQCQHCHKKYRVNEQVRAAAGRMVKCKDCGEAIEIVIFQSLQDDVPMPEASQEKKHD  60

Query  61  QRRIPSDRLE  70
             ++    + 
Sbjct  61  GEQVEPSHIP  70


>OJV85663.1 hypothetical protein BGO43_02970 [Gammaproteobacteria bacterium 
39-13]
Length=263

 Score = 44.0 bits (100),  Expect = 0.078, Method: Composition-based stats.
 Identities = 31/190 (16%), Positives = 66/190 (35%), Gaps = 10/190 (5%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             I           IF     +   +    ++     I +     IL+    +   MF+ I
Sbjct  63   LITSFFFCCVIYAIFLKHYQQNLKYGELISKGLTRVIPMIFSGIILISPLILMMGMFMII  122

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                V    +  +  +         I+ +++    S+      L+  V+ +    ++   
Sbjct  123  SVIIVPDSNNASMTSK---------IIFLIMQLIISVATFFYVLIVFVYCYLSGVLIVCK  173

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            N      +++S  LVS +WW +F R + LLVI +  S L      V + A+   ++    
Sbjct  174  NSTAWIGIKQSWRLVSDNWWYVFSRMLSLLVIIIIPSILL-YHFLVQQTADGIITVFTFS  232

Query  303  FSFLYYYLIY  312
                   +++
Sbjct  233  LGPCLMVILH  242


>KKU86576.1 hypothetical protein UY17_C0043G0002 [Candidatus Beckwithbacteria 
bacterium GW2011_GWC2_47_9]
Length=652

 Score = 45.1 bits (103),  Expect = 0.078, Method: Composition-based stats.
 Identities = 48/512 (9%), Positives = 122/512 (24%), Gaps = 38/512 (7%)

Query  132  AFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM--TGSMFIYICKTDVGL  189
                     L           +       +     + +G        +MF  +    + L
Sbjct  54   FIYYAILTPLSWLLGIAGTFAEWMLQPAYIVNSTVVQIGWGVTRDLANMFFILILLGIAL  113

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF------CQYVLADDN  243
               +        +  +L+++ +L+     +  I          FF          L  +N
Sbjct  114  DYILFQSFGVKHALPMLIVVALLINFSLPIAGIFIDFANVFSNFFISKITGDCLNLNVEN  173

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
             G   A+ ++  L   +        ++ ++ +  L    A   +      L  +  L   
Sbjct  174  CGFTMAIAQNLNLTKLYETGGAANLIVNMIFAALLMLGMAFTLFALGIMFLLRTGWLYAL  233

Query  304  SFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLL  363
              L   ++                 + +     A      +   +LV  +  N      +
Sbjct  234  LILLPLVLVLMPFPKTSQYFGRWTNKFFQWTMFAPVAMFFLYLSMLVFQANINPGETNNI  293

Query  364  SAGKDIQQRLGTQPQQTPDLNRSLPE-EPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTL  422
                 +      Q          + +     +        L   +  +  G  +   +  
Sbjct  294  KTMSYLVNGAQAQVWNQGVGGAFIEQVIKYVVVWFFMLGSLMAAQSMSITGAGAALGMIK  353

Query  423  FADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPA  482
             A+++            L  +     +  +       + K+                   
Sbjct  354  SAEKWAGGKVKNAGKRGLSAAGRGVGADKKMEDLARGLQKIPGIGGALSSS------VRG  407

Query  483  FHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQLTRNDIGKT  542
                     ++ +  +          +    +      ++  LP             G  
Sbjct  408  VASKTKTAMEKQEALTAKEKANFEGLSDEALLDEYNTYIKSNLP-------------GNK  454

Query  543  LQIGGKQLILQRLGSNAVTL-----RFLGDRTDLLNVHASN---SHAEPLREIGFTWQKS  594
             +  G  L+L +   NA+T+         ++T  L   A +   +H           +  
Sbjct  455  AKANGIALMLSQRKGNALTVKRADGSIDENKTAELREAAYDNAKAHGNRAVMETLMEKDP  514

Query  595  GDAFS-LRQMFDGNIESITVLVAGDSM-TQSY  624
                  + Q +DG      +        T+ Y
Sbjct  515  RVKMHAINQKYDGKPAGTKIDGKTKDEATRDY  546


>MBE9521216.1 zinc-ribbon domain-containing protein [Proteobacteria bacterium]
Length=30

 Score = 39.4 bits (88),  Expect = 0.079, Method: Composition-based stats.
 Identities = 11/27 (41%), Positives = 15/27 (56%), Gaps = 0/27 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARC  29
            V CPHC  + +    K+PA  + ARC
Sbjct  2   LVICPHCKKKHSIDEKKIPANVTKARC  28


>MBA4419182.1 hypothetical protein [Syntrophus sp. (in: Bacteria)]
Length=73

 Score = 40.5 bits (91),  Expect = 0.080, Method: Composition-based stats.
 Identities = 8/28 (29%), Positives = 14/28 (50%), Gaps = 0/28 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARC  29
             VRCP CG +   P+++   +    +C
Sbjct  5   IIVRCPQCGTKNRIPANRQGEQGICGKC  32


>MBF0186832.1 hypothetical protein [Magnetococcales bacterium]
Length=731

 Score = 45.1 bits (103),  Expect = 0.080, Method: Composition-based stats.
 Identities = 30/311 (10%), Positives = 79/311 (25%), Gaps = 24/311 (8%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            VRCP CG   +    ++P      +C  C         +                  + R
Sbjct  18   VRCPDCGMRYHVNFGRIPLLGKKVKCRRCLHMFPVLKQDDWYRDNLPVSMAAYVIYRKHR  77

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRA-----SGSGLRSISQLLADSWELFCRRG  118
            + +  L+     +   +   +   +   E        +   + +  + L  S   +   G
Sbjct  78   MRTHNLKYDMDRIQSFKTWLAEYHRQRPETGRGLRAVTVQDVHTYLEELDRSEGDYPHHG  137

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY----------IL  168
                     GI+     +   ++ +  + ++   ++         +             L
Sbjct  138  MVATLTEFFGILYNEGLVDVNVMEQLRSDMDDVLEDLGPFSHEVELYIKHRRTMNDVASL  197

Query  169  LGLSWMTGSMFIYICKTDVGLFR--------SMKLGLRHVGSFTLLLILLILVVGGGSLL  220
                     M +++ + +  L                R   +  L      +      L 
Sbjct  198  PFDLVRIKEMEVFLAERERSLITADDDDLEEFFVHISRRFSTEKLDGFRSTVEGLCDVLS  257

Query  221  LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF  280
             +    +  +           +  G   A +  R  V      +    V+ +V  + + F
Sbjct  258  SMEVVDVTHLCMADELEWQEIEEHGP-DAKDVKRSRVKREKQRLKVNNVVFVVGMVAICF  316

Query  281  LTARIPYVGEA  291
            +     +  + 
Sbjct  317  IIWAATWFLDI  327


>NYZ76501.1 hypothetical protein [Candidatus Micrarchaeota archaeon]
Length=273

 Score = 44.0 bits (100),  Expect = 0.080, Method: Composition-based stats.
 Identities = 22/186 (12%), Positives = 56/186 (30%), Gaps = 2/186 (1%)

Query  134  APIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSM  193
              I     +         +   +           ++ +  +   +               
Sbjct  85   ITIAVIKPMDEIIGKKTVSDWTKHFFPQLFNVIKVMVVRGLVSLIIFTPLVLVALGSIPA  144

Query  194  KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKS  253
             + L+   +  LLL   IL++    ++ +I   +      F +  +     G   A+ +S
Sbjct  145  LIALKGNINPALLLGGGILLILIVGVISVIVWSIVMFLLTFLEVEIVLGGRGIFGAMSRS  204

Query  254  RLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
              LV  + W +F   +L  +I + +  L   +  +     +  + +  PF      L+  
Sbjct  205  IRLVMSNLWDVFVFSILWFLIRMGVGVL--NLMLMCTICLIPLTFVTIPFIVEPVELMSK  262

Query  314  DLKANY  319
             +    
Sbjct  263  VVLWRK  268


>MBI3070741.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=494

 Score = 44.8 bits (102),  Expect = 0.080, Method: Composition-based stats.
 Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V CP C +  +    K+PA+ +  RCP+C    
Sbjct  1   MLAV-CPGCRSTYSVRDEKIPAQGAQIRCPKCQTAF  35


>MSR31338.1 hypothetical protein [Gemmataceae bacterium]
Length=280

 Score = 44.0 bits (100),  Expect = 0.081, Method: Composition-based stats.
 Identities = 22/229 (10%), Positives = 59/229 (26%), Gaps = 0/229 (0%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            + CP C A+     + +  +     C +  Q    +        ++ +  T P   ++  
Sbjct  5    IDCPSCAAKLRLADNLIGKQIQCPFCKKPLQIPSPEQPPEAALGSSASKETLPEEEMESG  64

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
            +       + K    R  +       E  +     G   I  ++A    +        L 
Sbjct  65   MEEPPKSKKRKKKRVRSNSSEPEPVNEYAWAWWLYGGVGIVVVVAVLARILMNPESSGLV  124

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
             +    ++   PI   +        +              +        ++   + I   
Sbjct  125  RFYATQLMIMLPISMVIFFAAVLLSSLVFGELDIGEFHVALVKAFFLCLFVNLVLLITFG  184

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
                       + +          IL+ +      +L+++   +    F
Sbjct  185  GMISFFAWLFGVMVFFRLDPWETRILVFINWALNWILIMVLAAVMMSKF  233


>MBF0285537.1 zinc-ribbon domain-containing protein [Magnetococcales bacterium]
Length=800

 Score = 45.1 bits (103),  Expect = 0.081, Method: Composition-based stats.
 Identities = 11/38 (29%), Positives = 14/38 (37%), Gaps = 0/38 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
            VRCP C         +L  +  S RC +C       P
Sbjct  2   LVRCPQCETRFAVSEQQLGPRGRSLRCSQCRTVFFQPP  39


>MBF0310654.1 zinc-ribbon domain-containing protein [Magnetococcales bacterium]
Length=686

 Score = 44.8 bits (102),  Expect = 0.082, Method: Composition-based stats.
 Identities = 9/38 (24%), Positives = 14/38 (37%), Gaps = 1/38 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
           M  V+C +CG+      + L  K    +C  C      
Sbjct  1   MI-VQCENCGSRFEVDQNLLGTKGRKLKCSRCKHIFFQ  37


>RQD73129.1 hypothetical protein D5S03_13350, partial [Desulfonatronospira 
sp. MSAO_Bac3]
Length=187

 Score = 43.2 bits (98),  Expect = 0.083, Method: Composition-based stats.
 Identities = 25/179 (14%), Positives = 43/179 (24%), Gaps = 0/179 (0%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             + CP C  +++ P  K+P     A CP+C +   F   E     T ++           
Sbjct  2    RITCPKCMFQQDVPDEKIPPHAKKATCPKCKEKFQFRDLEEPEEFTLEDEPAAEEEASSD  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                D    Q +     +       +          G     Q             W  L
Sbjct  62   SREHDFATYQDEKPAQEKSREEDQEESLWSKLEDLGGPTEEPQREQTGPGESSASPWENL  121

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              Y              L   P                   V+ I +  S++     ++
Sbjct  122  QHYGFFPGFFETVKRVMLAPGPFFRKMQPGGLGMPLAFFILVSVIQVLASFLWNMTGMF  180


>HBH06630.1 hypothetical protein [Flavobacteriales bacterium]
Length=295

 Score = 44.4 bits (101),  Expect = 0.083, Method: Composition-based stats.
 Identities = 17/115 (15%), Positives = 44/115 (38%), Gaps = 1/115 (1%)

Query  200  VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSR-LLVS  258
            +    ++  +  L+     ++  +  +     +      +A D+ G ++A   S   +  
Sbjct  149  IIMAAVISGIAGLLHWSLGVIGFVALIGIAYRWLIVPAAIALDDKGVMEAFTLSMKKISF  208

Query  259  GHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
            G  +  FG  VL+++    L  + + IP +  + +    +L    + L   +I  
Sbjct  209  GKAYVYFGISVLVIIALFILIVILSVIPAILGSLSEIGIVLQYIINALSGGVITM  263


>MBF0476205.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=150

 Score = 42.4 bits (96),  Expect = 0.083, Method: Composition-based stats.
 Identities = 12/34 (35%), Positives = 16/34 (47%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP C   R+ P  K+P K   A CP+C    
Sbjct  2   LITCPQCNFSRDIPDQKVPYKPVRAICPKCHHRF  35


>WP_182054359.1 DUF975 family protein, partial [Escherichia coli]MBA5790322.1 
DUF975 family protein [Escherichia coli]
Length=105

 Score = 41.7 bits (94),  Expect = 0.083, Method: Composition-based stats.
 Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 1/77 (1%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
               SLLLI+PG++    +    ++L D+ NI  L A+ +SR +++GH   +FG  +  L+
Sbjct  1    FLWSLLLIVPGIIKTYSYSQTFFILRDNPNISELDAITESRHMMNGHKGRLFGLSLTFLL  60

Query  274  ISLTLSFLTARIPYVGE  290
              L    +      +  
Sbjct  61   WYLIPLAVAIAGTVIVA  77


>QKJ99168.1 hypothetical protein HND40_06140 [Ignavibacteriae bacterium]
Length=215

 Score = 43.6 bits (99),  Expect = 0.084, Method: Composition-based stats.
 Identities = 19/110 (17%), Positives = 36/110 (33%), Gaps = 0/110 (0%)

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
            LF         +            +      +L I  +   V       ++  + IG   
Sbjct  48   LFLIYLTVFLLIIVAVSGGTSGSGLGIFLFFILFILLVFIMVKMCLAYMIMLYERIGIWA  107

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
            ++++S  L    WW  FG   +L +I   + F+     Y+     +  SL
Sbjct  108  SIQRSFYLTKNKWWFSFGLIFVLSLIQSLMGFIFQIPQYIIMVTTMFNSL  157


>MBC8067732.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=142

 Score = 42.4 bits (96),  Expect = 0.085, Method: Composition-based stats.
 Identities = 11/50 (22%), Positives = 15/50 (30%), Gaps = 1/50 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTD  50
           M  VRC  C  E      ++    +S RC  C      D    +      
Sbjct  1   MI-VRCASCNTEFALDDRQVGPDGASVRCSVCQSVFRIDGEALEDEPPWQ  49


>RZC46974.1 hypothetical protein C5167_039955 [Papaver somniferum]
Length=387

 Score = 44.4 bits (101),  Expect = 0.085, Method: Composition-based stats.
 Identities = 30/217 (14%), Positives = 64/217 (29%), Gaps = 16/217 (7%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
                  L  I+ L   L    +   ++    T  +   +             I+  L  +
Sbjct  148  HWVFELLYAIFTLFFTLLTTSMVVYIVACIYTSRDITFKRVIGVFPKVWGRLIVTFLWCL  207

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
               +        +GLF    L +   G     +++  + +    ++     +     +  
Sbjct  208  LVWV--IYTGVAIGLFLWFFLSVNDGGQENDKVLIFGICLFIPFMI---GSVYMENVWSV  262

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG-----  289
               +   ++  G +AL KS  L+ G  W     F+ L      + F  + +   G     
Sbjct  263  AMVISVLEDDYGRKALGKSMKLIKGKVWVSSSVFLTLHFALSGVVFAFSLLVVYGDILSL  322

Query  290  ------EAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                    A     +LL  F+ +   + Y   K+   
Sbjct  323  ERKIYVGIACSVLLMLLIHFTLVIQTIFYFVCKSYND  359


>WP_181549701.1 zinc-ribbon domain-containing protein [Desulfosalsimonas propionicica]MBA2880027.1 
putative Zn finger-like uncharacterized 
protein [Desulfosalsimonas propionicica]
Length=1083

 Score = 45.1 bits (103),  Expect = 0.085, Method: Composition-based stats.
 Identities = 14/67 (21%), Positives = 20/67 (30%), Gaps = 1/67 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V C  CG      SS +    S+ RC  C    +  PA S+      +         
Sbjct  1   MI-VTCEACGTSYKVKSSMIRPSGSTVRCSRCQHVFVAYPAVSESQFAKPSEKQAETPPP  59

Query  61  QRRIPSD  67
           +      
Sbjct  60  EADFSHP  66


>NCC21459.1 hypothetical protein [Alphaproteobacteria bacterium]
Length=273

 Score = 44.0 bits (100),  Expect = 0.085, Method: Composition-based stats.
 Identities = 12/70 (17%), Positives = 16/70 (23%), Gaps = 1/70 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M    CP+C       +  L       RC  C      +P E      T   A+      
Sbjct  1   MIL-TCPNCSVRYLLDAQVLAPDGRLVRCSACQNVWHQEPDEDFEEFDTPKAASGDDDFE  59

Query  61  QRRIPSDRLE  70
                     
Sbjct  60  FVPAGVRPQP  69


>VEN74250.1 hypothetical protein EPICR_30185 [uncultured Desulfobacteraceae 
bacterium]
Length=312

 Score = 44.4 bits (101),  Expect = 0.086, Method: Composition-based stats.
 Identities = 11/72 (15%), Positives = 17/72 (24%), Gaps = 1/72 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  + C  C  +     S+L    S  RC  C       P  +   ++            
Sbjct  1   MI-IHCGKCDTKYRLDESRLEKNGSRVRCSSCGDVFTAYPPGAPDAESLFEETPDLTDFP  59

Query  61  QRRIPSDRLEIQ  72
                 D     
Sbjct  60  DPSTLYDDAPDP  71


>WP_089815367.1 hypothetical protein [Halomicrobium zhouii]SFR94349.1 hypothetical 
protein SAMN05216559_1505 [Halomicrobium zhouii]
Length=266

 Score = 44.0 bits (100),  Expect = 0.088, Method: Composition-based stats.
 Identities = 31/174 (18%), Positives = 62/174 (36%), Gaps = 11/174 (6%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            + ++ +   L          L     +           L      I+LG+        + 
Sbjct  51   VFLFAVPFYLGGIIGTLEEGLHGRATVGRFFSAGTSNYLSIFGGTIVLGIVTFVLYFVVG  110

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                 + +F      +   GS   +    + V+  G LL ++  LL   +  F    +  
Sbjct  111  FVGLILSIF------VLGFGSMADVTSAAVAVLLVGVLLSLLVVLLPWFFLQFFPAAVVV  164

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLA  295
            D++G + + ++S  LV  ++ ++ G  +L  +IS     L + IP V   A  A
Sbjct  165  DDLGLVDSFKRSGSLVKNNFLSVVGFDLLAFLIS-----LVSYIPTVYMLALSA  213


>HEE45818.1 hypothetical protein [Candidatus Dadabacteria bacterium]HEL85211.1 
hypothetical protein [Candidatus Dadabacteria bacterium]HEQ90104.1 
hypothetical protein [Candidatus Dadabacteria bacterium]
Length=376

 Score = 44.4 bits (101),  Expect = 0.088, Method: Composition-based stats.
 Identities = 7/37 (19%), Positives = 14/37 (38%), Gaps = 1/37 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M  ++C  C  +     S++    +  RC +C     
Sbjct  1   MI-IQCERCRRKFRIDDSRIQPPGNRVRCSKCGNVFF  36


>WP_153577741.1 DUF975 family protein, partial [Bacillus thuringiensis]MRB24758.1 
DUF975 family protein [Bacillus thuringiensis]
Length=93

 Score = 41.3 bits (93),  Expect = 0.089, Method: Composition-based stats.
 Identities = 17/88 (19%), Positives = 43/88 (49%), Gaps = 1/88 (1%)

Query  198  RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLL  256
            +++     L ++  + +   SLLLI+PG++    +    Y+L ++ +    +AL +S+ +
Sbjct  2    KNLFKSIKLGLMQAIFLFLWSLLLIVPGIIKYFSYSMSYYILVENPDYTASEALRESKRI  61

Query  257  VSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            + G    +F  ++  +   L  +F+   
Sbjct  62   MKGQKLKLFVLWLSFIGWFLLAAFIGMF  89


>TXD38075.1 hypothetical protein FRC98_04040 [Bradymonadales bacterium TMQ4]
Length=320

 Score = 44.4 bits (101),  Expect = 0.089, Method: Composition-based stats.
 Identities = 13/69 (19%), Positives = 24/69 (35%), Gaps = 0/69 (0%)

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
                   +L  ++   +QAL ++  LV+G W  +F  F+L    SL +  +         
Sbjct  208  ALLPYVAILFLEHKSPMQALRRNIELVTGRWGLMFVCFLLTGFWSLMMLMIAMSATMALT  267

Query  291  AANLAFSLL  299
                     
Sbjct  268  VLTHVLLAP  276


>MTI60200.1 hypothetical protein [Firmicutes bacterium]
Length=190

 Score = 43.2 bits (98),  Expect = 0.089, Method: Composition-based stats.
 Identities = 17/160 (11%), Positives = 49/160 (31%), Gaps = 7/160 (4%)

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
              L     +  +       G      + L    +   +   L  +    S++  +  +++
Sbjct  30   FRLFLALFAANLIALGFAFGFLLLSFMVLLFAVTIFKINYTLPGIFTLSSIIFFLTSIIY  89

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGH--WWAIFGRFVLLLVISLTLSFLTARIP  286
             + F F  +++  +N    +A  KS  LV  +     +F   + L +I++ L  + +   
Sbjct  90   QLRFAFIPFLVVLENTDSFEAWNKSSRLVKVNSLEGKLFLTGMFLFLIAIMLIIVFSGFS  149

Query  287  YVGE-----AANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
            +           + +      +     ++           
Sbjct  150  FANIMNYKYLIRMLYEYWGVFYLIYNLFIYRFYNVNCRER  189


>RPH70928.1 hypothetical protein EHM78_09510 [Myxococcaceae bacterium]
Length=749

 Score = 44.8 bits (102),  Expect = 0.090, Method: Composition-based stats.
 Identities = 10/39 (26%), Positives = 13/39 (33%), Gaps = 0/39 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
             + C  C      P  K+ A+    RC  C  T    P
Sbjct  55  VIIPCRQCHTRFKVPDGKVKARGLKVRCSRCGHTFRIYP  93


>PWA83336.1 hypothetical protein CTI12_AA172870 [Artemisia annua]
Length=406

 Score = 44.4 bits (101),  Expect = 0.090, Method: Composition-based stats.
 Identities = 19/213 (9%), Positives = 61/213 (29%), Gaps = 10/213 (5%)

Query  70   EIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGI  129
             +    +   + +       E     +         +    W  +         +  +  
Sbjct  41   PLTLIFIAHSQISHHLFYNIETSRLFTYDDSDRYRNISVTDWIYYWLFKIIYFTLLTIFS  100

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
            +L+ A I   +    A          +    +    ++     +M   ++  +      +
Sbjct  101  LLSTAAIVFTIASVYAGRDVAFRHVIKIVPRIWKKLFVTFVYIYMALFLYNVVYGVIAVI  160

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQA  249
             R++                  +++    +L +   L   V +     V   +++ G  A
Sbjct  161  LRAIF----------GYTTFSFVLLLVLLVLYLYGFLYLSVVWQLASVVTVLEDLNGFNA  210

Query  250  LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            ++K ++L+ G          ++ V+ L +  + 
Sbjct  211  MKKGKILLYGKKKVGMPIAFVMYVLLLGIMIVL  243


>NPD87431.1 hypothetical protein [Asgard group archaeon]
Length=454

 Score = 44.4 bits (101),  Expect = 0.090, Method: Composition-based stats.
 Identities = 26/235 (11%), Positives = 73/235 (31%), Gaps = 12/235 (5%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            + I+ +G+ ++   I S + L  A  L+  +  ++  + +     ++LG   +   +   
Sbjct  18   IVIFWIGLAISLIGIVSTIALSVALQLDTDDWRFRAVLSVFICGVLILGFGLIVRRLRRK  77

Query  182  ICKTDV--GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW-FFFCQYV  238
              +  V   +   +   L +      ++ + I        L+++ G+L  ++ +F   Y 
Sbjct  78   GIEDFVLGAITAFLGFILLYFPVLMFIMNIAIFQREYPYFLIMLGGILVIIFGYFMEVYD  137

Query  239  LADD----NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV-----G  289
            L            +AL++    V+            L+ +++ +      +P +      
Sbjct  138  LNIKFLQMMKRLKEALKRMWERVNWKLVRSPWNIFTLIGLTVIILAAIGILPLLEKRYYY  197

Query  290  EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLI  344
                    + L          I+  +            K            +  +
Sbjct  198  IIGAGLILINLIFHFRKELAEIFKTIGRIIETITQAWWKVTKQIPRILKKFFKWL  252


>XP_013980498.1 PREDICTED: proline-rich protein 36-like [Salmo salar]
Length=962

 Score = 44.8 bits (102),  Expect = 0.090, Method: Composition-based stats.
 Identities = 24/222 (11%), Positives = 34/222 (15%), Gaps = 0/222 (0%)

Query  286  PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIP  345
            P++    N   S L  PF                  PQ P      LP+           
Sbjct  360  PFLSGLPNPFLSGLPNPFLSGLPNPFLISQWPTQPLPQLPVQTLPQLPVQPLTQWHAQTL  419

Query  346  GLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSK  405
                     Q           + + QR      Q PD                  L    
Sbjct  420  PQRPAQPLTQKPDQPIPQKPDQPLPQRPAQPLHQWPDQPLPQMPAQPLPQWPAQPLPQWP  479

Query  406  QRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLD  465
                       L          W     P    +                   +      
Sbjct  480  DHPLPQRPAQPLPQWPTQPLPQWPAQPLPQRPAQPLPQRLAQPLPQWPAQPFPQWPTQPL  539

Query  466  DDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQ  507
                     Q   +                      + +L Q
Sbjct  540  PQRPAQPLPQWPAQPRPQTHAQPRPQTPAQFLPQKPAQFLPQ  581


>OIP66012.1 hypothetical protein AUK29_01630 [Nitrospirae bacterium CG2_30_53_67]PIS38125.1 
hypothetical protein COT35_02435 [Nitrospirae 
bacterium CG08_land_8_20_14_0_20_52_24]PIV84935.1 hypothetical 
protein COW52_04990 [Nitrospirae bacterium CG17_big_fil_post_rev_8_21_14_2_50_50_9]PIW86223.1 
hypothetical protein 
COZ95_00420 [Nitrospirae bacterium CG_4_8_14_3_um_filter_50_41]PIX86808.1 
hypothetical protein COZ32_01430 [Nitrospirae 
bacterium CG_4_10_14_3_um_filter_53_41]PJA77296.1 hypothetical 
protein CO150_01570 [Nitrospirae bacterium CG_4_9_14_3_um_filter_53_35]
Length=220

 Score = 43.6 bits (99),  Expect = 0.091, Method: Composition-based stats.
 Identities = 9/60 (15%), Positives = 15/60 (25%), Gaps = 0/60 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + C  CG        +L  +    RC  C   L+    +    Q             + 
Sbjct  2   LIHCKGCGKAYRVDEKRLTPEGIRVRCRHCGSVLMIRLRKEHEMQAVKEEEPAAAVPPEP  61


>RLB48936.1 hypothetical protein DRJ42_22120 [Deltaproteobacteria bacterium]
Length=274

 Score = 44.0 bits (100),  Expect = 0.093, Method: Composition-based stats.
 Identities = 14/100 (14%), Positives = 38/100 (38%), Gaps = 1/100 (1%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              G+++L I  +++    +     +  + +G   A ++S+ L++     +    + L V+
Sbjct  130  IVGTIVLRILRVIWTTATYVVLPAMVIEGMGFFPAFKRSKDLMAQDPTQVGVGIIGLGVM  189

Query  275  SLTLSFL-TARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
               LS +      ++    +     +L    F     +Y 
Sbjct  190  FSLLSLVTIGVGGWLAGVISSLIHPILGGLVFFTMVNVYW  229


>KFZ27160.1 hypothetical protein KQ78_00619 [Candidatus Izimaplasma sp. HR2]
Length=291

 Score = 44.0 bits (100),  Expect = 0.093, Method: Composition-based stats.
 Identities = 24/167 (14%), Positives = 54/167 (32%), Gaps = 18/167 (11%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV-------  214
                  +  +  +   ++         L   +           L   +   V+       
Sbjct  99   LGWGLFVSLIFSVYLYIYWDYVLFFFDLIAFISSEAYLNNPDILGSYIENYVIGAPTTKT  158

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
               S +  I  ++  + F F  +++ D +    +AL+KS L+ SG+WW +F   +  L+ 
Sbjct  159  LIISTVYSIFVIIITIRFSFALFIIGDTDENIFEALKKSWLITSGNWWRLFFFPLSFLLW  218

Query  275  SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
               + F                 + +TP+  +    +Y+ L      
Sbjct  219  IFAVIFTFG-----------LAIIYVTPYMAVAQASMYNRLLKESEF  254


>NBU30202.1 hypothetical protein [Actinobacteria bacterium]
Length=314

 Score = 44.0 bits (100),  Expect = 0.093, Method: Composition-based stats.
 Identities = 11/78 (14%), Positives = 29/78 (37%), Gaps = 1/78 (1%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
                L L +  L   + ++F    +  + +  + A++ S  L       +    +  L++
Sbjct  184  LFIGLPLAVLALYLTIGWYFTPQTVVIEGVKPIAAMKASAALTRRSRLRVISILLATLLV  243

Query  275  SLTLSFLTAR-IPYVGEA  291
            +  +S   A  I ++   
Sbjct  244  TGFISLAVATPISWIATL  261


>MBS39610.1 hypothetical protein [Rhodobiaceae bacterium]
Length=66

 Score = 40.1 bits (90),  Expect = 0.093, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  ++CP C    +    K   K    +C +C    
Sbjct  1   MI-IQCPACNTSFSVSKDKFGTKNRKVKCSKCYFVW  35


>CAB4072221.1 unnamed protein product [Lactuca saligna]
Length=515

 Score = 44.4 bits (101),  Expect = 0.096, Method: Composition-based stats.
 Identities = 21/237 (9%), Positives = 52/237 (22%), Gaps = 22/237 (9%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
                I                +   +   +         + L  +  I L      G++ 
Sbjct  284  HFPTITYSTYHGFSGKPIKFFVALNSLTFSFFPLVSTACVALVLLFLISLTFLLFVGAIV  343

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
            +        L     +      +     +++I+             L   + +     V+
Sbjct  344  MLGQNLGFVLIDYNSIHFTWFSAVVGATLIVII-------------LYVHMNWSLAFVVV  390

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL---------TARIPYVGE  290
              ++  G   L +S  LV G         +   V      ++         +     +  
Sbjct  391  VAESKWGFAPLIRSWYLVKGMRSVSLWLLLYFGVFLGCTVWVNSDALHAMSSQTYALLPM  450

Query  291  AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGL  347
                +  +    +S     ++Y   K  +                   F     P  
Sbjct  451  ILGSSMLMWFLLWSTAANTVLYMYCKTFHGELAIKMADGVAHNYINLPFDDEKAPHA  507


>WP_147729643.1 zinc-ribbon domain-containing protein [Methylobacterium sp. WL116]TXM87917.1 
hypothetical protein FV223_26035 [Methylobacterium 
sp. WL116]
Length=146

 Score = 42.4 bits (96),  Expect = 0.097, Method: Composition-based stats.
 Identities = 9/35 (26%), Positives = 17/35 (49%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            + CP C +E    ++++  +  S RC  C +T  
Sbjct  2   LIVCPTCASEYRIETARVGTEGRSVRCAACRETWF  36


>HAZ62755.1 hypothetical protein [Armatimonadetes bacterium]
Length=107

 Score = 41.3 bits (93),  Expect = 0.098, Method: Composition-based stats.
 Identities = 9/41 (22%), Positives = 14/41 (34%), Gaps = 0/41 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
           M + RCP C      P   +  ++  A C +  Q       
Sbjct  1   MISYRCPGCDQRLEVPPHAVHDEQRCAACGQTSQVPAAAYH  41


>HDP81212.1 zinc ribbon domain-containing protein [Spirochaetes bacterium]
Length=453

 Score = 44.4 bits (101),  Expect = 0.098, Method: Composition-based stats.
 Identities = 45/313 (14%), Positives = 78/313 (25%), Gaps = 18/313 (6%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            TVRCP CG   +     +   +    C +C    I    ES                   
Sbjct  40   TVRCPGCGTAYSITFDFVKNTRYRVNCKKCLARFIITFPESCTVPPAPAADRPDEALSPS  99

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                       K  +  R +                G     + +  +            
Sbjct  100  APKPPAGPTAPKPRSPYRES-----IYTINGFLRAVGSSFTRKKILPASLAVAVVYALGS  154

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL-----------GL  171
             I  L   L        LL               + +  A VA I L             
Sbjct  155  LIGALLSRLPLPAGADRLLTPFTLSKQMLMFFVIYLMGSALVAGITLDETRENRRPLGVY  214

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG-SLLLIIPGLLFCV  230
                 +    I      +   ++L +   GS  L+  LL  ++      L +       V
Sbjct  215  LRGVTAHVAPIAAGAAAVLLLIELAVLVFGSIPLVGTLLYALLFIPVYGLSLAVAAAAVV  274

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLV-SGHWWAIFGRFVLLLVISLTLSFLTARIPYVG  289
             F+F   ++A    G  +++ +        ++  +F   +L         FL        
Sbjct  275  GFWFYAPIMARHRGGFRRSMIELYRFTVRHNFSLLFNIAILAAGALAATGFLYLLHSLGL  334

Query  290  EAANLAFSLLLTP  302
             AA     + + P
Sbjct  335  AAALGLSRVFIGP  347


>CCD00318.1 protein of unknown function [Azospirillum baldaniorum]
Length=87

 Score = 40.9 bits (92),  Expect = 0.098, Method: Composition-based stats.
 Identities = 9/29 (31%), Positives = 11/29 (38%), Gaps = 0/29 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPEC  32
           V CPHC      P ++L A      C   
Sbjct  7   VVCPHCDTTNRVPRARLGAGGKCGACHRP  35


>WP_069655230.1 DUF975 family protein [Enterococcus plantarum]OEG10112.1 hypothetical 
protein BCR22_06585 [Enterococcus plantarum]
Length=309

 Score = 44.0 bits (100),  Expect = 0.098, Method: Composition-based stats.
 Identities = 29/266 (11%), Positives = 74/266 (28%), Gaps = 13/266 (5%)

Query  71   IQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIV  130
                         +  +         G           D  +   +    L  +  +   
Sbjct  55   HYHSWQEDYAEKANRSMDDYERGYEDGYSDGYDDGFYQDDSDDDSQNEKNLHSLATIPGT  114

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
                   S    +  T          + + LA +  ++L    +  +    + + +  L 
Sbjct  115  SLTHQGRSVTYSETTTLETGIGGLVGFLVWLAFLLVMILYRGMIQWAAVDNVERRNFSLK  174

Query  191  RSMKLGLR-HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV-LADDNIGGLQ  248
                  ++ +         L+ L     SLL +IPG++  + +    Y+   D  +   +
Sbjct  175  TVFISFIKENGKRTVSANSLMALYTFLWSLLFVIPGVIKQLSYGMTNYLLKKDPTLTAKE  234

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYY  308
            A++ SR+L+ G+           ++      F                S+ + P+  +  
Sbjct  235  AIDLSRVLMKGYKLEYLIFSYSFILWQFAAFFSFG-----------LVSVYVIPYYSVSE  283

Query  309  YLIYSDLKANYRGPQHPPIKRQWLPL  334
             L +  + A+         +  +   
Sbjct  284  VLFFDRIVADKHHLFTQEKEAGFADF  309


>PSP97841.1 hypothetical protein BRC89_10555 [Halobacteriales archaeon QS_4_70_19]
Length=318

 Score = 44.0 bits (100),  Expect = 0.099, Method: Composition-based stats.
 Identities = 19/99 (19%), Positives = 42/99 (42%), Gaps = 0/99 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +   LL  +    ++ ++          ++        +  L   L  + +  G +  
Sbjct  118  MVLVTALLAETLHVVAIRVFARDARAFPREALHDLAPRAVASFLANSLAAVAIFAGLVAF  177

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGH  260
            ++PG+L  + F+  + V+A +  G + AL +S  LV+GH
Sbjct  178  LLPGILLAIAFYLVRAVVAIEGAGVVAALRRSWQLVAGH  216


>NQV23783.1 hypothetical protein [Rhodopirellula sp.]
Length=374

 Score = 44.4 bits (101),  Expect = 0.099, Method: Composition-based stats.
 Identities = 10/31 (32%), Positives = 11/31 (35%), Gaps = 0/31 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPEC  32
               CP CGA    P +   AK S   C   
Sbjct  3   IEFNCPTCGAAIRVPDAAAGAKGSCPTCHTK  33


>PLY12567.1 hypothetical protein C0624_00835 [Desulfuromonas sp.]
Length=271

 Score = 44.0 bits (100),  Expect = 0.099, Method: Composition-based stats.
 Identities = 25/274 (9%), Positives = 62/274 (23%), Gaps = 23/274 (8%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             + CPHC   ++   + +P       CPEC  T  F   +        +    P   +  
Sbjct  2    LITCPHCQYAKDVNQAWIPPGDCQVECPECNGTFFFSHTQGAS----ISQPQAPVENMDC  57

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                           C      +  + +             +   + S     + G+ + 
Sbjct  58   PACGLAQPKGDHCTGCGIVYAKWQKRHQATEVEDEEFGFESTATASFSVHKADKGGFWIR  117

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
                    L    +                      + +                     
Sbjct  118  VAANFVDSLVLMVVLGIPFYFLIFNEFIGMYGSMMQMQMGGFP-----------------  160

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                  + + M   +       +   L+ L+     ++                 V++++
Sbjct  161  SDDPAYMQQFMAEAMEIQQRMMMYGALVNLLGLLYYIVPTAISGQTLGKKVCGIRVVSEN  220

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
               G   L        G W + F   +  ++++ 
Sbjct  221  GKVGW--LRAILRETVGKWISAFILGIGFIMVAF  252


>KKT11990.1 hypothetical protein UV92_C0036G0006, partial [Parcubacteria 
group bacterium GW2011_GWA1_43_27]
Length=277

 Score = 44.0 bits (100),  Expect = 0.100, Method: Composition-based stats.
 Identities = 16/161 (10%), Positives = 52/161 (32%), Gaps = 7/161 (4%)

Query  141  LLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHV  200
              +  T        W++ I L +   + + L+ +       +    +             
Sbjct  109  FHEFVTKPAKPIWWWRFLISLFSALVLGMVLTSLIPGKLQEVITEALSNPWRSLGWGALW  168

Query  201  GSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGH  260
                 +++++++V   G  L ++   ++ +       V        +++      L    
Sbjct  169  AFIVPIMVIVLMVTIIGLPLALVLAAIYFIGLILAPIVAGASLGWFIKSKSGEGWLTKQR  228

Query  261  WWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
                    ++++++ + +  L   IP++G    L  +L   
Sbjct  229  -------LLVVVLVGIFIYRLIVFIPFIGGLVGLVGALWAW  262


>OPX19969.1 hypothetical protein BZ151_06540 [Desulfobacca sp. 4484_104]RLA86810.1 
hypothetical protein DRG58_11575 [Deltaproteobacteria 
bacterium]
Length=302

 Score = 44.0 bits (100),  Expect = 0.10, Method: Composition-based stats.
 Identities = 10/101 (10%), Positives = 22/101 (22%), Gaps = 1/101 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  + C  C  + +    ++  + +  RC  C           +       +        
Sbjct  1    MI-ITCEKCETKFHLDEDRIKGQSAKVRCSHCQHVFEVTKETEEDADLLAYLKEESDIPS  59

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLR  101
            +    +      +             L   R   A  S   
Sbjct  60   EEADETSESPTAATESPSISFPAPESLPRARPRPAWFSRRW  100


>HID76639.1 hypothetical protein [Planctomycetaceae bacterium]
Length=55

 Score = 39.8 bits (89),  Expect = 0.10, Method: Composition-based stats.
 Identities = 9/38 (24%), Positives = 13/38 (34%), Gaps = 3/38 (8%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
               CP CGA        + +   +ARC  C   +   
Sbjct  15  IHFPCPQCGARYRVS---VDSAGKTARCKRCQALMPIP  49


>WP_179943892.1 zinc-ribbon domain-containing protein, partial [Wolbachia endosymbiont 
of Operophtera brumata]
Length=79

 Score = 40.5 bits (91),  Expect = 0.10, Method: Composition-based stats.
 Identities = 7/60 (12%), Positives = 13/60 (22%), Gaps = 0/60 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            ++C +C         ++       +C  C         E         I        Q 
Sbjct  2   KIQCHNCTKTYLVSRGQIGESGRKVKCTNCNHMWHEYLKEMSSELCPAGIQEKKANWRQS  61


>HIJ19836.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=54

 Score = 39.8 bits (89),  Expect = 0.10, Method: Composition-based stats.
 Identities = 10/32 (31%), Positives = 13/32 (41%), Gaps = 3/32 (9%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPEC  32
           M    CP CGA  N   S++P K       + 
Sbjct  1   MI---CPKCGAVYNVDDSEIPDKGVRKEGRDW  29


>OPX37867.1 hypothetical protein B1H13_12190 [Desulfobacteraceae bacterium 
4484_190.3]
Length=179

 Score = 42.8 bits (97),  Expect = 0.10, Method: Composition-based stats.
 Identities = 12/37 (32%), Positives = 15/37 (41%), Gaps = 0/37 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
             + CP C   +  PS K+P       CP C Q   F
Sbjct  3   IEITCPFCQFSKRVPSEKIPDGVKWVTCPRCRQRFEF  39


>HEN14015.1 hypothetical protein [Schlesneria paludicola]
Length=81

 Score = 40.5 bits (91),  Expect = 0.10, Method: Composition-based stats.
 Identities = 16/91 (18%), Positives = 33/91 (36%), Gaps = 11/91 (12%)

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
             F+   +VL D++  G++ L +S+   + +W A+F   +               I  +G 
Sbjct  1    MFWPYVFVLVDEDPPGIECLSRSKDYTAQNWGAVFVLVLAAF-----------AINLLGV  49

Query  291  AANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
             A     +   P + L   + Y  +      
Sbjct  50   CALGIGLIFTAPLTSLMMAVAYCKMSGQRTM  80


>WP_129791223.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Sphingosinicella sp. CPCC 101087]
Length=267

 Score = 43.6 bits (99),  Expect = 0.10, Method: Composition-based stats.
 Identities = 20/163 (12%), Positives = 47/163 (29%), Gaps = 0/163 (0%)

Query  132  AFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFR  191
             +  +    +          +            +  +    ++       +      L  
Sbjct  56   PWLLVLPLAIAASMVGTLAISFLALRPGASVAESLQVGLRRFIYLLGAALVIGISAALLA  115

Query  192  SMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALE  251
               + L    +          +     L  +   L F V       V A + +G L+ + 
Sbjct  116  VPLIILAGAAAMGGGEAGAASLAALLMLAALPVLLFFWVRLMLMTPVAAMEEVGPLRIIA  175

Query  252  KSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
            +S  L  GH+W + G   L+ V ++ + F+   +  +   A  
Sbjct  176  RSWELTRGHFWKLLGFVALVAVAAIVVMFVVTMLGGLLVFALA  218


>HBL46452.1 hypothetical protein [Planctomycetaceae bacterium]
Length=544

 Score = 44.4 bits (101),  Expect = 0.11, Method: Composition-based stats.
 Identities = 33/325 (10%), Positives = 78/325 (24%), Gaps = 39/325 (12%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIAT-------  54
              VRC  C    +       A+    RC EC   +    A++ +T+++ +          
Sbjct  3    IKVRCKECNTTFSVRDE---AEGKRVRCKECGAPVKVTAAKTNKTRSSSSKPADTDDFLA  59

Query  55   --------------CPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGL  100
                          CP CG                ++  R + +   +  R+  A     
Sbjct  60   TLDIDKIEDKAAKICPRCGYDVGEEDIECANCGVDLSTGRMSEATRRKRRRKGPAIEDFY  119

Query  101  RSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAIL  160
                   +                  +   L +  +F  +                 +I+
Sbjct  120  SKSWGDASKFLGNHKGLAVKTFIYSAIASTLFYCSVFMMMWCHRTPPRAFWGFIAFVSIM  179

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT---------------L  205
                    +    +  ++        +     +   L     F                 
Sbjct  180  AIPGWIWFIQTEVVRYALQKKDKLKRINFDFFLCSALGIKFIFWIILFSLPAQAILGSMG  239

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
              ++    V  G++L+ I  +   + F      +   +      + K        +   F
Sbjct  240  FYLISNGSVPTGAILIAIGFIPTFLMFPLAIPHMTVVDSTPGWLMHKLVKTFLKLFKPAF  299

Query  266  GRFVLLLVISLTLSFLTARIPYVGE  290
                + +V +L      A +  +  
Sbjct  300  FWCFVFIVTTLPALGCLAAVGVMSG  324


>XP_037871600.1 uncharacterized protein LOC119629549 [Bombyx mori]
Length=396

 Score = 44.4 bits (101),  Expect = 0.11, Method: Composition-based stats.
 Identities = 15/244 (6%), Positives = 31/244 (13%), Gaps = 8/244 (3%)

Query  36   LIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRA  95
              +             +     C   R           +                     
Sbjct  134  CWWFWLCRFLWCWRFRLWMFFWCWRFRLCRFFWCWGFRQCRFLWCWRFRLWRFLWCWGFR  193

Query  96   SGSGLRSI---SQLLADSWEL----FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWL  148
                              W      F       L  +L    L           +   +L
Sbjct  194  LCRLFWCWGFRQFRFLWCWRFRLCRFFWCWGFRLWRFLWCWRLRLWRFLWCWGFRLWRFL  253

Query  149  NPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLI  208
              +       +      +      W         C+           G R          
Sbjct  254  WCRGFRLCRFLWCWGFRFCRFLWCWGFRLWRFLWCRGFRLCRFLWCWGFRLWRFLWCRGF  313

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALE-KSRLLVSGHWWAIFGR  267
             L   +      L               +         L     +    +    + +   
Sbjct  314  RLCRFLWCWRFRLWRFLWCRGFRLCRFLWCWRFRLWRFLWCWRFRLWRFLWCRGFRLCRF  373

Query  268  FVLL  271
                
Sbjct  374  LWCW  377


>WP_173764179.1 zinc-ribbon domain-containing protein [Azoarcus sp. M9-3-2]QID17012.1 
hypothetical protein G3580_04765 [Azoarcus sp. M9-3-2]
Length=106

 Score = 41.3 bits (93),  Expect = 0.11, Method: Composition-based stats.
 Identities = 10/37 (27%), Positives = 13/37 (35%), Gaps = 0/37 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
           M   RCP C       S +L  ++   RC  C     
Sbjct  1   MMRTRCPACQTVFRITSEQLRIRQGKVRCGHCRHVFN  37


>HGR91282.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=625

 Score = 44.4 bits (101),  Expect = 0.11, Method: Composition-based stats.
 Identities = 27/158 (17%), Positives = 50/158 (32%), Gaps = 9/158 (6%)

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
            +     +        + +                 L+   L+V  GS+L ++PGLL    
Sbjct  456  AVALLLVEHIGSARWLNVSDLFGRLWHRRVQLGSSLLPAALLVALGSVLFVVPGLLLAAL  515

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-----  286
              F   V   +   G+ AL++S  LV    W +    +   ++  T   L   +      
Sbjct  516  LLFVPAVAIFERASGMAALKRSVALVRSDPWRVLVVMLASFILGATAYALAEAVMPAGTW  575

Query  287  ----YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                ++   A   F     P + L    +Y D +    
Sbjct  576  RSSIFLRLLAGDFFLAATFPITALAAARLYIDRRGQEG  613


>OGP84945.1 hypothetical protein A2Z08_12055 [Deltaproteobacteria bacterium 
RBG_16_54_11]
Length=307

 Score = 44.0 bits (100),  Expect = 0.11, Method: Composition-based stats.
 Identities = 9/41 (22%), Positives = 14/41 (34%), Gaps = 1/41 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
           M  V C  C  + N   +K+   ++  RC  C         
Sbjct  1   MI-VLCDRCKTQYNLNDAKVKPGETKVRCSRCQHVFTVPHP  40


>HAY78460.1 hypothetical protein [Planctomycetaceae bacterium]
Length=280

 Score = 43.6 bits (99),  Expect = 0.11, Method: Composition-based stats.
 Identities = 12/75 (16%), Positives = 20/75 (27%), Gaps = 0/75 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M   +C  CGA+   P +    K   + C    Q              + N + C     
Sbjct  1   MIEFQCEKCGAQMQAPEASAGKKGRCSECGSIQQIPSAPAEAPSLAVISLNCSECGEELR  60

Query  61  QRRIPSDRLEIQSKT  75
                + +     K 
Sbjct  61  VPAANAGKKGKCPKC  75


>WP_185157027.1 zinc-ribbon domain-containing protein, partial [Methylobacterium 
symbioticum]
Length=98

 Score = 40.9 bits (92),  Expect = 0.11, Method: Composition-based stats.
 Identities = 9/35 (26%), Positives = 15/35 (43%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            + CP C +     + +L  +  S RC  C +T  
Sbjct  2   LIVCPTCASAYRVETGRLGMEGRSVRCAACRETWF  36


>NRA74281.1 hypothetical protein [Rickettsiales bacterium]
Length=224

 Score = 43.2 bits (98),  Expect = 0.11, Method: Composition-based stats.
 Identities = 24/206 (12%), Positives = 68/206 (33%), Gaps = 12/206 (6%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                  + I      L+ + ++        T+++ +++ +   + +  V    + +  + 
Sbjct  11   FNENIPIFIKYWWFFLSISLLYHGYQYLILTFISVKSEYFSVILAVYHVLITPIIMLVIM  70

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV--GGGSLLLIIPGLLFCVWFF  233
              +  Y        F  +    R       L+ L I  V    G    +I  ++  + F 
Sbjct  71   YVVDCYNKNKLPTDFNKLIKDTRRCYIRIFLIYLTIYFVRKYFGFGATLIIFVIMYIKFP  130

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE---  290
            F +  +   +    +A++ +  L   +  +      +L+V+ L +  +  +I  V     
Sbjct  131  FLEQEIFFRDTSLWKAIKNNNELT--NQESTIKTITVLIVLFLIVYGVITKISQVIISYE  188

Query  291  -----AANLAFSLLLTPFSFLYYYLI  311
                       ++L+  F      ++
Sbjct  189  IQHEKIILYGLNILMIAFFLFCKAVL  214


>HCU77251.1 hypothetical protein [Microbacterium sp.]
Length=244

 Score = 43.6 bits (99),  Expect = 0.11, Method: Composition-based stats.
 Identities = 16/67 (24%), Positives = 26/67 (39%), Gaps = 0/67 (0%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            ++     L  I   L   V       VL  ++     AL +S  L  G +W I G  VL+
Sbjct  178  ILGILVILAAIPLSLWLAVKLLLVPAVLIVEHTSLGAALGRSWRLSRGRFWVILGILVLV  237

Query  272  LVISLTL  278
             ++   +
Sbjct  238  SLVFGAV  244


>WP_154809514.1 hypothetical protein [Methanolobus vulcani]TQD25883.1 hypothetical 
protein FKV42_06840 [Methanolobus vulcani]
Length=200

 Score = 42.8 bits (97),  Expect = 0.11, Method: Composition-based stats.
 Identities = 21/173 (12%), Positives = 51/173 (29%), Gaps = 11/173 (6%)

Query  143  KPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGS  202
                         +    +  V  I + LS +   +  ++     G+  ++      +  
Sbjct  15   MDILDGFRDGNFVRSWKYMLFVMIIAVLLSIVAMILLFFVGILSFGIIFTLGSSSSSIPE  74

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
               L I  IL +    ++     L+   +  +   +          AL +S  +V  +  
Sbjct  75   SVTLAIAGILYLAVICII-----LIPVFFMLYMLPLYVTKGYDVTDALSESAAIVKSNIT  129

Query  263  AIFGRFVLLLVISLTLSFLTARIPYVGE------AANLAFSLLLTPFSFLYYY  309
            A     +++ +++L          ++G          +   LL  P S     
Sbjct  130  ASVIISLIVGLVALVGVLPYYAGLFLGWPVAFNLLIYIIGVLLTMPLSQQILV  182


>XP_005110944.1 uncharacterized protein LOC101861007 [Aplysia californica]
Length=1076

 Score = 44.8 bits (102),  Expect = 0.11, Method: Composition-based stats.
 Identities = 20/208 (10%), Positives = 43/208 (21%), Gaps = 6/208 (3%)

Query  5    RCPHCGAERNTPSSKLP-----AKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCG  59
             CP C +        L            +C EC     F       T  T          
Sbjct  424  TCPACSSVHTIDPDALTTKKKRKNGLKVQCSECHLQWCFLCQAPFHTGVTCKQYRKGDEM  483

Query  60   LQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWEL-FCRRG  118
            +++     +    +     +              + +        +       + F    
Sbjct  484  VRKWAKEVKHGKFNAQRCPKCKTFIQKTDGCDHMKCTQCSTDFCYRCGDRYIRIKFIGNH  543

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            +    ++     L         L++   +          A L       LLG S      
Sbjct  544  FSRFSLFGCKYRLMPKKPVLRRLIRGTNFAARMVAGVVLAGLGLAAGIGLLGASVFILPG  603

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLL  206
            +              +  +     + +L
Sbjct  604  YGIFRLHRHRKIAKKRKHVERRMHYMIL  631


>PWL49465.1 hypothetical protein DBY36_07800 [Clostridiales bacterium]
Length=293

 Score = 43.6 bits (99),  Expect = 0.11, Method: Composition-based stats.
 Identities = 22/163 (13%), Positives = 50/163 (31%), Gaps = 13/163 (8%)

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGL-RHVGSFTLLLILLILVVGGGS  218
                   + +    +      Y+  T   +       L   + + +L   L+     G  
Sbjct  119  YFLYTVALAVLTVPLYTVPSSYLLSTANAVLADFSDQLTAGLAAVSLNWSLVDFRAVGIC  178

Query  219  LLLIIPGLLFCVWFFFCQYVLADDN-IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
             +L++   L  V     QY   DD+ +    A  +S  ++ GH        +  +   + 
Sbjct  179  AVLLLVYALLSVKLLLTQYYFVDDDALSPFSAAYRSWRVMRGHALEFILLALSFIGWYIA  238

Query  278  LSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
             S                 +L L P+  +   +    ++A++ 
Sbjct  239  CSLTV-----------FIVALYLLPYLRMSVVIFTEYVRADHE  270


>MBI29240.1 hypothetical protein [Pelagibacteraceae bacterium]PPR44995.1 
hypothetical protein CFH21_00084 [Alphaproteobacteria bacterium 
MarineAlpha5_Bin11]PPR51401.1 hypothetical protein CFH20_00594 
[Alphaproteobacteria bacterium MarineAlpha5_Bin10]
Length=133

 Score = 41.7 bits (94),  Expect = 0.11, Method: Composition-based stats.
 Identities = 11/39 (28%), Positives = 13/39 (33%), Gaps = 1/39 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M  V CP C +     S+ L  K     C  C      D
Sbjct  1   MI-VNCPSCNSRYLVNSADLQPKGRIVHCTICSYEWFND  38


>NEP27797.1 hypothetical protein [Moorea sp. SIO3I6]
Length=136

 Score = 42.1 bits (95),  Expect = 0.11, Method: Composition-based stats.
 Identities = 25/113 (22%), Positives = 45/113 (40%), Gaps = 6/113 (5%)

Query  198  RHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLV  257
              + +   L +    + G G LL+IIPG+++ V   +        +     A   SR +V
Sbjct  1    SRLIALIGLNLSFNYIFGTGLLLMIIPGIIYWVNNGYYGLAFVLRDQRSRAAFAYSRAIV  60

Query  258  SGHWWAIFGRFVLLLVISL----TLSFLTARIPYV--GEAANLAFSLLLTPFS  304
             G+WW +F    + L + +        + + IP +   E      + LL  F 
Sbjct  61   KGNWWRVFLFGFIYLFVIVIAIKIFHIIFSFIPIINSSEVLMTVITSLLAGFI  113


>AJF61271.1 hypothetical protein QT06_C0001G0431 [archaeon GW2011_AR15]HIH41033.1 
hypothetical protein [Candidatus Woesearchaeota archaeon]
Length=311

 Score = 44.0 bits (100),  Expect = 0.11, Method: Composition-based stats.
 Identities = 23/203 (11%), Positives = 55/203 (27%), Gaps = 5/203 (2%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
            ++  F  R   LLGI  +   +          +    ++                 + L 
Sbjct  60   NFASFITRYGLLLGIAFIAFFVLMLFFTYLNSVFTFMFIEGVLHKSIKIRKSFRANHSLG  119

Query  170  GLSWMTGSMFIYICKTDVG-LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
               +    +   I    +  +   + + L +    +    L + +  G  +  +   + +
Sbjct  120  LNIFFVKLITGMITIAGIAVILSPVIIALLNGTLSSFNYWLFVPMFLGFFIFFLALIIFW  179

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
             + + F   V+   N    +A       +        G   L  +I + LS        +
Sbjct  180  FIVYDFVMPVMYLKNYRFARAWHHVIKKIRDRK----GEIGLYWLIKVGLSIALNIATII  235

Query  289  GEAANLAFSLLLTPFSFLYYYLI  311
                     L+      L  Y +
Sbjct  236  LAIFVAIILLIPFALIGLGIYFL  258


>OLQ00702.1 hypothetical protein AK812_SmicGene16598 [Symbiodinium microadriaticum]
Length=1104

 Score = 44.8 bits (102),  Expect = 0.11, Method: Composition-based stats.
 Identities = 32/334 (10%), Positives = 78/334 (23%), Gaps = 5/334 (1%)

Query  22   AKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRC  81
             +     C     +L               +       L   +    +      +     
Sbjct  182  PEGCDDDCYYIYYSLACTVCSLLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLV  241

Query  82   NRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI--YLLGIVLAFAPIFSA  139
                              L          W +    GW +  +  +L+G ++ +   +  
Sbjct  242  GWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLV  301

Query  140  LLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFR-SMKLGLR  198
              L              W +       +   + W+ G +  ++    VG     +   L 
Sbjct  302  GWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLV  361

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
                  L+  L+  +VG     L+   + + V +     V           +      + 
Sbjct  362  GWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLV  421

Query  259  GHWWAIFGRFVLLLVISLTLSFLTARI--PYVGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
            G        +++  ++   + +L   +    VG         L+          +   L 
Sbjct  422  GWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLV  481

Query  317  ANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLV  350
                G     +    +          L+  L+  
Sbjct  482  GWLVGWLVGWLVGWLVGWLVGWLVGWLVGWLVGW  515


>MBE0536951.1 hypothetical protein [Phycisphaerae bacterium]
Length=380

 Score = 44.0 bits (100),  Expect = 0.12, Method: Composition-based stats.
 Identities = 19/224 (8%), Positives = 47/224 (21%), Gaps = 14/224 (6%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M   RCPHC  +   P +         RC  C +        +     +           
Sbjct  1    MIKFRCPHCDQKLGVPEA---YAGRRVRCTRCQEVTEAPSPSAAALDKSAAALDVQPQKD  57

Query  61   QRRIPSDRLEIQSKTVNCRRCN-----------RSFCLQPEREFRASGSGLRSISQLLAD  109
            +      +L  +                          +     RA   G        + 
Sbjct  58   EATEDGLQLRAEEPAPEVWVDPGLLLDETEAARMEAIARARPPQRAPARGTAKPRSAASR  117

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
              + +       +    + +  + A    A  +                 L+  +   + 
Sbjct  118  PDDTYDPDERPGMKRMAIAVACSVAATLIAAAIWAPMMAYLSMVGCWSGPLMLLLEVGIA  177

Query  170  GLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV  213
                +     +      +GL  ++      +     +     + 
Sbjct  178  IAGALVLGGAMQRTGIRLGLLAAVIGIGGILVGKCFVAKWYWVP  221


>WP_083809629.1 zinc-ribbon domain-containing protein [Candidatus Midichloria 
mitochondrii]
Length=98

 Score = 40.9 bits (92),  Expect = 0.12, Method: Composition-based stats.
 Identities = 10/35 (29%), Positives = 14/35 (40%), Gaps = 0/35 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
             V C  CGA+    SS++ +     RC  C    
Sbjct  7   IIVSCESCGAKFYVSSSEIHSTGRFVRCTICEHEW  41


>WP_040658730.1 YARHG domain-containing protein [Oscillibacter ruminantium]
Length=530

 Score = 44.4 bits (101),  Expect = 0.12, Method: Composition-based stats.
 Identities = 27/236 (11%), Positives = 58/236 (25%), Gaps = 7/236 (3%)

Query  5    RCPHCGAERNTPSSKLPAKKSSA-------RCPECCQTLIFDPAESQRTQTTDNIATCPH  57
             CP+C AE +  ++                 C +       +  ++   +T  N  T   
Sbjct  30   TCPNCRAEIDPDATFCTQCGQPVSKTSSESHCRQDGSPTDNNRDKTVYGETGRNEETNAA  89

Query  58   CGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRR  117
                +   ++              N  + L    +    G    + +  L      F R+
Sbjct  90   EEQAKAAQAEHDRQLHDFAYVIGKNTEYYLPEFEKAGRGGKVKFNWAAFLLVPMFCFYRK  149

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
               L   + L  ++         L+  A                     + +    + G 
Sbjct  150  CGDLFIKFFLLPLIVENAGGIISLIGMAVLPFQSTVALVMMAGGVLSLLVGIVWLAINGI  209

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
             F               +  R +      +   IL      L+  + G++  V   
Sbjct  210  RFGLHFNELYYHHCCQLINSRDIKHCGTSVWHAILYAVVAYLIETMLGVILVVAMI  265


>HIG12636.1 hypothetical protein [Planctomycetes bacterium]HIL52046.1 hypothetical 
protein [Planctomycetes bacterium]
Length=314

 Score = 44.0 bits (100),  Expect = 0.12, Method: Composition-based stats.
 Identities = 16/191 (8%), Positives = 54/191 (28%), Gaps = 14/191 (7%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +L I +          +   + K       + ++           +++  + ++     +
Sbjct  71   VLQILIQLGSSWLMAGYFQTIRKVMVEGQAEFEDLFRPGGRWWTLFLVRLVFFLVVLASL  130

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                    +   +   L    +  + + +       G L ++       +   F      
Sbjct  131  LPLAIPPTVAMMLSEVLDLSSAGAVAVTV------LGELAVLPMLAYVFLGLAFAGPAAV  184

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
             ++    QA   S  L  G+ W +        ++ +  S +      +     +  + +L
Sbjct  185  LEDCTVAQAFGHSWDLAHGNRWQL--------LVFVLFSIVLVIAGLLLCCVGIVPATIL  236

Query  301  TPFSFLYYYLI  311
                +   Y++
Sbjct  237  INVMWCEAYIV  247


>WP_191138168.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Hazenella sp. IB182353]MBD1367685.1 glycerophosphoryl 
diester phosphodiesterase membrane domain-containing 
protein [Hazenella sp. IB182353]
Length=283

 Score = 43.6 bits (99),  Expect = 0.12, Method: Composition-based stats.
 Identities = 21/109 (19%), Positives = 36/109 (33%), Gaps = 11/109 (10%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            V    LL+ +    F + F     V   +      A++KS  L  G  W  FG    L +
Sbjct  146  VFLCELLIALLASYFFIRFALTIPVFMMERRTLRFAIKKSWTLSRGKVWQTFGALFFLGM  205

Query  274  ISLTLSFL-----------TARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
                ++ L            A +  +        ++ + P  FL+  L 
Sbjct  206  FKSLIAMLSMSFLDWALWDLAYLSLLWYTIGSLITIGIGPILFLFTPLY  254


>OFX19659.1 hypothetical protein A2V77_07340 [Anaeromyxobacter sp. RBG_16_69_14]
Length=214

 Score = 43.2 bits (98),  Expect = 0.12, Method: Composition-based stats.
 Identities = 6/34 (18%), Positives = 12/34 (35%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            +RC  C          +P + +  +C +C    
Sbjct  2   LIRCDKCSTLYELEEDLIPPRGAPVQCSKCQFVF  35


>KKS94389.1 hypothetical protein UV70_C0001G0003 [Parcubacteria group bacterium 
GW2011_GWA2_43_13]OGY68973.1 hypothetical protein A3B94_02220 
[Candidatus Jacksonbacteria bacterium RIFCSPHIGHO2_02_FULL_43_10]OGY70360.1 
hypothetical protein A2986_00185 [Candidatus 
Jacksonbacteria bacterium RIFCSPLOWO2_01_FULL_44_13]OGY72698.1 
hypothetical protein A3H59_01550 [Candidatus Jacksonbacteria 
bacterium RIFCSPLOWO2_02_FULL_43_9]HAZ16552.1 
hypothetical protein [Candidatus Jacksonbacteria bacterium]
Length=271

 Score = 43.6 bits (99),  Expect = 0.12, Method: Composition-based stats.
 Identities = 32/215 (15%), Positives = 63/215 (29%), Gaps = 9/215 (4%)

Query  82   NRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALL  141
                              +  +  + A    ++      LL I         A +    L
Sbjct  19   WCVVRAHWREFLHLVKWYILPLIIVHAIGQLIYFGGYTLLLTIVFYVFFGLLALLVQLCL  78

Query  142  LKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVG  201
            ++     +           L +    L     +   + I      + +   M +      
Sbjct  79   IRAMKQYSQGESGGFSKKHLLSTLVYLPAAVGIVAIVVIVSLLAILIVSLPMVVLTAFSL  138

Query  202  SFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW  261
                  ++  L++      L        V+F F  YVL D++      + KS  LV G W
Sbjct  139  HIPGASVITFLLILAVFFAL-------SVYFSFPLYVLIDEHCSMSDTVRKSIALVRGKW  191

Query  262  WAIFGRFVLLLVISLTLSFLT--ARIPYVGEAANL  294
             A+F R   L+++ L +S +     +  +G     
Sbjct  192  CALFIRSAWLVIVGLAVSVIALYGMVFIIGLLIGA  226


>BBM83698.1 hypothetical protein UABAM_02051 [Planctomycetes bacterium SRT547]
Length=1690

 Score = 44.8 bits (102),  Expect = 0.12, Method: Composition-based stats.
 Identities = 10/78 (13%), Positives = 19/78 (24%), Gaps = 3/78 (4%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  V CP C  + +             RC  C    I      +   +T          +
Sbjct  1   MIKVTCPACQKKYSVIDR---MGGKEIRCQSCDYVFIAPRKRGENWVSTICPQCSKLYKV  57

Query  61  QRRIPSDRLEIQSKTVNC  78
            + +       +    + 
Sbjct  58  SQEMLGKEFTCRRCHHDF  75


>WP_148115202.1 zinc-ribbon domain-containing protein [Wolbachia pipientis]
Length=113

 Score = 41.3 bits (93),  Expect = 0.12, Method: Composition-based stats.
 Identities = 9/95 (9%), Positives = 27/95 (28%), Gaps = 0/95 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            ++C +C    + P+ ++     + +C  C      +P + +       +  C       
Sbjct  2   EIQCQNCTKVYSVPADQIGKSGRTVKCTSCEFIWHANPYKKRGNYLVLTVLICIVICFIA  61

Query  63  RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASG  97
             P+   +     +     ++        E     
Sbjct  62  ANPNKIKKPYKHFMYKLIGHKKEYNIETNELYQDY  96


>WP_096909167.1 hypothetical protein [Halobacteriovorax marinus]ATH07524.1 hypothetical 
protein BIY24_06075 [Halobacteriovorax marinus]
Length=206

 Score = 42.8 bits (97),  Expect = 0.12, Method: Composition-based stats.
 Identities = 25/121 (21%), Positives = 46/121 (38%), Gaps = 4/121 (3%)

Query  165  AYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIP  224
                L     +   +  I    +GL    K    +   +  + IL  L    G+L+LIIP
Sbjct  49   FLFGLFEVIGSCVFYALILGRLLGLQ---KFNFLNFFVYLRVNILYSLFYLLGALVLIIP  105

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            GL    +F+F   +LA + +       KSR +     W++    + L+ + +        
Sbjct  106  GLYILTFFYFA-PILALEGVECESYFSKSREMTKKSPWSVLAVSLSLVFLMILDLLFLGY  164

Query  285  I  285
            +
Sbjct  165  V  165


>NLW72924.1 hypothetical protein [Chloroflexi bacterium]
Length=358

 Score = 44.0 bits (100),  Expect = 0.12, Method: Composition-based stats.
 Identities = 27/186 (15%), Positives = 60/186 (32%), Gaps = 0/186 (0%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
                      L  I L+ ++   + + S              ++      L         
Sbjct  74   IPPLAWVWIMLSIIALVFLLSLISLLVSTAARGGLIKGLLIAEDRYTDNRLTFKEVWRAM  133

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
              +    +F+ +     GL  S  L +  + S  L L L + ++   ++L I  G L   
Sbjct  134  KPFFWRLLFLRVLIGIGGLILSFILIIAFIISIILTLGLSLCLIVPLTILAIPVGWLINA  193

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            +       L D+++   +AL +S  + + + W        + +IS   +       ++G 
Sbjct  194  FVGMSSVALIDEDLDTFKALARSWEVATKNIWPTLATMFFISLISFLTTIFVLLFLFLGS  253

Query  291  AANLAF  296
               +  
Sbjct  254  LPLVIA  259


>OGV41067.1 hypothetical protein A2X48_11945 [Lentisphaerae bacterium GWF2_49_21]HBC85474.1 
hypothetical protein [Lentisphaeria bacterium]
Length=196

 Score = 42.8 bits (97),  Expect = 0.12, Method: Composition-based stats.
 Identities = 20/179 (11%), Positives = 42/179 (23%), Gaps = 3/179 (2%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M   +CPHC  + +    +         C  C         ++              C  
Sbjct  1    MLEFKCPHCQKDYSVKEDQ---AGKMFECAACGNIFHTPSPQACPECQQLLEPGVVVCIK  57

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                   + ++++                       G     I         L     + 
Sbjct  58   CGFNLQTKQKMETVIHIDDPTPIWLKFLRFMYDLMPGLFRPLIIFSFLACIALAIFLMFM  117

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             L +  +G+ L+   I +  L+  A  +          +  A V +           +F
Sbjct  118  GLMVISMGVFLSGIFICAGALMVYAQGVAFLLAGEFQILKSALVDFTERQTWVFVLLVF  176


>NLE84788.1 hypothetical protein [Myxococcales bacterium]
Length=457

 Score = 44.0 bits (100),  Expect = 0.12, Method: Composition-based stats.
 Identities = 9/31 (29%), Positives = 15/31 (48%), Gaps = 0/31 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
           V CP C +    P  K+  +++  +C  C Q
Sbjct  3   VTCPACSSRYAIPDEKIAGRRARIKCKRCGQ  33


>MBI2923660.1 hypothetical protein [Planctomycetes bacterium]
Length=289

 Score = 43.6 bits (99),  Expect = 0.12, Method: Composition-based stats.
 Identities = 20/139 (14%), Positives = 40/139 (29%), Gaps = 0/139 (0%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
            A      L  +     +   +              LR        +  L ++   G    
Sbjct  80   ALFVARGLSHAAAIEFVRAEVYGERRTPEGCWLPALRRAADHAFFVGGLSVIQWLGLAAG  139

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
            I PGL +       Q     +++   +AL +SR L+ G+   +     L L   +  +  
Sbjct  140  IYPGLAWLNTNGLAQSAAIFEDLPAPRALRRSRELMKGNIAGMGVWAGLFLAWLVLFANA  199

Query  282  TARIPYVGEAANLAFSLLL  300
                 ++ +       L  
Sbjct  200  LCFGAFMPDIVRSFLGLSF  218


>NQZ95038.1 zinc-ribbon domain-containing protein [Myxococcales bacterium]
Length=490

 Score = 44.0 bits (100),  Expect = 0.12, Method: Composition-based stats.
 Identities = 9/37 (24%), Positives = 13/37 (35%), Gaps = 0/37 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           V C  C  +     +++  K    RC  C     F P
Sbjct  5   VTCGECDTQFRLDDTRVSPKGIRVRCSVCKHAFRFTP  41


>HCL81743.1 hypothetical protein [Nitrospiraceae bacterium]
Length=555

 Score = 44.4 bits (101),  Expect = 0.12, Method: Composition-based stats.
 Identities = 41/346 (12%), Positives = 94/346 (27%), Gaps = 35/346 (10%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
             L+      +    +L ++     +   +                   +    + +   +
Sbjct  23   NLWFFIKLQIAAFLILLVMSLIPVLIEKVFPLQQAGAFSIPLVILSLTIGIVFSIVPSII  82

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV-  230
                  + +  C  +   F  +    R +  +    +   + + GG  L+ +  L   + 
Sbjct  83   YIGFTKISLRFCDNETPEFIELFSHYRLILKYLFASLFFAVFMFGGVGLISLGFLSGAIP  142

Query  231  --------------------WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
                                 F F  Y + D   G L AL  S L+  G    +F   ++
Sbjct  143  GMLKVVPAIAGLVLVLFLSLTFMFFPYAVVDKGFGPLNALRISYLITKGAKLKLFLFLLV  202

Query  271  LLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQ  330
            +             + +VG    L   L+  P + + +  IY  L +           + 
Sbjct  203  IF-----------AVNFVGALLLLIGLLITVPLTMVAFAHIYRSLVSAAETEGADLSAKG  251

Query  331  ---WLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSL  387
                +    A+   + I    LV + R  L +     A             +        
Sbjct  252  VRENMSFVIALIVAITIAAGALVMVYRSTLGSVSAQEARHYADNTFTQVFSRWDADKLLA  311

Query  388  PEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQN  433
               P+  S+   +    + +    +    LG + ++   F     +
Sbjct  312  EASPEFFSAMPGQQAAEQTKNLFKDASEKLGMMKIYNIVFQEFKTS  357


>MBI3821877.1 hypothetical protein [Planctomycetes bacterium]
Length=253

 Score = 43.2 bits (98),  Expect = 0.12, Method: Composition-based stats.
 Identities = 13/39 (33%), Positives = 14/39 (36%), Gaps = 3/39 (8%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M TV CPHC      P         + RC  C QT    
Sbjct  1   MITVDCPHCSRSLQLPDDL---AGITVRCANCQQTFAMP  36


>GFZ94090.1 hypothetical protein CYANOKiyG1_04760 [Okeania sp. KiyG1]
Length=146

 Score = 42.1 bits (95),  Expect = 0.12, Method: Composition-based stats.
 Identities = 11/56 (20%), Positives = 28/56 (50%), Gaps = 0/56 (0%)

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            + FF   ++ ++NI   ++L +S +L  G+   IF   ++  +++L +  +     
Sbjct  7    FIFFEMPLVVEENITATRSLGRSWVLTKGYIRRIFVILLVATLVALPVGVILYIFS  62


>QHI68099.1 hypothetical protein GT409_01070 [Kiritimatiellaeota bacterium 
S-5007]
Length=262

 Score = 43.6 bits (99),  Expect = 0.13, Method: Composition-based stats.
 Identities = 27/224 (12%), Positives = 59/224 (26%), Gaps = 22/224 (10%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            +    +      GI    +      +               N +W      A+  +  LG
Sbjct  16   FYASWKIYCRRFGIIFGVLFCWQVLLIGVDSTFTGNLFGELNHHWFVWEFRASSLFGFLG  75

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLI-------------------  211
            + ++       +    V L + +   + ++     ++ L+                    
Sbjct  76   MLFVIFLTAQSVRTEQVQLKK-VFQCVGNICLSAFVIRLMFLAGGIISSGLVKFLAFVEC  134

Query  212  --LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV  269
              + V      +++   +  V+F F    L             S  +V G WW +FG  +
Sbjct  135  PEIGVLIFRFAVLVIETVLGVYFIFVYQALILKQKRAFSVFICSFQVVRGSWWKLFGIQL  194

Query  270  LLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
            L          +   +    E A   F      + F     +  
Sbjct  195  LFKAPVGLPLTILMVVAVGREMAVEWFGSPWMLYLFHPLMAVLY  238


>RPI78644.1 HAMP domain-containing protein [Desulfobacteraceae bacterium]
Length=231

 Score = 43.2 bits (98),  Expect = 0.13, Method: Composition-based stats.
 Identities = 22/189 (12%), Positives = 49/189 (26%), Gaps = 1/189 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQT-TDNIATCPHCG  59
            M   RC  CG        K+   ++   C  C  T+       + T T            
Sbjct  1    MIQTRCEKCGKRYRVDEKKILGYRAKFECTACQNTVYIIKPPPKFTPTEKITPWNESTVI  60

Query  60   LQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGW  119
            L+R    +  E Q   +   +  +   L+   +          +   +   + L   +  
Sbjct  61   LKRESYFESGEWQRMKIPLPKEEKHRGLRFASKLTILMLIFILLPVFIFFGFYLKPTQNK  120

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
              + I  +    A  P +        +             ++     I+   +     + 
Sbjct  121  IKIEIGQVLTQDAPNPSYPLGNRMTQSNQTYYIFGAVLIYVILGTWLIVFFFTRSLTKII  180

Query  180  IYICKTDVG  188
                +  +G
Sbjct  181  RLANRMSLG  189


>PMC61510.1 hypothetical protein CJ204_10755 [Corynebacterium xerosis]
Length=283

 Score = 43.6 bits (99),  Expect = 0.13, Method: Composition-based stats.
 Identities = 29/193 (15%), Positives = 58/193 (30%), Gaps = 11/193 (6%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            +G    GI+L +  +F+ + +          +            + +     + G+  I 
Sbjct  91   IGAAFTGILLMYPVLFALMFVVMVFMYRGAFEEIDGRRPSFGTFFRVNRWGALIGAWLIT  150

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                         L    +G+ T        +V    LLL+   +           ++ D
Sbjct  151  GLMAVAAQIPGYILMFMGLGASTQSEGAGGALVILSYLLLVAGSIAVAPITALIPLLVMD  210

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
                 L+A   +  LV G +W++ G  +           L   +  +G       +L   
Sbjct  211  GRAKVLEAPALAWNLVKGRFWSVLGAML-----------LAGLVGAIGIMLCYIGALYTM  259

Query  302  PFSFLYYYLIYSD  314
            P   + Y  IY  
Sbjct  260  PIQMVAYAEIYRQ  272


>PCJ92756.1 hypothetical protein COA52_07305 [Rhizobiales bacterium]
Length=367

 Score = 44.0 bits (100),  Expect = 0.13, Method: Composition-based stats.
 Identities = 6/34 (18%), Positives = 12/34 (35%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP+C  +    +  L       +C +C    
Sbjct  44  KIACPNCDTKYALSAEALGDAGRKMKCAKCAHIW  77


>NIS31609.1 hypothetical protein [Actinobacteria bacterium]NIU66724.1 hypothetical 
protein [Actinobacteria bacterium]NIV87375.1 hypothetical 
protein [Actinobacteria bacterium]NIW28525.1 hypothetical 
protein [Actinobacteria bacterium]NIX21009.1 hypothetical 
protein [Actinobacteria bacterium]
Length=117

 Score = 41.3 bits (93),  Expect = 0.13, Method: Composition-based stats.
 Identities = 15/78 (19%), Positives = 28/78 (36%), Gaps = 4/78 (5%)

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGR---FVLLLVISLTLSFLTARI-PYVGEAANLAF  296
              ++  + A+E S  L SGH   +F      +L  +++    FL   + P +G    +  
Sbjct  3    VRDVNFVDAMEGSWSLASGHRLELFALAVVILLAGLVASIPGFLLGLVAPALGVTLGVVG  62

Query  297  SLLLTPFSFLYYYLIYSD  314
               +  F        Y  
Sbjct  63   RAAVAVFGIASAARAYDQ  80


>KCW62326.1 hypothetical protein EUGRSUZ_H04969 [Eucalyptus grandis]
Length=256

 Score = 43.2 bits (98),  Expect = 0.13, Method: Composition-based stats.
 Identities = 13/204 (6%), Positives = 49/204 (24%), Gaps = 13/204 (6%)

Query  158  AILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG  217
             ++          + +      +      + +       +       +   L + V+   
Sbjct  40   CVVSFLFMAQFPEIGFEKLMGVVPKLWKKLIVTFIWANAMLLAYHVIVWSALNLRVIIEN  99

Query  218  SLLLIIPGLLFCVWFFFCQYVLA-------DDNIGGLQALEKSRLLVSGHWWAIFGRFVL  270
            + LL +  + + + F +              +++  ++A+ KS  L+  +   +     +
Sbjct  100  TFLLAVLSIAYALGFMYIIMAWDMANVVSVLEDVYVIKAIMKSNGLIKHNIGTVVFISYV  159

Query  271  LLVISLTLSFLTARIPYVGEAA------NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
            ++   L L                            +  S +  Y + +     +     
Sbjct  160  IVYFYLVLQLPILLFVMFDVVVRERVLRIGIGVAYFSLMSMVVMYFVCNSNVGPHENIDK  219

Query  325  PPIKRQWLPLTAAIFGWMLIPGLL  348
              +                +    
Sbjct  220  SFLAEHLQAYLGGENAQPTLSFPP  243


>MBE6986978.1 DUF975 family protein [Ruminococcaceae bacterium]
Length=325

 Score = 43.6 bits (99),  Expect = 0.13, Method: Composition-based stats.
 Identities = 25/237 (11%), Positives = 60/237 (25%), Gaps = 27/237 (11%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            + +    + +      +              ++       +  I             Y+ 
Sbjct  81   VLIPFWQIGYLYAMLRICRGEPAGPETLFTGFRRFFPFLRLHLIQFASYAGVALACSYVA  140

Query  184  KTDVGLFRSMKLGLRHVGS-----------------FTLLLILLILVVGGGSLLLIIPGL  226
               +         +  +                      +  ++I ++    +L +  G+
Sbjct  141  GNVIFFTPWGSNMMDSLIPLLSESEAVDMAVLEEAIMAAMDQIMIPLLAIFGVLFLAIGV  200

Query  227  LFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT------LS  279
                 +   QY+L DD  +G L AL  SR +  G+  A+    +      L       L 
Sbjct  201  PMYYRYRMAQYLLMDDPRMGALAALRTSRYMTRGNRVALLWLDLSFWWFYLLDGMVTALW  260

Query  280  FLTARIPYVGEAAN---LAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
            F  + +P +G +          +      +    +Y   +                P
Sbjct  261  FGGSLLPLLGISLPWSEGVSQFIFYGLYMICQLGLYLGFRNRVEVTYAHAYSALRQP  317


>RJS68686.1 hypothetical protein CW714_09725 [Methanophagales archaeon]
Length=510

 Score = 44.0 bits (100),  Expect = 0.13, Method: Composition-based stats.
 Identities = 41/355 (12%), Positives = 89/355 (25%), Gaps = 29/355 (8%)

Query  2    PTVRCPHCGAERN-----TPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP  56
              + CP+CG E          ++L       +C            +              
Sbjct  4    IRLSCPYCGEEIKGGGILLSENELKKDYIVVKCTNKECKRDIKLKKHTFKCIHCKETCEI  63

Query  57   HCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCR  116
                + +I            NC +  +    +   +F       R     L     LF  
Sbjct  64   FIPEETQILEIHYVDTPCERNCSKEEKKERKKELGKFYFLKCEERFHYSTLHPGELLFRC  123

Query  117  R-GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
                 L  + L+    +   + +     PA+ L+     +       T+    +      
Sbjct  124  NKCKRLFHVVLIRGRKSVCLLQTDKEGIPASPLHILAGRFGRLFEEITLRVHTISGGCKP  183

Query  176  GSMFIYICK-TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
               +    +  ++ L   +         +  +  L          + +       ++F F
Sbjct  184  FLTYGRARRLGEIILPFFLMSLCSIFYFYLCISTLFSYPTSIIYSVFVSVAFASLIYFSF  243

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR----------  284
                     +G ++ L+       G +   F    + L+I    S +             
Sbjct  244  LVLKETVIVLGEIE-LKSVEDFARGMYDCFFTFRKIWLLIGFMFSLILLVEEILSFGFRL  302

Query  285  -------IPYVGEAAN----LAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
                   I ++   A       F+ L  P  F +   +Y      YR   +   +
Sbjct  303  PATEVEKIRWIDSIACIPGMFIFTSLTIPIFFSFGAFLYGMWSYIYRRYPNRVFR  357


>WP_174525436.1 zinc-ribbon domain-containing protein, partial [Wolbachia endosymbiont 
of Nomada ferruginata]
Length=37

 Score = 39.0 bits (87),  Expect = 0.13, Method: Composition-based stats.
 Identities = 5/35 (14%), Positives = 10/35 (29%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            ++C  C         ++ A     +C  C     
Sbjct  2   KIQCNSCTKTYLVSPKQIGASGRKVKCTNCNHIWH  36


>OFW70235.1 hypothetical protein A2065_03550 [Alphaproteobacteria bacterium 
GWB1_45_5]OFW75835.1 hypothetical protein A3K20_03310 [Alphaproteobacteria 
bacterium GWA1_45_9]OFW89923.1 hypothetical 
protein A2621_03510 [Alphaproteobacteria bacterium RIFCSPHIGHO2_01_FULL_41_14]HCI48361.1 
hypothetical protein [Holosporales 
bacterium]
Length=246

 Score = 43.2 bits (98),  Expect = 0.13, Method: Composition-based stats.
 Identities = 9/40 (23%), Positives = 11/40 (28%), Gaps = 0/40 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPA  41
            T+ CP C          L  +     C EC       P 
Sbjct  7   ITISCPECTTPFEIEGDLLGTEGRYVACSECDHQWFQYPP  46


>RME88612.1 hypothetical protein D6785_00700, partial [Planctomycetes bacterium]
Length=287

 Score = 43.6 bits (99),  Expect = 0.13, Method: Composition-based stats.
 Identities = 29/188 (15%), Positives = 62/188 (33%), Gaps = 27/188 (14%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
                  LL I    + L      ++L + P      +            +       S +
Sbjct  98   FIHYLALLFISFAPVALLAVFGVTSLRISPVVMGWGKFILILLVFFAIFLTAFSFFYSSL  157

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV---------------------  213
              S+F ++ K +  L  +++ G   +G    + + L ++                     
Sbjct  158  IYSVFQHVRKRNPSLGEALQKGWARMGKVFSVGLFLFIIFFFLAIAVRFISSPVLLVIAR  217

Query  214  -----VGGGSLLLIIPGLLFCVW-FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
                 +  G+  L    + F +  +F    V   +N  G  ++++S  L  G+   IF  
Sbjct  218  MGSDVLFMGAQFLQTLLIWFLLTPYFVAFPVTIVENRDGFVSIQRSSQLTRGNRLRIFVL  277

Query  268  FVLLLVIS  275
            F++  +IS
Sbjct  278  FIITTLIS  285


>MBI3841470.1 stage II sporulation protein M [Thaumarchaeota archaeon]
Length=468

 Score = 44.0 bits (100),  Expect = 0.13, Method: Composition-based stats.
 Identities = 26/191 (14%), Positives = 57/191 (30%), Gaps = 1/191 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             +   L  +VLA            A       +    + +            W   +  +
Sbjct  37   FVAYALGALVLAVLVAVLTSGFVTAAEYGSYWKAIGGSRVDIASVLASFVDRWKPMAWTL  96

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILV-VGGGSLLLIIPGLLFCVWFFFCQYVL  239
            ++  +   L       L       L   L++L  V    L  ++      ++F +    +
Sbjct  97   FLSYSLTFLPIIGAFLLTIFAVVFLEGSLVLLGAVIMTYLAAVVATAFISLFFIYTPVAV  156

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
              DN+ G +A+ KS   V            + + ++  +    + IP +G       ++ 
Sbjct  157  VSDNVSGFEAIRKSVRQVRKAKKTAATYGFVYIFLTTLVGATPSLIPGIGLPLASLATVG  216

Query  300  LTPFSFLYYYL  310
            L        +L
Sbjct  217  LLIIVVPILHL  227


>KKR63361.1 hypothetical protein UU02_C0026G0004 [Candidatus Woesebacteria 
bacterium GW2011_GWA1_40_43]
Length=95

 Score = 40.5 bits (91),  Expect = 0.14, Method: Composition-based stats.
 Identities = 21/93 (23%), Positives = 45/93 (48%), Gaps = 2/93 (2%)

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            +LI+P +L  +WF F ++V+ D  +G   +L +S+ +V G +W + GR ++  +      
Sbjct  1    MLIVPFVLVVIWFAFSKFVMVDKGVGIKVSLLESKGMVKGIFWQVLGRLIIFGLFWFFSQ  60

Query  280  FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             + + +P+      + + L    F    + L  
Sbjct  61   MVLSVLPF--GIGTVIYGLCGGLFVIPSFLLYR  91


>MQA06694.1 hypothetical protein [Streptosporangiales bacterium]
Length=665

 Score = 44.4 bits (101),  Expect = 0.14, Method: Composition-based stats.
 Identities = 17/81 (21%), Positives = 27/81 (33%), Gaps = 1/81 (1%)

Query  207  LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG  266
                 L +G     + +   L    F     V   +  G   AL +S  L  G  + +  
Sbjct  181  STFTELPIGVAITFVGLVCWLVFSPFAVAWPVALAERRGPFGALRRSVQLTRGRMFRLGL  240

Query  267  RFVLL-LVISLTLSFLTARIP  286
              +     I   LS+  +RIP
Sbjct  241  LVLFFGYGIPWGLSYGVSRIP  261


>HCH35343.1 hypothetical protein [Dehalococcoidia bacterium]
Length=185

 Score = 42.4 bits (96),  Expect = 0.14, Method: Composition-based stats.
 Identities = 14/110 (13%), Positives = 37/110 (34%), Gaps = 0/110 (0%)

Query  163  TVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
            ++       ++    +     K  +    S  +          L+     ++   ++  I
Sbjct  73   SIIIGFHTSTFYLVVLASRQRKLQISSSVSALVAFGPKLFLLALITSAAALLLIPTIFGI  132

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
               +   V     +  +  +N   LQA+ +S  L+ G+W   F    +++
Sbjct  133  FIVIYVGVRLSLIRPFIVLENTTPLQAVLESWKLIEGNWVRTFTVQFIVI  182


>NIR16342.1 hypothetical protein [Desulfobacterales bacterium]
Length=91

 Score = 40.5 bits (91),  Expect = 0.14, Method: Composition-based stats.
 Identities = 12/42 (29%), Positives = 16/42 (38%), Gaps = 1/42 (2%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAE  42
           M  + C  C ++ N     L +  S  RC  C    I  P E
Sbjct  1   MI-IECESCESKYNLDEGLLKSGGSKVRCSVCKNVFIAYPPE  41


>NPA43632.1 hypothetical protein [Chlorobi bacterium]
Length=294

 Score = 43.6 bits (99),  Expect = 0.14, Method: Composition-based stats.
 Identities = 18/206 (9%), Positives = 57/206 (28%), Gaps = 14/206 (7%)

Query  102  SISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILL  161
               +       L+ R        +L+     F   F  +    ++          + ++L
Sbjct  19   FFRRYGKHFLGLYFRLLGPFYLFFLVVYTWIFYRGFQNVEPLISSVPGTVILIAFYLVIL  78

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMK------------LGLRHVGSFTLLLIL  209
                   + +              DV                   + +     F  + ++
Sbjct  79   LAALLHSVYVPAYFYFFKERGTDFDVKDIWRFFLQNLGRILLFSLVMIVIFVPFLAVALV  138

Query  210  LILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFV  269
            + +++    + +IIP ++  + F         +  G   A  ++  L+   +W   G   
Sbjct  139  MAVILLVTIVGIIIPFVILNLLFSLWFMEYLYERKGIGAAFREAWDLMFTRFWTYTGALT  198

Query  270  LLLVISLTLSFLTARI--PYVGEAAN  293
            +   + + + +    +   ++     
Sbjct  199  VTAWLGIIVIWGLQLLGQSFILMIFF  224


>PYT03044.1 hypothetical protein DMF60_19570, partial [Acidobacteria bacterium]
Length=105

 Score = 40.9 bits (92),  Expect = 0.14, Method: Composition-based stats.
 Identities = 17/106 (16%), Positives = 43/106 (41%), Gaps = 1/106 (1%)

Query  157  WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGG  216
                   +  ++ G++    +         + L  +     + + +  + + +  +    
Sbjct  1    MFFANFVINALIAGVTIRLVTQLFLSPLRPLNLRTAYHAVRKRLKALLVTIAIASIRWIL  60

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
            G LLLI+PG++  + +     V+  + + G  AL++S+ LV   W 
Sbjct  61   G-LLLIVPGIIMFINYSLAAPVVMMEGLKGRAALKRSKALVRRSWR  105


>MQL89493.1 hypothetical protein [Colocasia esculenta]
Length=865

 Score = 44.4 bits (101),  Expect = 0.14, Method: Composition-based stats.
 Identities = 27/197 (14%), Positives = 52/197 (26%), Gaps = 1/197 (1%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                L+ +Y L  V +    F  +L+         +        L            +T 
Sbjct  637  MEIVLMTVYFLCSVASRFLGFFEVLITTFAASAIYSGEHLSPSELLKRMRGNWRGPALTA  696

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
                 +      +      G    G     L L+ L      L+ ++   L  V +   Q
Sbjct  697  LYVAVLRSGFWVVVNIFTGGFPIFGDGLFALFLIALNGLLSLLVQLLYYYLD-VGWASGQ  755

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
                 D  GGL AL+++  L  G         +   +       +   +  +        
Sbjct  756  AASVVDGCGGLSALKRAAELTRGRRLRGICVVLFWGIAYDAADVVFGLLVRLFSPVLAGG  815

Query  297  SLLLTPFSFLYYYLIYS  313
            S++L          +  
Sbjct  816  SVVLIAVCCSNILELMY  832


>MBI5180597.1 zinc-ribbon domain-containing protein [Nitrospirae bacterium]
Length=416

 Score = 44.0 bits (100),  Expect = 0.14, Method: Composition-based stats.
 Identities = 36/367 (10%), Positives = 93/367 (25%), Gaps = 4/367 (1%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            ++CP C    N  ++K+       +C  C    +     +        +A+       + 
Sbjct  3    IQCPRCSVRYNLDNAKITRGSIKVKCTNCGNVFVVQKQRADIEPIHRRVASETIREQIQT  62

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
               + +   ++                      G+ L        D   +       +L 
Sbjct  63   RKREDVPASTRGPVSAGIFDYVPGLIVMFLLTLGAILLGNFLKSIDPNLIGRYHLNYVLI  122

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            + ++GI           L+       P  +     + L      +  +  +   +     
Sbjct  123  LLVVGIAWRNLIGIPDSLMYGIGLGRPLLKIGIIIMGLRFGLGAVADIGIIGLLIIALFV  182

Query  184  KTDVGLFRSMKLGLRHVGSFTLL-LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
               VGL   +        SF+ +    + +     ++              +    +   
Sbjct  183  FGSVGLILLLGKKFEMADSFSGVLASAIGICGVSAAIAAAPVVKAKDTEIAYSIVTIILW  242

Query  243  NIGGLQA---LEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
             +  L A   +  +  L    + A  G  +L     +  + +  +             ++
Sbjct  243  GLIFLFAFPFIGSAFGLTQYQFGAWAGTGILNSGQVIASASIYGKEARDIATLYNIIRVI  302

Query  300  LTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSA  359
              PF  L+    Y   +A         I     P+    F  +++     +         
Sbjct  303  GIPFVVLFIAFWYVGREARGEKVAAGEIIFSKFPIFVIGFFVLVLVRSFGLVTDADYKLI  362

Query  360  EQLLSAG  366
              ++   
Sbjct  363  NPIVDWF  369


>MBF1189903.1 DUF975 family protein [[Eubacterium] sulci]
Length=134

 Score = 41.7 bits (94),  Expect = 0.14, Method: Composition-based stats.
 Identities = 20/110 (18%), Positives = 44/110 (40%), Gaps = 6/110 (5%)

Query  210  LILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRF  268
            + +     S+L  IPG++    +    Y+LAD+  +  ++ L +S++++ G+   +FG  
Sbjct  1    MNVFAFLWSMLFFIPGVIAYFRYSLAFYILADNPELSAMECLRRSKIMMRGNKGYLFGLN  60

Query  269  VLLLVISLTLSFLTAR-----IPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
            +     +L             I +V        S++        Y L+  
Sbjct  61   LSFFGWALLAILAVVLMTDTVILFVPHVNIYITSIMQIFLLIPTYILMSY  110


>RJO71902.1 tetratricopeptide repeat protein [Myxococcales bacterium]
Length=1218

 Score = 44.4 bits (101),  Expect = 0.14, Method: Composition-based stats.
 Identities = 12/33 (36%), Positives = 17/33 (52%), Gaps = 0/33 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQT  35
            V C  CGA+      K+PA+  S +CP+C   
Sbjct  2   KVSCDKCGAKFKIDDGKIPAQGLSMKCPKCQAP  34


>MSQ33989.1 hypothetical protein [Dehalococcoidia bacterium]
Length=161

 Score = 42.1 bits (95),  Expect = 0.14, Method: Composition-based stats.
 Identities = 18/85 (21%), Positives = 29/85 (34%), Gaps = 2/85 (2%)

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY--VGEAANLAFS  297
                    +A+ +S LLV GH+W   G F+L+ +I          I    VG    +  +
Sbjct  71   FLSRSSPAEAIRRSVLLVRGHFWPALGLFILVNLILQGTPLAWRLITGHPVGALVAMLGN  130

Query  298  LLLTPFSFLYYYLIYSDLKANYRGP  322
              +         + Y D       P
Sbjct  131  AYIGTGVMAGTLVFYRDRAGIQGVP  155


>HAD04567.1 hypothetical protein [Desulfuromonas sp.]
Length=263

 Score = 43.2 bits (98),  Expect = 0.14, Method: Composition-based stats.
 Identities = 8/35 (23%), Positives = 13/35 (37%), Gaps = 0/35 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
             + C HC         KL  + +  RC +C +  
Sbjct  10  VVIECAHCQTRFKLADDKLRPEGTKVRCSKCKEIF  44


>MXX00040.1 hypothetical protein [Acidimicrobiia bacterium]
Length=272

 Score = 43.2 bits (98),  Expect = 0.14, Method: Composition-based stats.
 Identities = 34/219 (16%), Positives = 70/219 (32%), Gaps = 1/219 (0%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
                   L          +  +G  +         L   A             +LL  V 
Sbjct  44   QWLPGVGLMMSFLLLFAELLAIGAFIRIVASHCVDLHFSAAEAIRLAWRQYGNMLLMVVV  103

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
            + L   +       I      V           + G    +    +L     +L++++P 
Sbjct  104  FGLAVAATAAVMTMIGSAILAVVAPGFAAEVSSYGGDPLSMPAETLLPFLLWTLVMVLPA  163

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            +   + ++     L  +  G + +L +S  LV  + W      +L L++      +  R+
Sbjct  164  ICLAMTWWVAPMGLTVEGTGAIPSLVRSWKLVLPNLWRTIKILLLALLVVALPFLVIYRL  223

Query  286  PYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
             +    A LA ++   PFS++   ++Y DL+    G   
Sbjct  224  -FPYHWAVLALNVFGLPFSWVVATVLYLDLRVRSEGLDP  261


>NNJ64170.1 thiol reductase thioredoxin [Xanthomonadales bacterium]
Length=47

 Score = 39.4 bits (88),  Expect = 0.14, Method: Composition-based stats.
 Identities = 7/27 (26%), Positives = 10/27 (37%), Gaps = 0/27 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARC  29
            V CP+C      P  +L       +C
Sbjct  2   RVVCPNCHTTNQVPEERLQDGPRCGKC  28


>WP_085536141.1 FHA domain-containing protein [Massilibacteroides vaginae]
Length=170

 Score = 42.1 bits (95),  Expect = 0.15, Method: Composition-based stats.
 Identities = 11/33 (33%), Positives = 18/33 (55%), Gaps = 0/33 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECC  33
           M +++CP+C        +KLP    S +CP+C 
Sbjct  1   MISLKCPYCQVGLKVDETKLPEGIKSFKCPKCK  33


>PYQ07574.1 hypothetical protein DMF82_03555 [Acidobacteria bacterium]
Length=98

 Score = 40.5 bits (91),  Expect = 0.15, Method: Composition-based stats.
 Identities = 9/34 (26%), Positives = 15/34 (44%), Gaps = 2/34 (6%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKK-SSARCPECC  33
           M  + CP C +      SKL  +  +  +C +C 
Sbjct  1   MI-IVCPACQSRYKFDESKLGERPKARTKCAKCG  33


>RZV37437.1 hypothetical protein EVJ48_08995 [Candidatus Acidulodesulfobacterium 
acidiphilum]
Length=180

 Score = 42.4 bits (96),  Expect = 0.15, Method: Composition-based stats.
 Identities = 7/33 (21%), Positives = 12/33 (36%), Gaps = 0/33 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           + CP+CG       + +P      +C  C    
Sbjct  3   INCPYCGTHYELNEALIPDGDIKVKCRVCSNVF  35


>WP_044075409.1 hypothetical protein [Prevotella pectinovora]KIP54114.1 hypothetical 
protein ST43_11420 [Prevotella pectinovora]
Length=219

 Score = 42.8 bits (97),  Expect = 0.15, Method: Composition-based stats.
 Identities = 16/206 (8%), Positives = 44/206 (21%), Gaps = 2/206 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKS--SARCPECCQTLIFDPAESQRTQTTDNIATCPHC  58
            M  V+CP CG   +      P          P+  + +             +N    P  
Sbjct  1    MNQVKCPDCGEVYSENMQSCPNCGCPNDNWKPKQEEQVHETTPTDYEEDFNENGQYSPFS  60

Query  59   GLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRG  118
                                 + +          +    +        + ++   FC   
Sbjct  61   PTSWFFADPWPLKNYPRKAFEKKHPFLGWLFGPWYLTCKNESEKEEYAVINNIFYFCNLI  120

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
            +       +        +F   L+    +     +          +   +         +
Sbjct  121  FKTFLYAAIWAFFKGIVVFFVYLMFALIFAFRGLEMANNNSTDGLIVLGVFSAIMFYVVI  180

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFT  204
              +I     G+ +++      +    
Sbjct  181  IYFIVLDCCGMGKALHRYWPSIHKTW  206


>MBE6716609.1 hypothetical protein [Ruminococcaceae bacterium]
Length=304

 Score = 43.6 bits (99),  Expect = 0.15, Method: Composition-based stats.
 Identities = 29/307 (9%), Positives = 67/307 (22%), Gaps = 9/307 (3%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            I+     +  +  +    L     L     +    + L  +  +   +    G + I I 
Sbjct  7    IFWFFEDIFESIGWFIEDLMYELDLEDLFYSLTGGMSLVMIMAVFGIVFLFFGVIAIGIY  66

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
              +          L +  +  L  I  +  +    LL             F      D+ 
Sbjct  67   VLNAVAISKFSKKLGYNTNTGLAWIPFLQGIFVIYLLSKA-----SGRNDFRIDPKIDEK  121

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
                     +  L   +    F    +   +S  + F+   IP+VG   +     ++   
Sbjct  122  FKIQD--RNTSFL--YYVLVYFLGTAVATTVSSIVGFILGLIPFVGVILSPLVGFVIGLI  177

Query  304  SFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLL  363
                  +       +                         I  +L + L          +
Sbjct  178  PAAAMGIFKFVYLRDCLDLLKEDKNTNKTTAIIITIADHFIGLVLPIYLLTLLKCDPLPV  237

Query  364  SAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLF  423
            +     +           + N   PE   +   A      +  +                
Sbjct  238  NHAPYEEYEQYGYDPNGYNSNGYNPEGYNQYGYAPEGYEQNTYQNNGYAPNNYNNIQGNE  297

Query  424  ADRFWAD  430
               +   
Sbjct  298  QPPYQQF  304


>MXX43460.1 hypothetical protein [Acidimicrobiales bacterium]MYI09376.1 hypothetical 
protein [Acidimicrobiales bacterium]
Length=256

 Score = 43.2 bits (98),  Expect = 0.16, Method: Composition-based stats.
 Identities = 26/181 (14%), Positives = 59/181 (33%), Gaps = 25/181 (14%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
              +A   A     L+    +T   F  +    +    ++   +R +     L +LL LV+
Sbjct  54   LLFAFGFACCLGFLVQEIAVTRMAFDDVRGRPINFGETLMAAIRRMPKIFGLSVLLGLVM  113

Query  215  -------------------GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
                                   L  ++  ++F         +   +    + +  +   
Sbjct  114  ALVGCGLSVLLLWAAPVLGVLWLLAYLVAFVVFFPLLVVYFVMAYVEPR--IPSPRRWWR  171

Query  256  LVSGHWWAIFGRFVLLL----VISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
            L+ G   AI+GR +LL     ++   L+     +P      +L  +  + P  F+   + 
Sbjct  172  LIRGREAAIWGRVMLLGFAANLVGFGLTAGLGALPLPALYGDLFINAAVFPLIFVLGAVF  231

Query  312  Y  312
            +
Sbjct  232  H  232


>MSR81192.1 hypothetical protein [Gemmataceae bacterium]
Length=281

 Score = 43.2 bits (98),  Expect = 0.16, Method: Composition-based stats.
 Identities = 31/240 (13%), Positives = 69/240 (29%), Gaps = 17/240 (7%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            T+ CP C      P++ L    +  +CP C   L    A+   T  T      P      
Sbjct  56   TIACPKCQTRLMIPNASL---GAKVQCPTCKTILQTQAAKDNSTAPTPKATAKPTAKPPA  112

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
            +  +      +     +  N +F  Q E + +   +      +  + + EL         
Sbjct  113  KPSAPPPSKPTPPPPAKEENFAFQGQEEEDGQYGVNRDAESHRCPSCANELTT-------  165

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
                    +        L  +    L        W  +L  +   L  +  +   ++  +
Sbjct  166  -----AEDVICLTCGYDLRSRTPNRLKKLEHKGFWDYVLWHLLAGLFTVLIIALIVWDVL  220

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
              T +  +   +    ++        L   +   G++  +    +    FFF  +    +
Sbjct  221  VFTKLPGWLGSEESWGNIMVSHGFFRLWNTIFSIGAIWAMGKYAI--ARFFFEPHPPEVE  278


>NLK08425.1 zinc ribbon domain-containing protein [Firmicutes bacterium]
Length=280

 Score = 43.2 bits (98),  Expect = 0.16, Method: Composition-based stats.
 Identities = 28/318 (9%), Positives = 76/318 (24%), Gaps = 44/318 (14%)

Query  6    CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
            C +CG      ++                    +         +    T  +      + 
Sbjct  7    CTNCGKGAAADANYCSRCGYDINPQVKSGVSTRETVSEAAECLSKIQFTKFYGYGAVTLL  66

Query  66   SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIY  125
               +E     V         C         S                +F      +L   
Sbjct  67   MLVVEGLKLNVMDFWMREIVCGMRGMRAFTST--------------LIFQFLVVVILEFA  112

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
             +  ++ F+ +           L    +  +  I      Y L+    +   +   +   
Sbjct  113  AIAYMIKFSLMVYPKKYMSLIDLITPVKELRKTIRRYFFTYALIAAVNLCILICAIL---  169

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG  245
                     + +R   +   +  L+++ +   +                  ++  D N+ 
Sbjct  170  ----IMEFLVDVRKFTAIYRVPTLMLIAIYIIN-----------TKLILSPFLALDQNLH  214

Query  246  GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSF  305
               A+++   L  GH   + G  +++ +++   +                 + +  P S 
Sbjct  215  PSLAIKEGMRLTKGHNGEMLGWALIIFILNGIGTKAV------------IGTAVTIPISV  262

Query  306  LYYYLIYSDLKANYRGPQ  323
                 +Y  L+       
Sbjct  263  YLLTRMYYKLRRENPNLD  280


>NIQ98473.1 hypothetical protein [Desulfuromonadales bacterium]NIR34368.1 
hypothetical protein [Desulfuromonadales bacterium]NIS44334.1 
hypothetical protein [Desulfuromonadales bacterium]
Length=109

 Score = 40.9 bits (92),  Expect = 0.16, Method: Composition-based stats.
 Identities = 14/85 (16%), Positives = 23/85 (27%), Gaps = 0/85 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CPHC   ++     +P   S   CPEC     F   + +  +     A    C    
Sbjct  2   QITCPHCRFSKDVNQQWIPQNPSQVECPECQGIFYFTEKDGEMLEKPQPPAPRQTCPSCG  61

Query  63  RIPSDRLEIQSKTVNCRRCNRSFCL  87
                     +  +   R       
Sbjct  62  LDQPPGRSCVNCGIVFARHEAEPHP  86


>MBI1920332.1 zinc-ribbon domain-containing protein [Geobacter sp.]
Length=311

 Score = 43.2 bits (98),  Expect = 0.16, Method: Composition-based stats.
 Identities = 12/38 (32%), Positives = 18/38 (47%), Gaps = 0/38 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
             V CPHC   R+  + K+P +     CP+C +   F 
Sbjct  4   VRVSCPHCSFSRDLTADKIPDRPVRVTCPKCKEAFEFR  41


>PIP32948.1 hypothetical protein COX23_02015 [Candidatus Gottesmanbacteria 
bacterium CG23_combo_of_CG06-09_8_20_14_all_37_19]
Length=280

 Score = 43.2 bits (98),  Expect = 0.16, Method: Composition-based stats.
 Identities = 28/155 (18%), Positives = 57/155 (37%), Gaps = 11/155 (7%)

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
            +   L      ++       + +   + L L       L+L +  L     +        
Sbjct  133  VFFYLKQGFLHVWKLWLGVLISVLMIIGLALAVKLIILLVLGIFNLPQIIWNYYSQFLFN  192

Query  227  LFCVWFF----FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
            +    F     F  Y++ +   G   +  +SR LVS H  A+     L  +       L 
Sbjct  193  IAAFAFISNIIFSPYIVIEQKKGVFDSFRESRALVSEHLGAVMVHTFLFFL-------LI  245

Query  283  ARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKA  317
              + ++G +  L    ++ PF  +Y+Y++Y+ +KA
Sbjct  246  TLVDFLGRSGKLLLVFVILPFVQIYFYMLYTKIKA  280


>RAI38450.1 hypothetical protein CH341_27855, partial [Rhodoplanes roseus]
Length=37

 Score = 38.6 bits (86),  Expect = 0.16, Method: Composition-based stats.
 Identities = 8/34 (24%), Positives = 14/34 (41%), Gaps = 1/34 (3%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP+C A    P + +  +  + RC  C    
Sbjct  2   KIVCPNCEAWYEVPEASVG-RGRTVRCISCQTVW  34


>NLA43499.1 hypothetical protein [Candidatus Saccharibacteria bacterium]
Length=269

 Score = 43.2 bits (98),  Expect = 0.16, Method: Composition-based stats.
 Identities = 16/173 (9%), Positives = 45/173 (26%), Gaps = 11/173 (6%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +  + ++ +    +            W                  + LL +  +    F 
Sbjct  74   IYELVVITLCFELSYRGEKANALELAWAGFVRLKSFLHPSGFIYFFALLVIVTLLDFPFA  133

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                +  G+   +         F  +            +   +  L +   + F  + + 
Sbjct  134  SSLVSVTGIPDYLHNYFERHPVFLAVG-----------VGFAVLVLWWFGAYVFSLHYII  182

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
                   QA+ K++ ++ G    I  R  L  ++      +   +  +  A  
Sbjct  183  VGKTTARQAMRKTKAIIKGRRILINLRIFLWYILVFAALLVAYGLVLLLFAGA  235


>MBC8200311.1 zinc-ribbon domain-containing protein [Desulfobacteraceae bacterium]
Length=196

 Score = 42.4 bits (96),  Expect = 0.16, Method: Composition-based stats.
 Identities = 19/179 (11%), Positives = 34/179 (19%), Gaps = 8/179 (4%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  + CP CGAE +  +   P            +       +              +   
Sbjct  1    MIVI-CPECGAEISEEADLCPHCGLK-------RAGWRSILKRDFEHNLSETQKRMNDEK  52

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                    LE+                       A        S                
Sbjct  53   CPFCRHKGLELNHYDHIESWPIWKCPNCNLDLSTAYDCRRHIKSYSATIYVSFIGLMSSL  112

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
            +  +     V     IF+A  +     L          IL+       +    +    +
Sbjct  113  MGLVLHSLFVYFLGAIFTAAGVIILIILEEFEGIIAILILVLAFIISPVCAFMLIIFSY  171


>MRR11338.1 hypothetical protein [bacterium]
Length=244

 Score = 42.8 bits (97),  Expect = 0.16, Method: Composition-based stats.
 Identities = 30/230 (13%), Positives = 71/230 (31%), Gaps = 8/230 (3%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
              R      + ++ ++     +  + L                        + +L    +
Sbjct  14   FCRQHAFSVLCIVLVIYIPIELVLSALPFDDEQFLKSTAREFRLQRTLEALFGVLCSMAL  73

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
               +     K  +    S+ L      +      LL L+  G  L L++P +   +   F
Sbjct  74   AHLVLADHEKRILTTRDSLWLAASRWRASLGTQFLLGLLYLGALLALVLPLVFVGIATVF  133

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIF----GRFVLLLVISLTLSFLTARIP--YV  288
               ++A  +  G+ A++ S  LV G WWA+F       +  L ++         +P  Y+
Sbjct  134  TVPLVALRDHSGVCAIKASWALVRGRWWAVFRLVALLVLAELGLAALAIAPFLFLPEHYL  193

Query  289  GEAANLAFSLLLTPFSFLYY--YLIYSDLKANYRGPQHPPIKRQWLPLTA  336
             + A+     ++     +     +++ +           P       +  
Sbjct  194  LDVASSLLLDIVAALFIVATVKAMLHLEAHPVAEARSAAPALEPMPSIRP  243


>MBL92324.1 hypothetical protein [Myxococcales bacterium]
Length=1256

 Score = 44.0 bits (100),  Expect = 0.17, Method: Composition-based stats.
 Identities = 8/85 (9%), Positives = 21/85 (25%), Gaps = 0/85 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP C +E      ++ +  +  RC  C     +   + +      +           
Sbjct  2   LISCPECKSEYELTGDEIGSDGAKIRCAVCDHLFKYAIDQEEGPWVIRHPTGATIEVADL  61

Query  63  RIPSDRLEIQSKTVNCRRCNRSFCL  87
                 +  Q    +      +   
Sbjct  62  SQIQIWILEQKVFDHDEVKQGNNDW  86


>XP_016435567.1 PREDICTED: uncharacterized protein LOC107761799 [Nicotiana tabacum]
Length=734

 Score = 44.0 bits (100),  Expect = 0.17, Method: Composition-based stats.
 Identities = 26/234 (11%), Positives = 59/234 (25%), Gaps = 5/234 (2%)

Query  87   LQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPAT  146
                                       F      +  ++L+ + L      +   +  + 
Sbjct  455  YPSFHLALFHPDYDFISFAQPHLFLSNFEIIVPTMYSLFLVLLFLCAVATTTYSAVHASY  514

Query  147  WLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL  206
                   +   +I  +    +   +   T  + I +    +            +      
Sbjct  515  GRPINLVSSIKSIRKSFFPLLSTLVISQTIFISITLFFALILTIVVQIFQALELIELKYD  574

Query  207  LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG  266
                +  V    ++L+   L   V +     +   +   G + L KS  LV G  W  FG
Sbjct  575  SNHFLFWVVPALIVLVPVLLWLQVNWSLAYVIAVVETKWGYETLRKSAYLVKGQRWVAFG  634

Query  267  RFVLLLVISLTLSFLTARIPYVGEAA-----NLAFSLLLTPFSFLYYYLIYSDL  315
              +        L    +    +  AA          +L T    +  Y++ +  
Sbjct  635  IHLYYGFSMEMLMVCGSMFIVIVGAAKGNQWMSLGVILQTMLVSVMGYIMMNQY  688


>WP_015837784.1 zinc-ribbon domain-containing protein [Geobacter sp. M21]ACT18552.1 
MJ0042 family finger-like protein [Geobacter sp. M21]
Length=442

 Score = 43.6 bits (99),  Expect = 0.17, Method: Composition-based stats.
 Identities = 30/200 (15%), Positives = 56/200 (28%), Gaps = 9/200 (5%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             + CP C    +   +K+P K ++ +CP C +     PAE    Q T     CP CG+++
Sbjct  2    KIECPSCHYAADVDPAKIPPKGANTKCPRCSKVFFVGPAEPVAMQDTSASIVCPKCGVEQ  61

Query  63   RIPSDRLEIQSKTVNCRRCNRSF---------CLQPEREFRASGSGLRSISQLLADSWEL  113
                            R                   +    A  +         A ++  
Sbjct  62   PAADSCASCGIIYEKYRAVQERRLQSEAGDKDEPVTKVAPAAPQTEAAPFPDTDAATFHF  121

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
               RG     I    +     P  S +       +  +      A+    +   L  + +
Sbjct  122  GHGRGELFKWIADGHLAAEALPQASRIAGMLPGPMEWRRFLDGLALWSGAIFLALAVIFF  181

Query  174  MTGSMFIYICKTDVGLFRSM  193
               +       T  G+   +
Sbjct  182  FAYNWKELGSFTRFGIAELL  201


>XP_014671178.1 PREDICTED: anoctamin-4-like [Priapulus caudatus]
Length=509

 Score = 43.6 bits (99),  Expect = 0.17, Method: Composition-based stats.
 Identities = 50/509 (10%), Positives = 105/509 (21%), Gaps = 25/509 (5%)

Query  6    CPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIP  65
            CP C         K P    +  C     T +FD   +        +         +R  
Sbjct  2    CPQC-------DVKCPYWYINETCTYTRFTFLFDNPATVFFAVFMAVWATLFLEFWQRRQ  54

Query  66   SDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIY  125
                                  + +   +      +     +  S     +        +
Sbjct  55   HVLAWDWDVQNFDDEETVRPQYEKQATEKKLNPVTQQEQPHIPWSTRFIAKTYSFCSLFF  114

Query  126  LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKT  185
            LL +V+A         +   T L    ++          +      + +   + I I   
Sbjct  115  LLVLVMAAVFGVIVYRIIIGTLLYMNEKDAVRNTSSIVTSI----TASIINLIVIMILGQ  170

Query  186  DVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF-----------F  234
               L   +   +     + LL +          L++ +  ++    FF            
Sbjct  171  VYKLLAKLFTDMGRYNYYALLDLYCDPSGCMVELVIQLTTIMIGKQFFNNIKEIVMPKLM  230

Query  235  CQYVLADDNIGGLQALEKSRL---LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
                          A   S     LV G     F    L +VI      L      +   
Sbjct  231  KWIKKILKGETEASAPRTSWERDYLVQGDPLLGFFDEYLEMVIQYGFITLFVAAFPLAPL  290

Query  292  ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVS  351
              L  +++        Y + Y    A                   A+     +       
Sbjct  291  FALLNNIMEIRIDAYKYTVEYRRPLAARAQDIGIWYSILRSITILAVIFNAFVISTTSDF  350

Query  352  LSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTS  411
            + R         +   +           T D   +   +   ++           R    
Sbjct  351  IQRLVYMYGYSDNKNLEGYMNNSLSVFATKDFPANTGPDYSHITINTELRDTEFCRYRGY  410

Query  412  EGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDL  471
                          + W        ++ +       L+             V D   R+ 
Sbjct  411  RFPPGDVKEYQVNLQHWHIFTAQLAFVVVFEHVVFFLAWFIAVIIPNVPQSVKDQMEREK  470

Query  472  YDRQHSFEHPAFHWVGINQTDENDLFSGI  500
            Y  +              +  ++      
Sbjct  471  YLAKTVIYKLESERAKKARKGQDSSIPES  499


>MAR09255.1 hypothetical protein [Blastopirellula sp.]
Length=493

 Score = 43.6 bits (99),  Expect = 0.17, Method: Composition-based stats.
 Identities = 12/75 (16%), Positives = 20/75 (27%), Gaps = 0/75 (0%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M   +C  CGA+   P +    K   + C    Q              + N + C     
Sbjct  1   MIEFQCEKCGAQMQAPEASAGKKGRCSECGSIQQIPSAPAEAPSLAVISLNCSECGEELR  60

Query  61  QRRIPSDRLEIQSKT  75
                + +     K 
Sbjct  61  VPAANAGKKGKCPKC  75


>NLG07741.1 hypothetical protein [Deinococcales bacterium]
Length=289

 Score = 43.2 bits (98),  Expect = 0.17, Method: Composition-based stats.
 Identities = 30/183 (16%), Positives = 55/183 (30%), Gaps = 13/183 (7%)

Query  157  WAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG-  215
            W     T A   L                 V  +  +   +R   +F LL  L I+ +  
Sbjct  95   WVEAGVTAALTALFAGAFAAYALRRAVGLPV-RYTMLADHMRFFPTFLLLEGLAIVALLV  153

Query  216  ---GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
                G + ++   ++  V F F   ++ D   G  +AL  S  +V  +        V + 
Sbjct  154  LRPLGPMAVLTAWVVGSVAFAFTDLIVVDRGAGVGEALRGSLRVVGANLGQTVLLLVTVS  213

Query  273  VISLTLS--------FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQH  324
             +S  L+        +L   +  V        ++LL     +     + D          
Sbjct  214  ALSFLLNAPLLLSLRYLPPAVVAVLGLVAGLVAILLRALLSVALACAFRDAVGIRTAGGE  273

Query  325  PPI  327
             P 
Sbjct  274  LPH  276


>WP_112206957.1 zinc-ribbon domain-containing protein [Lactobacillus sakei]SPS07559.1 
hypothetical protein LAS9624_01815 [Lactobacillus sakei]
Length=287

 Score = 43.2 bits (98),  Expect = 0.17, Method: Composition-based stats.
 Identities = 37/296 (13%), Positives = 83/296 (28%), Gaps = 12/296 (4%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
            +CP+CG++  T +   P+   +        T    P    +      +           +
Sbjct  2    KCPNCGSQLKTGAKFCPSCGKAV----AVTTPTSAPTSPVKPTPVSPVQPMAKQNAGTTV  57

Query  65   PSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI  124
              D  +++++  N           P      S +    I  ++ +    F    +  L +
Sbjct  58   SIDTQDLKNQAKNYWSYFVEGLKAPSTTLTQSNNWFGYIQFIILNVLIAFIPTHYFALAM  117

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
              L +          +                       +    L  S +   +   I +
Sbjct  118  NTLEMAKKSLYGGDEVATVVGEMAKRIGVGSSVPEFFFRMFIYSLIFSVIYVLVGFLIVR  177

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
              +    S+               LL + +    L  II  +     +     ++   ++
Sbjct  178  GIMKNEVSLGQYTTRF------GGLLSIEIAILLLADIILQVSSAGNWILVLVLILLASM  231

Query  245  GGLQALEKSRLLVSGHWWAI-FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
                A     LL   H     F   ++ LVI + L ++ A+I  +    +   S+ 
Sbjct  232  VFSAAFNAVILLHDNHSSINKFHLLLVALVIVMVLIYVVAQIGGI-NLVSNISSMY  286


>MBE6883006.1 DUF975 family protein [Ruminococcaceae bacterium]
Length=384

 Score = 43.6 bits (99),  Expect = 0.17, Method: Composition-based stats.
 Identities = 23/154 (15%), Positives = 45/154 (29%), Gaps = 6/154 (4%)

Query  147  WLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL  206
            + +   Q               L  + + G     +     G      LG  ++     +
Sbjct  225  FGSIWFQISLGIRCAVWFLLFSLPGAAILGVQLYLLGAFSKGFTLPEVLGNDYLLLIPAV  284

Query  207  LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG  266
             +L+   +  G  LL         +F+       D  +    A+++S  +++G    + G
Sbjct  285  ALLVFSWIICGIFLLRYFA---APYFWTSSRENMDTKLTPAAAMKESVKVMNGRKLKLLG  341

Query  267  RFVLLLVISLTLSFL---TARIPYVGEAANLAFS  297
                     L L F+      IPY         S
Sbjct  342  YVFTFAGWGLLLVFILPALYVIPYYQAFLAAFVS  375


>NIA06755.1 hypothetical protein [Actinobacteria bacterium]
Length=184

 Score = 42.1 bits (95),  Expect = 0.17, Method: Composition-based stats.
 Identities = 17/167 (10%), Positives = 39/167 (23%), Gaps = 4/167 (2%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M    C +CG +   P      K     C            +     T+++         
Sbjct  1    MIRFNCTYCGKKIEVPDEYAAKKGRCPSCSHINVIPSPSEGQVPAVNTSNHPNHDQSNPD  60

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLR----SISQLLADSWELFCR  116
                 +D+  + +K ++ R+C        +   +    G         +L     + +  
Sbjct  61   LSSADNDKPYVVAKDLDTRKCPYCAEEIQDEAIKCRFCGEFLVESKTCKLGKSKTKWYFS  120

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
                ++ +  LG +      F                          
Sbjct  121  TRVVVIALLCLGPLALPLVWFHPHYKIITKLAVTVVVIAVTVWCYFY  167


>MSU80190.1 hypothetical protein [Gemmataceae bacterium]
Length=272

 Score = 43.2 bits (98),  Expect = 0.18, Method: Composition-based stats.
 Identities = 19/249 (8%), Positives = 60/249 (24%), Gaps = 11/249 (4%)

Query  93   FRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQN  152
                      +   +     L    G   + +  +   +    + +   L     +  + 
Sbjct  22   GWRLIKPNYWLFLGICFVGSLIAGMGPMGILMGPMMCGIHLCLLRAERGLPIDFGMLFKG  81

Query  153  QNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL  212
             ++     +A +  ++  +  +  +  ++      G+                    ++ 
Sbjct  82   FDYFLPSFVAALFIMVPMIVILVATYILFFFGMFAGIAAFGPQQRGGGPPGDAFGYYMMS  141

Query  213  VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
            +     + +++   +    F F   ++ D  + G  A+  S     G+++ +        
Sbjct  142  LFAIMYVSILVTSTVLHAVFLFVFPLIVDRELSGWDAVVLSVRAFLGNFFGVLALV----  197

Query  273  VISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWL  332
                    L   +  +   A         P S       Y  +      P          
Sbjct  198  -------ILVMFLSLLSLLACYVGVFFFLPLSMAMSMAAYRQVFPELESPLDRLHDFDND  250

Query  333  PLTAAIFGW  341
                    W
Sbjct  251  DDLPPAPKW  259


>PJN93938.1 hypothetical protein CNY89_17450, partial [Amaricoccus sp. HAR-UPW-R2A-40]
Length=50

 Score = 39.0 bits (87),  Expect = 0.18, Method: Composition-based stats.
 Identities = 9/35 (26%), Positives = 13/35 (37%), Gaps = 0/35 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLI  37
            + CP+C A       +L    S  +C EC     
Sbjct  2   RLSCPNCAAIYELSEDRLTPGGSHVQCSECHTRWF  36


>NRB21845.1 hypothetical protein [Candidatus Dependentiae bacterium]
Length=297

 Score = 43.2 bits (98),  Expect = 0.18, Method: Composition-based stats.
 Identities = 23/205 (11%), Positives = 57/205 (28%), Gaps = 6/205 (3%)

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWM  174
                  +  IYL+   +    I  A  +          +         +   +   +S +
Sbjct  92   WLHYIWIPFIYLVFNFIHVLFICYAAQIMRTKKAGNIEKALIQTNTSLSDIALWALISTL  151

Query  175  TGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFF  234
                   +      +    +           +L      +    LL      L+ ++ + 
Sbjct  152  IIFPLSLLWSFTKQVTFIKEFMFEDHPFVAFVLTSPSFSIFILFLLF-----LYQIFSYC  206

Query  235  CQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL  294
                +A +N   +QAL+ S  ++      I G  ++L ++  + +        V     +
Sbjct  207  VLPAIALENQSFMQALKHSWHMIKHMGVFILGGTLILSLVWGSATSAFNPFLSVFAV-PI  265

Query  295  AFSLLLTPFSFLYYYLIYSDLKANY  319
              +     F+   YY    +     
Sbjct  266  VINAAALIFAMTAYYECKQEKNGQK  290


>MBE7706338.1 hypothetical protein [Cyanobacteria bacterium SIG30]
Length=261

 Score = 42.8 bits (97),  Expect = 0.18, Method: Composition-based stats.
 Identities = 26/118 (22%), Positives = 49/118 (42%), Gaps = 1/118 (1%)

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
             L  +   +    LL + +     L ++  ++F  +      V+ +D+ G +QAL+ S  
Sbjct  137  MLLGMTFASSSSNLLFITIPFSLALFVLFFMIFVYFMLAGPIVVLEDDKGAVQALKLSYN  196

Query  256  LVSGHWWAIFGRFV-LLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
            +V      I    V +L+ I + ++     IP +G   N   +LL      L+   IY
Sbjct  197  MVKEKKLFIPIIVVSILIFIPILIASFFTNIPVIGYVINFVLNLLSGVLGALWPVYIY  254


>MBI3185600.1 adventurous gliding motility protein GltJ [Myxococcales bacterium]
Length=679

 Score = 44.0 bits (100),  Expect = 0.18, Method: Composition-based stats.
 Identities = 9/32 (28%), Positives = 12/32 (38%), Gaps = 0/32 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
              C  C A+      K+ AK    RC +C  
Sbjct  2   RFVCDSCRAQYMISDEKVGAKGVKVRCKKCGY  33


>CRH97083.1 membrane protein [Streptococcus pneumoniae]
Length=197

 Score = 42.4 bits (96),  Expect = 0.19, Method: Composition-based stats.
 Identities = 22/194 (11%), Positives = 55/194 (28%), Gaps = 12/194 (6%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
                Y + I+L+ A            +L    +       L         +     +M  
Sbjct  9    FGIFYCIMIILSNASYGITSYGYTNVFLQISKREDAKVDYLFEGFRGFKRMMKTMWAMLA  68

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +  T   +   +      +G        +   V     ++ +  + F     +   +  
Sbjct  69   ILLYTGTWIPMLLIGLFALLGEEGNTSFAIAFFVLLAISIVGMIVMYFSYALTYYVMIEN  128

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
             +     QA+++S+ L+ GH   +F  ++  +  ++           +         L L
Sbjct  129  PE-YSVSQAMKESKNLMKGHKLDLFLLWLSFIGWAI-----------LALLTFGIGFLWL  176

Query  301  TPFSFLYYYLIYSD  314
            +P+        Y  
Sbjct  177  SPYMSTTTAHFYRY  190


>XP_010027371.1 PREDICTED: uncharacterized protein LOC104417864 [Eucalyptus grandis]
Length=168

 Score = 42.1 bits (95),  Expect = 0.19, Method: Composition-based stats.
 Identities = 12/141 (9%), Positives = 38/141 (27%), Gaps = 6/141 (4%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
             ++   S+   +  +   + +     V   +++  ++A+ KS  L+  +   +     ++
Sbjct  23   FLLAVLSIAYALGFMYIIMAWDMANVVSVLEDVYVIKAIMKSNGLIKHNIGTVVFISYVI  82

Query  272  LVISLTLSFLTARIPYVGEAA------NLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
            +   L L                            +  S +  Y + +     +      
Sbjct  83   VYFYLVLQLPILLFVMFDVVVRERVLRIGIGVAYFSLMSMVVMYFVCNSNVGPHENIDKS  142

Query  326  PIKRQWLPLTAAIFGWMLIPG  346
             +              + I G
Sbjct  143  FLAEHLQAYLGGECAPLKING  163


>MBI4691221.1 histone deacetylase [Nitrospirae bacterium]
Length=633

 Score = 43.6 bits (99),  Expect = 0.19, Method: Composition-based stats.
 Identities = 25/201 (12%), Positives = 56/201 (28%), Gaps = 6/201 (3%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                LL +  +GI            +     L      +           +         
Sbjct  379  FFVYLLVVASIGIYAFGGSAGLISRIVREHSLRFNLNTFFSDGKRLFFPLVGFTAIIGAI  438

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             + +       G   +  + +      TL L L I        + +I  +       +  
Sbjct  439  FIVVAFVLGVFGGGAAAIISIAKEYGATLALFLGIFFALLLLCVGLILIVCTIALTLYGT  498

Query  237  YVLADDNIGGLQALEKSRLLVSGH------WWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
              +   + G L+A +++   +  H      +  +FG ++L+  + +        IP VG 
Sbjct  499  AAIVFKDAGPLRATKETIKYLYRHLSALWLYCIVFGGYILISFLLILFGAPFNLIPIVGT  558

Query  291  AANLAFSLLLTPFSFLYYYLI  311
               + + L           +I
Sbjct  559  IIAIPYQLFSYIVQSYLGLVI  579


>XP_020899657.1 uncharacterized protein LOC110238333 [Exaiptasia diaphana]XP_020899665.1 
uncharacterized protein LOC110238333 [Exaiptasia 
diaphana]XP_020899674.1 uncharacterized protein LOC110238333 
[Exaiptasia diaphana]XP_028514761.1 uncharacterized protein 
LOC110238333 [Exaiptasia diaphana]
Length=2434

 Score = 44.0 bits (100),  Expect = 0.19, Method: Composition-based stats.
 Identities = 49/530 (9%), Positives = 125/530 (24%), Gaps = 24/530 (5%)

Query  8     HCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSD  67
             HC AE +               P     + +        +            +       
Sbjct  1748  HCYAEYSMSDEDESTPYLPGWKPFTGNKMDWANISKLCPKPWRYSTAKEIYFVPTWGYHH  1807

Query  68    RLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLL  127
                      +    N +         +      R+ + +L  +           +  Y  
Sbjct  1808  LYGGGGYVADLGYKNSTRSFVTSPLRKNGWFDPRTRALILEFAIFNPSTNQISSVAFYYE  1867

Query  128   GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
              +   F   F+ +           + ++     L  + ++L  L+     ++        
Sbjct  1868  VLPTLFGDPFTRIETMTIYGSATGSHDFFLICKLLFIFFVLFYLAREAFKVYKVRKIYFK  1927

Query  188   GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIG--  245
               +   ++    +    ++  L+   +   ++  +       V F         +++   
Sbjct  1928  NFWNWFEIFQIILVVAVVVCYLVKESMILEAVQKLQANPFITVNFQPAIMWKYAEDVTLS  1987

Query  246   --GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
                  A  K   ++  +   I     L    +  LSF                       
Sbjct  1988  MAVFIATVKILRMLRFNPHIIIFNSSLRRCRATLLSF---------SVLLFVIMFGFALL  2038

Query  304   SFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLL  363
               L         +          +      ++              + +  + L A   +
Sbjct  2039  GHLALGSTMPQYRTLKHSLYTQILVSLGQHMSTTELRDASFFLGNAMDMIYKMLMAFYFV  2098

Query  364   SAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLF  423
             +    +      + +   D N    E  + +      +     RK  +E  + L  V   
Sbjct  2099  NFYIAVINDSYEETKSDTDYNAKQFEMSEFIIERLVDMFFKSFRKVKNESVIELSTVPQH  2158

Query  424   ADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAF  483
                   +D+N       + +        + G  R  +             +Q        
Sbjct  2159  VSSVRLEDENLGKDNGKDGNGNGKDIGKENGEERPAVTS----------SQQDEDVEAMI  2208

Query  484   HWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIESLQ  533
               +  N T+  +      +I            S+    ++TLP  IE++ 
Sbjct  2209  XEIERNSTEXGNFSKVE-TISEATDNPPSTSKSVESSKQITLPAIIETVD  2257


>WP_192031542.1 hypothetical protein [Pseudoxanthomonas sp. CAU 1598]MBD8528122.1 
hypothetical protein [Pseudoxanthomonas sp. CAU 1598]
Length=288

 Score = 43.2 bits (98),  Expect = 0.19, Method: Composition-based stats.
 Identities = 23/174 (13%), Positives = 49/174 (28%), Gaps = 29/174 (17%)

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV-  214
             W +        + G       +          + ++++  LR + + T   ++  L + 
Sbjct  73   YWLLWALIFVVSVGGQLLAVLRLQQRATAQPQSIRQNLEQVLRRLPAATGAYLIYFLFLA  132

Query  215  --------------------------GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
                                         ++LL I    F +   F  +    +    L 
Sbjct  133  VCLAPWIAHLVWLIGQPFSADALGLFLLSTVLLWIIPTWFSLAGVFFLFAAGLEGCAALA  192

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            +L +S  L+  HWW       ++L+       L   +P +   A         P
Sbjct  193  SLRRSLALIRRHWWHTSAVIGVVLLAYT--GILLVALPLIMSLAMGVSYFRHGP  244


>HFS54482.1 hypothetical protein [Planctomycetes bacterium]
Length=349

 Score = 43.2 bits (98),  Expect = 0.19, Method: Composition-based stats.
 Identities = 50/341 (15%), Positives = 77/341 (23%), Gaps = 25/341 (7%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M    CP CG     P +  PA    A   +       D            +        
Sbjct  4    MIP--CPRCGRTLRVPETSAPAIGCPACGHKFTPRPTADVTAEPLPVAIPEVEPADDPEE  61

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                P  R + Q                      A  + L +   LL             
Sbjct  62   AADRPKKRRKDQPDVGKSLSSLIREVRSGSDRPGAESARLFAGLSLLLVGIAFGVLSELI  121

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWA----------ILLATVAYILLG  170
            +L + L     A         L     L         A            LA  A  L  
Sbjct  122  VLLVRLAAPYDAMLLAVPPGFLHWLLALLGLGLCAWAATTLEARWLALATLAVSALQLFL  181

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
               +T      I    V  +  +   L    + T       ++   G+ L     L+   
Sbjct  182  AGIITLPTKHPIAGEWVWHWSRLASSLELFLNITAPGFTPGIMAWLGT-LTGFLELVRWA  240

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG-  289
                    +A        AL     LV G         V+  VI   +        + G 
Sbjct  241  LLAVTLRFVAVRLKAPSVAL-HCLYLVIGLGGGTLVLMVINAVIRSMIRGTLMGSGWEGR  299

Query  290  ----------EAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                         N   +L L  F+    + +++ +    R
Sbjct  300  HTMNFLFDLQFFLNAVATLALMGFTGFLVFQLWNKVWKRRR  340


>KKR97207.1 hypothetical protein UU49_C0034G0010 [Candidatus Magasanikbacteria 
bacterium GW2011_GWC2_41_17]
Length=153

 Score = 41.7 bits (94),  Expect = 0.19, Method: Composition-based stats.
 Identities = 15/93 (16%), Positives = 33/93 (35%), Gaps = 7/93 (8%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW-------WAIFG  266
            +    L  I+  L+      +    +  ++    QA+++S  L  GHW         IF 
Sbjct  1    MALIFLFSILAILVCSFLLIYAAAFVILEDYTFGQAIKESWRLFIGHWLVNLEMAIIIFF  60

Query  267  RFVLLLVISLTLSFLTARIPYVGEAANLAFSLL  299
              +L  +I++  + +      +    +L     
Sbjct  61   INLLAGLITIVAAAIIGIPALIVFLFSLFIQFP  93


>MBC8481828.1 hypothetical protein [Planctomycetes bacterium]
Length=321

 Score = 43.2 bits (98),  Expect = 0.19, Method: Composition-based stats.
 Identities = 22/314 (7%), Positives = 64/314 (20%), Gaps = 4/314 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPEC---CQTLIFDPAESQRTQTTDNIATCPH  57
            M   RC  C  +     + L  +    +C +                 T    +      
Sbjct  1    MIIFRCSGCNQKYKADDNFLGKEVICKKCGQPFIVTAVAAPPNGIKNETLKIPDPDQSQD  60

Query  58   CGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRR  117
                         +        +    F         +       +              
Sbjct  61   NHQYEDPRLQTSLVSGTQHYSGKRTYPFIFDLFLYPYSKAGLFMLLIFFGIPFILRLLGL  120

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
                L +  +  +     I     L  A +       +   +  + +       +     
Sbjct  121  LTTALTVVFVPFLPIAILIKILGALINAVFWLYMFWYYGQCVYESALGSTRAPDTMSQTP  180

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
             F  I      +   + +       ++     +  +        I    +  +       
Sbjct  181  AFGEIFSNLFKIIACIAVFTLPAMIYSHTTGRIDNIFFVLVAFGIFCFPMAFLAVIMFDS  240

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
                + I    +++ +     G    +     +    S  L F+      +  A +    
Sbjct  241  FYGLNPILIAASIKSTFFPYIGLCLLLALLQAIFTACSYLL-FIVPFARVLIPALSAIGM  299

Query  298  LLLTPFSFLYYYLI  311
            ++       ++Y  
Sbjct  300  MISGHLLGRFFYKY  313


>QOV87739.1 hypothetical protein IPV69_15760 [Phycisphaerales bacterium]
Length=466

 Score = 43.6 bits (99),  Expect = 0.19, Method: Composition-based stats.
 Identities = 26/193 (13%), Positives = 52/193 (27%), Gaps = 27/193 (14%)

Query  463  VLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLE  522
            V+DD    L   +         ++  N   E+        +         ++  + G   
Sbjct  270  VVDDHGNSLLPDKQIDLAGDEDFISPNFGAESSYS-LSAWLKYPTKNPGRKIAKLRGSAT  328

Query  523  LTLPLAIESLQLTRNDI-GKTLQIGGKQLILQRL----GSNAVTLRFLGDRTD-------  570
             T+ +  E L +    + G T    G  +    L     +  + +   G+          
Sbjct  329  FTVQVESEKLDVPLKSLRGTTRTFKGLPMTFGELQKVGDNWQLKISMAGNDQHPAWDDLQ  388

Query  571  ---LLNVHASNSHAEPLREIGFTWQKSGDAFSLRQMF-----------DGNIESITVLVA  616
               +  +   +S  +PL   GF     G    +   F            G    +   + 
Sbjct  389  NSIMSQLKVVDSKGQPLDHHGFGSGGGGAGTEITVSFGTSHRPEDGRQSGEPARVVWEIP  448

Query  617  GDSMTQSYPFELT  629
              +     PFE  
Sbjct  449  TKTRALQVPFEFK  461


>MBI1292689.1 hypothetical protein [bacterium]
Length=289

 Score = 42.8 bits (97),  Expect = 0.20, Method: Composition-based stats.
 Identities = 30/227 (13%), Positives = 63/227 (28%), Gaps = 4/227 (2%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
            +   +        +L+  A+      +   +  +LA        L  M  ++   +    
Sbjct  16   VKPPVHQPGPLGVMLMPLASLGFVLRRPRLYGWVLAPFLINTALLIGMWSAIGGLLVDPV  75

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG-LLFCVWFFFCQYVLADD---  242
            V         L         LILL+L          I   ++   +  F    + D+   
Sbjct  76   VDFAAEHFSWLGATALAAARLILLLLAFLLSLFASYILFAIISSPFNDFMTEKIEDELLA  135

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
            +   L+A     +    H     G  + +++  + ++F+   IP+VG       SL    
Sbjct  136  DYPHLKATPLPIVKAILHALCEAGIRICIVLPLVVVAFVLGFIPFVGPIIAGGISLANGV  195

Query  303  FSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLL  349
                     Y   +     P      R            + +   + 
Sbjct  196  LFLSIDAYSYCMDRRRIDLPAKMAWLRARRNRWLPFGFGLAVFFAVP  242


>BAJ06951.1 putative uncharacterized protein [uncultured bacterium]BAJ06953.1 
putative uncharacterized protein [uncultured bacterium]
Length=245

 Score = 42.8 bits (97),  Expect = 0.20, Method: Composition-based stats.
 Identities = 20/223 (9%), Positives = 59/223 (26%), Gaps = 20/223 (9%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                   +  +  +     + +++L      +   +        +      L        
Sbjct  9    WQLFTNNLVKILHISLPFLLATSILNGMDQHVWQNSNYLSAITPIIEQLIELSFAPIFIL  68

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF--------  228
             +   +        + +  GL +     ++ IL  + +     L ++  L+         
Sbjct  69   FLSQVVTNETSTKKQLIYNGLTYAPYIIIVFILTFVPMLAFWALELVVELITHKYSSSPF  128

Query  229  ------------CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL  276
                         +   F   ++  +    +++++ S +   G+   I    +   +  L
Sbjct  129  SMIFKVLLTVFVFIKLSFSNCLIVLEGNKPIESIKNSFMFTKGYELKIILSMLAFSIPIL  188

Query  277  TLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
               F+   I       N A S+           L+   +   Y
Sbjct  189  AAGFVGNYILPELIINNFATSIFTYFLFTFLLLLVQVAMFKIY  231


>CVH76891.1 hypothetical protein BN3662_01862 [Clostridiales bacterium CHKCI006]
Length=282

 Score = 42.8 bits (97),  Expect = 0.21, Method: Composition-based stats.
 Identities = 14/154 (9%), Positives = 43/154 (28%), Gaps = 2/154 (1%)

Query  161  LATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL--ILVVGGGS  218
                  +++       S         +   R + +      ++ L    L     V    
Sbjct  100  NVISFLLVIPTILYLASQSGISVGDFLSWVRLIVVDGLENLTYMLDNSFLAESWQVMLSM  159

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            L+  +   +         Y++   ++   +A+ KS  ++ GH   +    ++ +   L  
Sbjct  160  LVSSVISAILSYGLAMVPYLIERYDVSWNEAMMKSWKMMKGHKRDLLFLQLIYIPRYLIY  219

Query  279  SFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
              +      +  A       +    +      ++
Sbjct  220  LIMINFASLLVPATTAFGLAVQLALAIYLPITLW  253


>KPA17729.1 MscS mechanosensitive ion channel, partial [Candidatus Magnetomorum 
sp. HK-1]
Length=128

 Score = 40.9 bits (92),  Expect = 0.21, Method: Composition-based stats.
 Identities = 7/32 (22%), Positives = 11/32 (34%), Gaps = 0/32 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQ  34
            V C  C        S +    +  RC +C +
Sbjct  2   RVECQTCNTRFKLDDSLIKKDGTKVRCSKCKE  33


>MUL56545.1 hypothetical protein [Pseudomonas aeruginosa]
Length=131

 Score = 40.9 bits (92),  Expect = 0.21, Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 40/82 (49%), Gaps = 0/82 (0%)

Query  197  LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLL  256
            LR   +F LL  L  L++  G  LLI+PG+   V   F ++ L       LQAL +S   
Sbjct  3    LRLWPAFALLAALTSLLIVLGLSLLILPGIFVMVKLSFAEFCLVLRGRSPLQALRESFEF  62

Query  257  VSGHWWAIFGRFVLLLVISLTL  278
              G ++ I    +++L+   +L
Sbjct  63   TRGRFFLILACSLVILLPVWSL  84


>OHB57512.1 hypothetical protein A2Y12_16000 [Planctomycetes bacterium GWF2_42_9]
Length=341

 Score = 43.2 bits (98),  Expect = 0.21, Method: Composition-based stats.
 Identities = 29/336 (9%), Positives = 70/336 (21%), Gaps = 19/336 (6%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  V C  CG +   P+          RCP+C +  + + AE       +N        +
Sbjct  10   MIRVECKKCGQKIKAPAE---YAGKRIRCPKCKEAFVLESAEMLSLLEIENNPAAVLQQV  66

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
              +      ++          N +  + P +E   +        +               
Sbjct  67   PVQEELKLQKVVPSDFRLLNPNINSDIVPTKELETNIKPSTVFEESKKRKLPAVIDVFLY  126

Query  121  LLGIY-LLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYI--------LLGL  171
               +  ++ IV+         + +               +       +            
Sbjct  127  PASVSGMINIVVFTVLSLLMGVSRLFLMGIMGFMVRFTIVAYLYFYLVECIRDSATGGVR  186

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG--LLFC  229
            +                    +              +L  +                   
Sbjct  187  APNNIDAMPDSIDEAKSKALEVIASFIIFWGPVFGYMLYKIFTTPRGFPSDPFDNTFWCL  246

Query  230  VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV-  288
            + +    + +    I    +       +     +IF  F+   ++ + L F    I  + 
Sbjct  247  LGYGIIFFPMGILAIAIFDSSSGFNPFIW--ITSIFSTFLQYFILLVILGFFCLLIYLIT  304

Query  289  --GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGP  322
                  +    L L           Y          
Sbjct  305  TRLGIISTPIRLYLLMIMAHILGRFYYLNSERLNWC  340


>KAF1876438.1 hypothetical protein Lal_00029786 [Lupinus albus]
Length=432

 Score = 43.2 bits (98),  Expect = 0.21, Method: Composition-based stats.
 Identities = 24/174 (14%), Positives = 53/174 (30%), Gaps = 0/174 (0%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
              W LL ++    ++                +     +   +      A   +       
Sbjct  72   HEWTLLLVFQFCYLIFLFAFSLLSTAAVVFTVASLYTSKAVSFSSTLSAIPRVFKRLFIT  131

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             +++ +          + L L  V   T   +LL+  +    LL ++  +     +    
Sbjct  132  FLYVTLLMFAYNFVFVLSLFLLIVAIDTDNSLLLLFSIVVILLLFLVVHVYISALWHLAS  191

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
             V   + + G  A++KS  L+ G            LVI   +  + +R+   G 
Sbjct  192  VVSVLEPVYGYAAMKKSCELLKGRAKYAAILVGGYLVICGIIGGVFSRVVVHGG  245


>KAF9586782.1 hypothetical protein IFM89_040000 [Coptis chinensis]
Length=324

 Score = 43.2 bits (98),  Expect = 0.22, Method: Composition-based stats.
 Identities = 15/165 (9%), Positives = 51/165 (31%), Gaps = 0/165 (0%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              +++L  +     +    ++                +    +    +    +    ++ 
Sbjct  97   FPLFMLVDLFLATALSIVSMVAIVYVSAMSYLGKDLTLKDLFLRIRNVWTRSVITWFYVS  156

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +      +       +  +G+    LI+  + +     L +   L   + +     V   
Sbjct  157  LLYASFLVLVLPVSFVVVLGNSRGSLIIAAVGMIVLFCLAMFLHLYLALIWICSMVVSVL  216

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            ++  GLQAL K+  L+ G          +++++      ++  + 
Sbjct  217  EDCHGLQALGKAEKLIKGKKVVGIALTFVVMILYGASFLVSIMVN  261


>WP_013709369.1 zinc-ribbon domain-containing protein [Coriobacterium glomerans]AEB07627.1 
hypothetical protein Corgl_1528 [Coriobacterium 
glomerans PW2]
Length=361

 Score = 43.2 bits (98),  Expect = 0.23, Method: Composition-based stats.
 Identities = 33/305 (11%), Positives = 68/305 (22%), Gaps = 18/305 (6%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
             CPHCG      +    +                D A +            P        
Sbjct  2    ECPHCGNPLKDGARFCGSCGKPV-----------DGAAAPGPHHPSVNPAAPAGPAAAEG  50

Query  65   PSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI  124
                  + +           +         A       I       +      G     I
Sbjct  51   AQRPAPVPAGGPAGGPGGPFYPPAGPSAAAAGSQQPYGIQPSGGFGFSQPGCLGAAWHDI  110

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
                       +  A    P           QW    A      + +   +   F+    
Sbjct  111  TSSPGWFKRILLLMAFNCVPFLNWYANGYCIQWGSDRAVGMQGPMPIGTFSKRAFLSGLC  170

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
              V            +    +  I L +++  G  + +  GL       F ++  A +  
Sbjct  171  LTVLSIMGFFALFAVMPLTWIPFIGLPIIIVFGCFVDMFKGLAVMRMAMFDRFGEAFE--  228

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
                 L            ++F    +  +I   + ++   I  V   A + +S     ++
Sbjct  229  -----LSDIVQKSRRRMGSLFASACVPGIIVGAVLWIIVFILLVSSMARVRYSSDTMSWA  283

Query  305  FLYYY  309
            + +  
Sbjct  284  YNHMA  288


>WP_015284590.1 hypothetical protein [Methanoregula formicica]AGB01626.1 hypothetical 
protein Metfor_0563 [Methanoregula formicica SMSP]
Length=363

 Score = 43.2 bits (98),  Expect = 0.23, Method: Composition-based stats.
 Identities = 24/225 (11%), Positives = 58/225 (26%), Gaps = 8/225 (4%)

Query  90   EREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLN  149
                R +      +  +    + L+       +  + L                  T   
Sbjct  90   MYWTRGNVYYDPFLISIPLGDFILWFDPRLIPILAFCLFCFTFLMAGLIRYRNLIRTEHP  149

Query  150  PQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLI-  208
               +               L ++       + +  T   L       +     +T  L  
Sbjct  150  VTIREGLSGARAHAGPLAALSIAMALAGTILIVVVTSDNLADISIQVMDIFMPYTWFLPD  209

Query  209  -LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGR  267
             L + +      +++   ++  +   +   V+  +N G + AL  S  L+      I G 
Sbjct  210  SLGLSITFWFMSVILCITIILFLAVPYVVPVIVLENKGLVSALGGSITLIRKTRREILGC  269

Query  268  FVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             ++   + L    ++  +        L       P  +    LI 
Sbjct  270  ILVYGALVLLAGAISLVMN------QLLLHYNYDPVFWYSQGLIP  308


>NQX96170.1 hypothetical protein [Erythrobacter sp.]
Length=294

 Score = 42.8 bits (97),  Expect = 0.23, Method: Composition-based stats.
 Identities = 22/106 (21%), Positives = 42/106 (40%), Gaps = 8/106 (8%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
              G++L  +         +    V+  +  +  ++AL +S  L S     +FG +++L V
Sbjct  166  FLGAVLGFLAWFYLTARLYLTLPVMVIEWELNPVKALLRSWRLTSAARSNVFGFWMMLAV  225

Query  274  ISLT-------LSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
                       +S L + IP  G  A+L   L    FS ++  +  
Sbjct  226  AWFVTLMIQSAVSALLSSIPGPGPTADLIQGLFGGLFSMVWGSIYC  271


>MBK77953.1 hypothetical protein [Flavobacteriaceae bacterium]
Length=194

 Score = 42.1 bits (95),  Expect = 0.23, Method: Composition-based stats.
 Identities = 34/182 (19%), Positives = 64/182 (35%), Gaps = 7/182 (4%)

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
            L            ++        P         L     +      +    + + I + +
Sbjct  14   LSGKWLRVLFPFFIVALIPNLYQPSANYSPPIYLAFISLFASGPFIYGGSLLALKISRGE  73

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIG  245
               F  +  G  H     LL +  IL+V  G LLLIIPG+ F + +  C + L ++  + 
Sbjct  74   DFNFEMIFSGFNHFIKTLLLYVSFILIVIAGLLLLIIPGIYFSLKYSMCFFALVENPELS  133

Query  246  GLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANL-AFSLLLTPFS  304
              +AL++S  L+    + +F  ++L  +I      +   I  +G    L    +    F 
Sbjct  134  IGEALQRSGDLMKEDKYKLFLLYLLYFLIV-----IAGLITIIGWLWALPLIYVSSAIFY  188

Query  305  FL  306
              
Sbjct  189  EY  190


>HBE68206.1 hypothetical protein [Planctomycetaceae bacterium]
Length=282

 Score = 42.8 bits (97),  Expect = 0.23, Method: Composition-based stats.
 Identities = 21/225 (9%), Positives = 56/225 (25%), Gaps = 3/225 (1%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
            +CP C A    PSS +       RCP+C +T   + +         + +  P        
Sbjct  45   QCPACSATLKIPSSAV---GKQVRCPKCSETFAVNASPQLEISPRTSPSPQPSFNPPETT  101

Query  65   PSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI  124
             +         +            P+    +      + +     +     ++       
Sbjct  102  QTTFEPFGDDNLFGSLPGDLNNQAPDNFGTSGEFQAPASTGYQPYAPPPKRKKKKRRTNK  161

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
                 +     +     +  A             ++   V      +    G   + +  
Sbjct  162  GWQAWLTETDLLLKLFGIVGAVSFGLSAIPVLGVVVFILVLVSYAVVQMAGGIWLVVVAF  221

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC  229
             +  +   + + +     F L+       V     ++ +  L+  
Sbjct  222  QEEPVQGILYILVPFYALFYLITRWDTCRVPFMMCIVSVFSLICS  266


>MBC7765611.1 DUF975 family protein [Hyphomonadaceae bacterium]
Length=320

 Score = 42.8 bits (97),  Expect = 0.24, Method: Composition-based stats.
 Identities = 26/240 (11%), Positives = 61/240 (25%), Gaps = 33/240 (14%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             +L      I           +    T     +       +   +   +  L       F
Sbjct  43   FMLFAIFAVIASVMFSSQFGEMSSILTTGLGASAIAIGIAIYVGLILCVYILFMGVKKYF  102

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLL-------LILLIL----------------VVGG  216
              + + +      +            +       ++L ++                 +  
Sbjct  103  NELARGNEANLALLFWHFTRFWKTAWVLVKVALAMLLYMIPYVVFLLLIRQYGDNPWLYV  162

Query  217  GSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLL----  271
            G     I  +   + + F  +VLAD  +      ++ S  L+ G+   + G  + +    
Sbjct  163  GLFFSYIGLMYASLRYAFVPFVLADYSDQTAKTIIKISTRLMKGNMLRLIGLGLTICWSV  222

Query  272  -----LVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPP  326
                 +     +      +  V   A     L L P+ +    ++Y DL A         
Sbjct  223  LAVVAVAAMSIIYPPFGVLVIVCIVAFYIALLWLLPYLYTAMAVMYQDLLAQVNHQHAQM  282


>MBA3535518.1 hypothetical protein [Tatlockia sp.]
Length=516

 Score = 43.2 bits (98),  Expect = 0.24, Method: Composition-based stats.
 Identities = 17/208 (8%), Positives = 58/208 (28%), Gaps = 29/208 (14%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            + L   L F  +   ++      L          + +     +    ++++      +  
Sbjct  299  WHLVYGLKFLFLLIWIVPFLTISLIGSQYVVSEPLRIVFSYGLFFLGTFLSMLGLRRVIN  358

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL------------------  226
              + +       ++      +  +++ +    G+ LL    +                  
Sbjct  359  LPLHIKTICSQYIKKFFHTLVFFVIIYIAAYSGAFLLSYILIKSDLPMIYIVFEELIRSA  418

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP  286
            L    F F   ++       LQA+  + L++  +  +I   ++           + + + 
Sbjct  419  LISFIFAFVLPLILLTESTVLQAIRITFLIIRQNGISIICYWL-----------IMSILL  467

Query  287  YVGEAANLAFSLLLTPFSFLYYYLIYSD  314
                       L   P   +   +++ +
Sbjct  468  IFSSLTFGLGLLWSVPMYCIMSGILFRN  495


>HBG95021.1 hypothetical protein [Chromatiaceae bacterium]
Length=278

 Score = 42.8 bits (97),  Expect = 0.24, Method: Composition-based stats.
 Identities = 13/75 (17%), Positives = 29/75 (39%), Gaps = 1/75 (1%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            +    G +   +    + V +     V   +   G  AL +++ L+ GH        V+ 
Sbjct  45   IGGLLGCIAAGVIC-RYLVEWALTIPVCLYEGETGRSALRRAQELIRGHRLRALALLVIN  103

Query  272  LVISLTLSFLTARIP  286
            L+++L +S     + 
Sbjct  104  LMLALVISAAVLWLS  118


>BAG04142.1 hypothetical protein MAE_43200 [Microcystis aeruginosa NIES-843]
Length=185

 Score = 41.7 bits (94),  Expect = 0.24, Method: Composition-based stats.
 Identities = 23/98 (23%), Positives = 42/98 (43%), Gaps = 4/98 (4%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVW-FFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
            L+ IL +L++   S  L +  L            ++ +DN+G  QA+++   L     W 
Sbjct  25   LIPILGVLLILLASFCLPVFFLWLSARICIAELPLVIEDNLGSWQAIKRGWHLTKNSAWR  84

Query  264  IFGRFVLLLVISL---TLSFLTARIPYVGEAANLAFSL  298
            I G   +  ++ +    LS   A IP+    ++   SL
Sbjct  85   IVGVIFIASLLIIPVYILSVALASIPFWVNFSSFYSSL  122


>MSS74868.1 hypothetical protein [Candidatus Pacearchaeota archaeon]
Length=306

 Score = 42.8 bits (97),  Expect = 0.24, Method: Composition-based stats.
 Identities = 26/201 (13%), Positives = 57/201 (28%), Gaps = 14/201 (7%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            + GI+ + + L       ALL+             + A +     +  L +  +  ++  
Sbjct  103  ISGIFAVIVGLVGLWGTFALLVGFFGTFGNARALLREARIQYLSLFGTLLVFALLIALGF  162

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +      L   + + ++          L +++   G + ++   L   +   F  Y   
Sbjct  163  VVLFFGGVLLGLLLMSIQAFNE-----PLSLVIGALGLVGIVWLCLYGFLGIGFTLYEAL  217

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI---------PYVGEA  291
                 G  A+  S   + G    +FG   LL +           +            G  
Sbjct  218  ARKTRGFVAVRGSWNRMRGKRLVVFGYSALLCLSLFVCFLPLLLVKMLFDFGTSALFGTF  277

Query  292  ANLAFSLLLTPFSFLYYYLIY  312
             +    L     +    Y  Y
Sbjct  278  FDGIIFLGYMFIAVPVMYFFY  298


>KPA13535.1 chemotaxis protein CheY [Candidatus Magnetomorum sp. HK-1]
Length=863

 Score = 43.6 bits (99),  Expect = 0.25, Method: Composition-based stats.
 Identities = 18/283 (6%), Positives = 64/283 (23%), Gaps = 27/283 (10%)

Query  42   ESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLR  101
               R                       + +             +        +    G+ 
Sbjct  140  MPFRWYLFKLFLQAEKEYTFLMRFQTHMPMTFPISIKTVEKLMYETSKINFVKGCFYGIL  199

Query  102  SISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILL  161
             +  +         +       +  +  +      F     +           +      
Sbjct  200  FLVVIYHLYMFGILKEKKHFYFLAFIVTLFFIRFSFDGFTRQIFFSEYHLVHLFSAYYFP  259

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +  +         S  +   +  +  +  + +         L+L+  +  +     + 
Sbjct  260  VLIPVLFFFGILFVFSFQVSKQRPKIFKYTFLIIKTIWFTFALLVLLDGVSFIVPFFPVF  319

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
             I  L+  +++FF  +                     G+  + +       +++   SF 
Sbjct  320  AILTLISFIFYFFFLWKN-------------------GYSPSNYYIIGWFFLVAGFFSFT  360

Query  282  TARIPYVGEA--------ANLAFSLLLTPFSFLYYYLIYSDLK  316
              R+ ++             +   ++   FSF+Y   +    +
Sbjct  361  LLRLNFIPSNILTEMSMEICIVVMIICFQFSFIYQTHLIRKQQ  403


>HAE64104.1 thioredoxin TrxC [Acinetobacter johnsonii]
Length=59

 Score = 39.0 bits (87),  Expect = 0.25, Method: Composition-based stats.
 Identities = 11/30 (37%), Positives = 14/30 (47%), Gaps = 1/30 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCP  30
           M  + CP C A+   P  KL A  S  +C 
Sbjct  1   MI-IVCPTCLAKNRVPEEKLSANPSCGQCH  29


>HEJ47611.1 hypothetical protein [Gemmata sp.]
Length=223

 Score = 42.1 bits (95),  Expect = 0.25, Method: Composition-based stats.
 Identities = 24/212 (11%), Positives = 47/212 (22%), Gaps = 0/212 (0%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
             V+CP+C A        +   K   RC    Q      A S+   + D   T P      
Sbjct  5    LVQCPNCQARIKVSERLIGKTKPCPRCQTVLQFPADMAAVSRSELSEDAGTTPPVEAPPA  64

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
                      S  V     + S     E E+      +            L         
Sbjct  65   GPSKAVGNYASPPVAPSYPSSSEPAFVEPEWIEEPLPVPQAEYSAKAMPSLPRLPTKRKY  124

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             + ++            L L          +       L +          +  +     
Sbjct  125  PVLVIMGYCFKVLACITLGLFILALFFGFIKYIIADNPLESAMVWAWLRVMLISTPIAIF  184

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
                +     +   + H       + + ++ +
Sbjct  185  VCMLIWTVGELMFMIIHFEENVRAIGIGVIEL  216


>MBI3205911.1 hypothetical protein [Myxococcales bacterium]
Length=216

 Score = 42.1 bits (95),  Expect = 0.25, Method: Composition-based stats.
 Identities = 20/169 (12%), Positives = 49/169 (29%), Gaps = 4/169 (2%)

Query  144  PATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSF  203
             A  L     +   +      +  L      +    +   +    +F ++ L L  +   
Sbjct  8    LAFRLVAAAWSALLSGGTVIGSIELAAGGRPSLGALVRGIRFSPAVFLALALQLVPLQVL  67

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
            +  +    +       +     L+  V       ++ D  +G +QA   S  +  G    
Sbjct  68   SRAMDTSSVQQALALAVGAPVVLVLAVRAVAWIPMIVDRRLGPMQAFRASWEVTRGSS--  125

Query  264  IFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
              GR V++  + +  +     +            L + P   +    +Y
Sbjct  126  --GRIVVVWAVLVAFATSAGALALGNPITVHVVGLAVGPVFSVVLAELY  172


>WP_124100516.1 DUF975 family protein [Ruminococcus sp. Marseille-P6503]
Length=423

 Score = 43.2 bits (98),  Expect = 0.25, Method: Composition-based stats.
 Identities = 16/180 (9%), Positives = 44/180 (24%), Gaps = 15/180 (8%)

Query  145  ATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFT  204
                    +     I         + ++ +                 +    +  +    
Sbjct  188  LFSRRYTWEIVVTCISAFFSMLFSIFVANVLIVGEKRFFLESRTYHGTKIGRMGFLYKDR  247

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWF---FFCQYVLADDNIGG--LQALEKSRLLVSG  259
            +   +  ++V     +L    ++        +     +  +N       A   SR ++ G
Sbjct  248  IYKPVKTMLVKDVFTVLWTLTIIGAFIKPFEYMMIPYILAENPSIDTKHAFRLSRQMMKG  307

Query  260  HWWAIFGRFVLLLVISLTLSFLTARIPYVG----------EAANLAFSLLLTPFSFLYYY  309
            + W     ++  +   +  S   + I  V              N+   +L   F   Y  
Sbjct  308  NKWKAAKLWLSYVPWYIAASVPASLIGLVFLGNTSGIHANVVINVFVGILCLMFLNPYKT  367


>WP_145200508.1 hypothetical protein [Thalassoglobus polymorphus]QDT33729.1 hypothetical 
protein Mal48_29840 [Thalassoglobus polymorphus]
Length=380

 Score = 42.8 bits (97),  Expect = 0.25, Method: Composition-based stats.
 Identities = 31/283 (11%), Positives = 78/283 (28%), Gaps = 3/283 (1%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             T+ CP CG E     +K      +  CP C ++++  PA+ ++ +    +         
Sbjct  12   ITLECPKCGRELR---TKEKNTGRTVPCPGCSESVLIPPAKRRKKKKKRRVNEDASDTTS  68

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                +   E  +                      +          ++         G   
Sbjct  69   SSEFAPWYETSATNAEKNLGANRKRTMQNLPAGWNYVRFGLSLISISTMAVCLITLGIAA  128

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
              + L+ +++A      A +L     L          ++   V+ + + +      +   
Sbjct  129  ATLVLICLLIAAIFEIEAGMLAGTVCLMLTGIVGLKLVMDEVVSLLHVLICAGYILLLFV  188

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +          + +G   +    LL+     +       L    +     F       + 
Sbjct  189  MTIIAPQFVALLFMGTIPLAGLGLLVGWCCCLAAPNENCLRYFMIGALSTFLGAVICYSI  248

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
            +      A+     +  G  + +     L+L++   L F+   
Sbjct  249  NVQTVFSAIINGGNIRQGGLFMVLSISALVLMLLSHLLFVLFL  291


>MAJ57073.1 hypothetical protein [Candidatus Pelagibacter sp.]OUW11741.1 
hypothetical protein CBD26_01500 [Candidatus Pelagibacter sp. 
TMED166]
Length=69

 Score = 39.0 bits (87),  Expect = 0.26, Method: Composition-based stats.
 Identities = 9/29 (31%), Positives = 12/29 (41%), Gaps = 0/29 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPEC  32
           V C HCG   +     +P      +C EC
Sbjct  3   VECHHCGIVYDVNDRDIPLSGREVQCNEC  31


>PID34750.1 hypothetical protein CR971_01640 [candidate division SR1 bacterium]
Length=288

 Score = 42.4 bits (96),  Expect = 0.26, Method: Composition-based stats.
 Identities = 24/231 (10%), Positives = 60/231 (26%), Gaps = 6/231 (3%)

Query  88   QPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATW  147
                      +    +  L       +      LL       ++     F   +L     
Sbjct  24   WITTIGHTISNLFLLLWNLNNILIYKYHSGVSILLVFNYFWNIIKEYNFFVWAILLIVFI  83

Query  148  LNPQNQNWQWAILLATVAY-----ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGS  202
            +      +  A                 L       FI+     +    S+      +  
Sbjct  84   ILGYFVFFPIAYGSVLEIIAHKTRFSKALGVGLSKFFIFFEYNTLITSFSIITFFITIAR  143

Query  203  FTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
               L +L    +     L  I  +L  ++  F + ++A +      A++KS  L   ++ 
Sbjct  144  LFTLNVLENFFIISLLSLWGILVILLKLFLPFAKILIAIEGYDVYPAIKKSMSLAISNFG  203

Query  263  AIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYS  313
             +    +  + + +   F    +  +         +    F+  +  L+  
Sbjct  204  KVIKATIYQIFLFIFFYFRIGILFLIPTIVMYII-IYFQWFTNGFGSLLLW  253


>NIR13333.1 hypothetical protein [Desulfobacterales bacterium]
Length=89

 Score = 39.8 bits (89),  Expect = 0.26, Method: Composition-based stats.
 Identities = 11/35 (31%), Positives = 16/35 (46%), Gaps = 0/35 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
             + CP C   ++ P  K+PA    A CP+C    
Sbjct  3   VEIVCPKCNFSKSIPKQKIPAGARWATCPQCKHRF  37


>NJL30993.1 hypothetical protein [Phycisphaerales bacterium]
Length=168

 Score = 41.3 bits (93),  Expect = 0.26, Method: Composition-based stats.
 Identities = 15/119 (13%), Positives = 32/119 (27%), Gaps = 0/119 (0%)

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
                  T  + + +      +         +         LL  V+    +L+ +  + +
Sbjct  25   WIWLVGTPLLPLVLAMGMGLVLSLAGAFFFNWPVLDAAGALLFGVMLLVGVLITLLLVGW  84

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY  287
             +        LA +      A+ +    V G  W      VL L+         A +  
Sbjct  85   VLSLSLFYPALAVEGTDAFDAVSRCYNYVLGRPWRWLFYNVLALLYGAVCYIFVATVGV  143


>WP_051951045.1 hypothetical protein [Streptomyces yeochonensis]
Length=116

 Score = 40.5 bits (91),  Expect = 0.27, Method: Composition-based stats.
 Identities = 14/61 (23%), Positives = 23/61 (38%), Gaps = 0/61 (0%)

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
               G+++     +   V        L  +      AL++S  LV G WW + G  +L L 
Sbjct  32   TVLGAIVGSACVIWLWVLLSLAPPALVLERQKVFAALKRSAKLVRGAWWRVLGIQILALS  91

Query  274  I  274
             
Sbjct  92   W  92


>WP_195396648.1 DUF975 family protein [[Ruminococcus] gnavus]
Length=522

 Score = 43.2 bits (98),  Expect = 0.27, Method: Composition-based stats.
 Identities = 23/152 (15%), Positives = 46/152 (30%), Gaps = 1/152 (1%)

Query  140  LLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH  199
                P +              L TV    + L                 + +   L   +
Sbjct  115  AFHDPFSTSFFLLLLGVLLSFLYTVFIQNVLLVGEARFFLEARTYQQTTISKLFFLYKVN  174

Query  200  VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL-QALEKSRLLVS  258
              S    ++    V      L +I G++    +    Y+LA++   G   A   SR L+ 
Sbjct  175  FFSHPAWIMTCRCVFQTFWNLTVIGGIIKKYEYSMIPYILAENPTMGRKDAFFLSRQLMR  234

Query  259  GHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            G+ W +F   +  +  S+        + ++  
Sbjct  235  GNKWRMFLLHLSFIGWSILSLLTFGILDFLFV  266


>GAA52080.1 ATP-binding cassette subfamily A (ABC1) member 5 [Clonorchis 
sinensis]
Length=1753

 Score = 43.6 bits (99),  Expect = 0.27, Method: Composition-based stats.
 Identities = 20/306 (7%), Positives = 60/306 (20%), Gaps = 8/306 (3%)

Query  41    AESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGL  100
              +                G                              +          
Sbjct  995   PKEPYFVKLPFWRELSPTGYSEDAIHPLYNYSRFGSTRVDIAAWHNNSIKITLILGRQDH  1054

Query  101   RSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAIL  160
                 Q++   + L+       L    +         +   +    +     +   +    
Sbjct  1055  DLTQQIILQHFALWNCLHTRNLRQQGILRTDDPTIPWVGDMTSWESRWPETHPQLKLGTA  1114

Query  161   LATVAYILLGLSWMT-GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
                +    +    +        + + ++ +   + +      ++        +       
Sbjct  1115  CIYLIMCSVANCILNPLIAGDIVREKELRILTQLHMYGMKHWAYWGAHFFTHIFQYLMLA  1174

Query  220   LLIIPGLLFC--VWFFFCQYVLADDNIGGLQAL-----EKSRLLVSGHWWAIFGRFVLLL  272
                   +         + Q +   + I  L AL          L   +       F   +
Sbjct  1175  CFTTIVMFPFEDHILSYWQAIFVHNWINVLAALDNILLNYFCCLFFQNSGGATVLFGSTI  1234

Query  273   VISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWL  332
             +I + ++     IP     A     L + PF+  +  L  +D +                
Sbjct  1235  LIVIFVNITIFFIPLSMLEAAYGCILFIFPFADPFLTLFLTDFRVRLEKVYLFGTTNAIF  1294

Query  333   PLTAAI  338
                   
Sbjct  1295  MPPLWR  1300


>MBC8122155.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Gemmatimonadaceae bacterium]
Length=200

 Score = 41.7 bits (94),  Expect = 0.28, Method: Composition-based stats.
 Identities = 28/153 (18%), Positives = 49/153 (32%), Gaps = 0/153 (0%)

Query  144  PATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSF  203
               +                       L  MT ++ +       G   ++  G       
Sbjct  12   SLAFAYGVLTGQTTMGTAFVTGVRTWPLLVMTSALTVLGAMVPTGWLFALAYGHFAQAFN  71

Query  204  TLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWA  263
                 L+I +     L L +PG+   V  F    +L  +      A+ +S +L+ G WW+
Sbjct  72   GGEGFLMIALPFAAMLALAVPGIYIGVRLFAAVPILFVERHTPFSAIRRSWILLKGQWWS  131

Query  264  IFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
                FV L  + L L  L + +      A+L  
Sbjct  132  TALAFVPLGAMLLILGALLSLVLVGSPFADLLL  164


>WP_051013188.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Lactococcus lactis]QJD18551.1 hypothetical 
protein HG420_01425 [Lactococcus lactis subsp. cremoris]
Length=107

 Score = 40.1 bits (90),  Expect = 0.28, Method: Composition-based stats.
 Identities = 14/105 (13%), Positives = 28/105 (27%), Gaps = 2/105 (2%)

Query  199  HVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVS  258
             +  FTL   L         +L                 ++         A+++S     
Sbjct  5    KIPVFTLEFFLKSWQNMLILVLFYAITFWISTRLILTLPLMILKGQPLKLAIKESLKRTK  64

Query  259  GHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
            G     F R  +   +    S +   + ++G        + L  F
Sbjct  65   G--VRKFFRLSVYFGLIGLFSIIMQGLLFMGATLRKIILIKLPLF  107


>WP_129487629.1 hypothetical protein [Fusibacter sp. A1]NPE21708.1 hypothetical 
protein [Fusibacter sp. A1]RXV61283.1 hypothetical protein 
DWB64_07675 [Fusibacter sp. A1]
Length=329

 Score = 42.8 bits (97),  Expect = 0.28, Method: Composition-based stats.
 Identities = 24/146 (16%), Positives = 56/146 (38%), Gaps = 0/146 (0%)

Query  152  NQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLI  211
             ++    I L TV  + L +     +  +        +   +++ +          ++  
Sbjct  110  FKHTFKTIGLVTVMDLPLIVVLWVFAEQLISGNLIEEISLWVEVFIASFNDTFFGTVVFA  169

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
            + +  G LLL +    + VWF F  Y +  +N     A+++S  LV   + +I+   +L+
Sbjct  170  IALTLGILLLNLLMAAYLVWFTFAIYEMCLENQSIKGAIKRSFKLVKVRYKSIYLITILI  229

Query  272  LVISLTLSFLTARIPYVGEAANLAFS  297
            L+  +T+ +    I       +    
Sbjct  230  LLTQMTIQYGINGIGQGANLISQFAG  255


>CDE10768.1 putative uncharacterized protein [Clostridium sp. CAG:354]
Length=383

 Score = 42.8 bits (97),  Expect = 0.28, Method: Composition-based stats.
 Identities = 15/165 (9%), Positives = 50/165 (30%), Gaps = 1/165 (1%)

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
              +       +  + +   ++ +  ++ +    S             L+ L+ + +    
Sbjct  217  WSIVINIVNSIFAASLVYLLYRFFLESSLEKASSCLSHKFGKVFLAGLIALIAIPILSII  276

Query  219  LLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL  278
            L+    G    +        +   +     ++   R+  +  +  +   ++L ++I++ +
Sbjct  277  LIAAQIGFSAGLLLLMLYVFMIGISPYIFASILARRIYNNKKFSKMPLEYLLAILIAVVI  336

Query  279  SFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQ  323
                  IP+VG    +  S+L         +      K       
Sbjct  337  G-ALKMIPFVGIVVTIFISILGFGIIIYNLFFNGHKHKEVKGEEN  380


>XP_023240341.1 nose resistant to fluoxetine protein 6-like [Centruroides sculpturatus]
Length=1017

 Score = 43.2 bits (98),  Expect = 0.29, Method: Composition-based stats.
 Identities = 28/312 (9%), Positives = 80/312 (26%), Gaps = 16/312 (5%)

Query  70   EIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGI  129
             +  +          F          +                      +  +   +LG 
Sbjct  564  PLSIQEKYSEPQESFFIQFILCFSVYTNGAKVFNIGNTGAQLNCLHGIRFFSMSWIILGH  623

Query  130  VLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGL  189
               +A +    ++     +N  +         +   + L+    ++        +    +
Sbjct  624  TYIYAVMSVGNIVDVLEDINSFSFQTISQATFSVDTFFLISGFLLSYLFLKEHFEKQTNV  683

Query  190  FRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV--------WFFFCQYVLAD  241
                    R      +  +LL         L   P              W++   Y+L  
Sbjct  684  NWIYYYVHRIWRLTPVFAMLLGFYSTLWLHLGSGPSWPNETDNGNCKKNWWWNILYILNF  743

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI--SLTLSFLTARIPYVGEAANLAFSL-  298
            ++   +  +  +  L +   + I     LLL+    +  + +   +  +        S+ 
Sbjct  744  EDSNDM-CMGWTWYLANDMQFYIISPIFLLLLWKLPIVGTIVVGIVILISWIVAGVLSIE  802

Query  299  ----LLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSR  354
                 +  F+         D++A           + +  +   + G +    LL+    R
Sbjct  803  YDLSSVLTFNQDLKVDQVMDMQAKKDQYFDIIYDKPYCRIGPYMVGILTGYVLLVAFNHR  862

Query  355  QNLSAEQLLSAG  366
            + + +  L+   
Sbjct  863  KFMKSAFLVIGW  874


>WP_147005555.1 hypothetical protein [Leptotrichia hongkongensis]BBM59505.1 hypothetical 
protein JMUB5056_1089 [Leptotrichia hongkongensis]
Length=283

 Score = 42.4 bits (96),  Expect = 0.29, Method: Composition-based stats.
 Identities = 16/178 (9%), Positives = 57/178 (32%), Gaps = 20/178 (11%)

Query  155  WQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVV  214
              + +    V  I+  +S +         +     F   +  ++ +    + ++L + ++
Sbjct  95   MLFILCFIIVYLIIRFISAVIRKKMGLEVEERENEFSIGETIVKFLTITFVNIVLQVFLI  154

Query  215  GGGSLL---------LIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
              G ++          +I  ++  +   + +       I  ++A + +  +  G+   + 
Sbjct  155  ILGIIITVFTRSTLSFLILWIILQLNLLYFEEAYYLRKINIIEAFKYNFYISKGNRIRML  214

Query  266  GRFVLLLVISLTLSFLTAR-----------IPYVGEAANLAFSLLLTPFSFLYYYLIY  312
               + +  + + +++               +  V         LL T    +   L+Y
Sbjct  215  TVLLTIFTVFVGINYFLGWLFEATIKNQVTLMTVNAVFGGIAGLLYTVLGTMVSNLVY  272


>OGS02707.1 hypothetical protein A2278_05740 [Elusimicrobia bacterium RIFOXYA12_FULL_49_49]OGS07246.1 
hypothetical protein A2204_04890 
[Elusimicrobia bacterium RIFOXYA1_FULL_47_7]OGS09378.1 hypothetical 
protein A2386_04140 [Elusimicrobia bacterium RIFOXYB1_FULL_48_9]OGS15410.1 
hypothetical protein A2251_07570 [Elusimicrobia 
bacterium RIFOXYA2_FULL_47_53]OGS26250.1 hypothetical 
protein A2339_01490 [Elusimicrobia bacterium RIFOXYB12_FULL_50_12]OGS30838.1 
hypothetical protein A2323_00710 [Elusimicrobia 
bacterium RIFOXYB2_FULL_46_23]HBU69124.1 hypothetical 
protein [Elusimicrobia bacterium]
Length=278

 Score = 42.4 bits (96),  Expect = 0.29, Method: Composition-based stats.
 Identities = 25/213 (12%), Positives = 58/213 (27%), Gaps = 27/213 (13%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
             +    L     L      + ++       + L+                +   F     
Sbjct  65   FWGDRFLHYPFNLILFPTLFNYSKNAIDFTFGLIMSGITISMTAQLFQSGEPSWFFGFGK  124

Query  196  GLRHVGSFTLLLILLILVVGGG------------------SLLLIIPGLLFCVWFFFCQY  237
             ++       L + + +++                      LL+ + G+L    F F   
Sbjct  125  SVKRYFRMLGLWVAVFVLIYAFSRTSYLLLEYVTTSVKAVVLLMFLFGVLIQALFAFGIP  184

Query  238  VLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL---------TARIPYV  288
             +  +N    Q+ +++  L   +   +    +   +I L   F+            I  +
Sbjct  185  AVIVENRKLPQSFKRAVALFKNYPVKLLLVVLGPNLIVLPFLFVNMRGMMEKSFPEISIL  244

Query  289  GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
              AA + F LL   F  +   + +   K     
Sbjct  245  FVAAKIFFILLADIFYTVSVTVFFMKNKEIEGR  277


>PSP59313.1 hypothetical protein BRC72_00190 [Halobacteriales archaeon QH_7_66_36]
Length=262

 Score = 42.4 bits (96),  Expect = 0.29, Method: Composition-based stats.
 Identities = 25/109 (23%), Positives = 39/109 (36%), Gaps = 5/109 (5%)

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL---TLSF  280
            PG+ F V   F    +A D+ G  +AL  +  L SG    +     LL+V+ L    +  
Sbjct  141  PGVFFAVTLLFAHPAVAIDDAGAREALATAWSLASGRRLDVAAIVSLLVVLYLTPRLVGS  200

Query  281  LTARIP--YVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPI  327
            +    P   +G A     SLL +      Y     +        +  P 
Sbjct  201  VVTGAPGLLLGGAVTGIGSLLSSGVVGRAYLAAKEEESREQTAAEEDPY  249


>XP_030470570.1 uncharacterized protein LOC115688784 [Syzygium oleosum]
Length=272

 Score = 42.4 bits (96),  Expect = 0.29, Method: Composition-based stats.
 Identities = 20/168 (12%), Positives = 51/168 (30%), Gaps = 2/168 (1%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            +       A+  I   LLL P + +                  I+  L  +   + I   
Sbjct  75   LAFWVFAAAYYSISMVLLLFPISVVVFTVAWIYTHPQKIGFKIIMGVLPKLWKKLNITFI  134

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
               V +   + +    + +  ++   ++         L    ++    +     V   ++
Sbjct  135  WASVIILAYLLVAWSTLNARVIIENAILFAALSIPYALGFMYIIM--VWNMANAVSVLED  192

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
            + G++A+ K   LV  +   +   F + +   + L         +G  
Sbjct  193  VYGIKAIMKGNALVKHNIGTVVWVFSVYVYFYMVLQLPIQLFVVLGNV  240


>NOZ24287.1 hypothetical protein [Planctomycetes bacterium]
Length=339

 Score = 42.8 bits (97),  Expect = 0.29, Method: Composition-based stats.
 Identities = 28/284 (10%), Positives = 62/284 (22%), Gaps = 22/284 (8%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  V+CP C             +   A   +    +      +         +       
Sbjct  27   MIQVQCPQCQ------------QGVMASEEQAGTLIPCRHCGAWVRVPGTAHSGAESTES  74

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                    + I++  +             + +  A   GL           E        
Sbjct  75   PCPFCGQSILIETGDIGMPISCPHCRHMIQVQRHAPSGGLILQPGEQETHREPIPWENRR  134

Query  121  LLGIYL-LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             LG +  L               +        +      I+        +       +  
Sbjct  135  ALGFFRALWQTFKTVLFSPTQFYRRMRPYGLGDAILYLVIIGLFAVVGAVLQQPFWQTTM  194

Query  180  IYICKTDVGLFRSMKLGLRH-VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
              I              L   +  F   + + ++ +     + I   ++  V        
Sbjct  195  QKIVSVPTPPGAPTSTHLPMALSVFVYAIAIPLIPLIQVLAMFIASAMVHVVLMVLGGVN  254

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
               +N          R+L  G+   +FG   +   +   + FL 
Sbjct  255  QTYEN--------TCRVLCYGNSANVFGLIPVCGSLIAPIWFLV  290


>WP_145370413.1 hypothetical protein [Maioricimonas rarisocia]QDU39207.1 hypothetical 
protein Mal4_35440 [Maioricimonas rarisocia]
Length=193

 Score = 41.7 bits (94),  Expect = 0.30, Method: Composition-based stats.
 Identities = 22/186 (12%), Positives = 41/186 (22%), Gaps = 3/186 (2%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M TVRCP C AE   P S +       RC       +F                      
Sbjct  1    MITVRCPSCQAEVRGPESIV---GKRVRCKRDDCRNVFVFEPPPPPPPAPVDDPWEREAS  57

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
                 S   E           +     + ++    S      +++ L             
Sbjct  58   ALASSSYEDEFDDDESFDELPSPLPKRKKKKTAETSEQRYPLLNKYLEWCRSFAQIVLVL  117

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             L    + ++   A +  +         +                 + + L      + +
Sbjct  118  YLIGAAITVLGGIAVLLFSDAPSDTRLASVAFAICVGTGAAIVGYIVYVVLMASIQFVHV  177

Query  181  YICKTD  186
             I    
Sbjct  178  IIDIEK  183


>WP_194891411.1 hypothetical protein [Catenulispora pinisilvae]
Length=691

 Score = 43.2 bits (98),  Expect = 0.30, Method: Composition-based stats.
 Identities = 20/142 (14%), Positives = 36/142 (25%), Gaps = 33/142 (23%)

Query  213  VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
                  +      L      F+   V   + +    A+ +S  L  G WW   G  +L  
Sbjct  165  FGLLLLIPGAGVALYVGGRLFYAAPVTVLEGMPPKAAMRRSWHLDRGVWWRTLGIAMLPG  224

Query  273  VI----------------------SLTLSF-----------LTARIPYVGEAANLAFSLL  299
            ++                          S            + A    +   A +  ++L
Sbjct  225  LVGRAATEVITGGAGNVAARDLPAGFVASMQGGQAPHLSVAVLAIPVTIILIAAVVAAIL  284

Query  300  LTPFSFLYYYLIYSDLKANYRG  321
              P + L   L+Y D       
Sbjct  285  RAPLAPLSQGLLYIDRCIRTER  306


>NLW73749.1 hypothetical protein [Clostridiales bacterium]
Length=265

 Score = 42.4 bits (96),  Expect = 0.31, Method: Composition-based stats.
 Identities = 23/184 (13%), Positives = 51/184 (28%), Gaps = 4/184 (2%)

Query  109  DSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYIL  168
                 + R       IY + +++ +  IF  +L      L   ++         T  +  
Sbjct  60   MPALPWLRDLPYYYNIYDISVIIDYILIFLFVLPTALGALRMAHRMCSGERAGLTDIFYP  119

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLL----LILLILVVGGGSLLLIIP  224
                  T  + + I    + +    +     + +          +L   V   ++ L   
Sbjct  120  YRRLPRTWVVSLIIFLPFLIIAGIWRATPALLDNLKGAALFGGAILRFFVVLAAISLGAG  179

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
             L   + +FF   +   +++   QAL  S     G    I          +         
Sbjct  180  ALYLALRWFFFPGLAMREDMTVRQALFASYSASRGRMCEIIKFIFSFWGWAALSLVSFGV  239

Query  285  IPYV  288
            I  +
Sbjct  240  IFII  243


>PKK94162.1 hypothetical protein CVV61_00790, partial [Tenericutes bacterium 
HGW-Tenericutes-6]
Length=131

 Score = 40.5 bits (91),  Expect = 0.31, Method: Composition-based stats.
 Identities = 19/99 (19%), Positives = 37/99 (37%), Gaps = 8/99 (8%)

Query  219  LLLIIPGLLFCVWFFFCQYV-LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
            LL IIPGL+    +    Y+   D ++   +A+  S  L  G+   +F   +  + I + 
Sbjct  1    LLFIIPGLIKAYAYSMWLYLLDKDPSLLANEAITLSNKLTKGYKLRLFLMDLYFVFIYVA  60

Query  278  LSFLTARI-------PYVGEAANLAFSLLLTPFSFLYYY  309
            L      +       P+      +   ++   F +  Y 
Sbjct  61   LMIFFYILFRSSSMNPFFFVLIFILLLVVFIGFIYPKYM  99


>WP_171815915.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Candidatus Phaeomarinobacter ectocarpi]CDO60852.1 
hypothetical protein BN1012_Phect2639 [Candidatus Phaeomarinobacter 
ectocarpi]
Length=282

 Score = 42.4 bits (96),  Expect = 0.31, Method: Composition-based stats.
 Identities = 30/240 (13%), Positives = 70/240 (29%), Gaps = 18/240 (8%)

Query  76   VNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAP  135
            +  +     F +       A    + ++    A         G  ++ +    I +  A 
Sbjct  16   MPWQHRGTFFKIMWPWLLAAGVVYVVAVVISFAAMAGATPPTGVFVVSMIAGVIAMVLAS  75

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
                  L+       Q   +        +AY+++   ++            +    +   
Sbjct  76   PAIVGALRYVIADEAQRHGFGERTKRFLLAYLVMLAVYLPVMGLYMFSGPQIAPDGTFGF  135

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
                +    +++ L+IL +                   F    + DD    +    ++  
Sbjct  136  ARPGLAIAVIVITLIILPIVM--------------RLGFVFPAIVDDE--PID-FARAWG  178

Query  256  LVSGHWWAI-FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
               G+ W I FG  +L + I+   SFL   +       +   + ++  FS L   L+   
Sbjct  179  RTKGNTWRILFGYLILSMGITFVGSFLVGFLGAFLTVFSGPLAPVVAIFSMLLNVLLMIY  238


>OQY55980.1 hypothetical protein B6245_18975 [Desulfobacteraceae bacterium 
4572_88]
Length=1587

 Score = 43.2 bits (98),  Expect = 0.31, Method: Composition-based stats.
 Identities = 9/44 (20%), Positives = 17/44 (39%), Gaps = 0/44 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRT  46
            ++C +C A  N   + +    S  RC +C       P  + + 
Sbjct  2   KLKCENCQAAYNVDENAIRPDGSKVRCLKCRYVFTIYPPSAPKQ  45


>MBI5816578.1 zinc-ribbon domain-containing protein [Nitrospinae bacterium]
Length=871

 Score = 43.2 bits (98),  Expect = 0.32, Method: Composition-based stats.
 Identities = 9/70 (13%), Positives = 20/70 (29%), Gaps = 1/70 (1%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
           M  ++CP C +     +S +  +   A+C +C           +               L
Sbjct  1   MI-IKCPRCASRYKIDASSVSEEGQFAKCAKCENVFFIRKRPDEEIARLKERKEGKKAPL  59

Query  61  QRRIPSDRLE  70
                 ++  
Sbjct  60  PTPQVEEKRP  69


>MBI2344543.1 hypothetical protein [Candidatus Dependentiae bacterium]
Length=315

 Score = 42.4 bits (96),  Expect = 0.32, Method: Composition-based stats.
 Identities = 19/172 (11%), Positives = 48/172 (28%), Gaps = 0/172 (0%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             +L   +L  +     I    +     + N Q            +         +    F
Sbjct  126  YVLIPAILLWMSGVMTISIGYIKTALKFQNEQKAKLHDMYQYIYLLPQYFLGKMIFFLFF  185

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
             ++      +   +   L    +     + + +     +L   +  +       F +Y +
Sbjct  186  SFLVCLIGFVSFFILCLLPLNQNTQNGFVAIGIFFFVIALFAAVLLIYLWQRLRFIKYFI  245

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
             D  +   +A   S  L  G   ++F   ++ L+I++        I      
Sbjct  246  IDQEVSAFKACRLSWNLTKGSVISLFLFSLVTLLITVAHPKAGLLIMLSYWL  297


>PKO16062.1 hypothetical protein CVU37_11960 [candidate division BRC1 bacterium 
HGW-BRC1-1]
Length=264

 Score = 42.1 bits (95),  Expect = 0.32, Method: Composition-based stats.
 Identities = 29/226 (13%), Positives = 52/226 (23%), Gaps = 4/226 (2%)

Query  75   TVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFA  134
                 R    + L            +         +  L      G   +  +       
Sbjct  17   WTYIVRHPSVWPLCVLPLVINVAVVVGVWMWTGGFAERLLGDAFTGTTWLADVLRGFVVL  76

Query  135  PIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMK  194
              F   L                         +   ++  +               R+  
Sbjct  77   LTFLLRLFLLLVAFVVVGSMASAPFNDVLSERVDRAITGWSDEQPFSAKGLMRSFVRTPI  136

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV---WFFFCQYVLADDNIGGLQALE  251
            +  R +  + L+ ++L ++     L          V   +F   Q     +  G   AL 
Sbjct  137  MEFRRLMVYALITVVLFVLSFIPLLAAFTLPAQIGVSAAFFALDQLSYPLERRGIW-ALR  195

Query  252  KSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
            +    V  H  A FG    L +I L        IP       L F+
Sbjct  196  EKFAYVRRHARASFGFGWALTLIFLVPLVNFLFIPVAVVGGTLLFA  241


>PIN94033.1 hypothetical protein COU54_00680 [Candidatus Pacearchaeota archaeon 
CG10_big_fil_rev_8_21_14_0_10_31_24]
Length=202

 Score = 41.7 bits (94),  Expect = 0.32, Method: Composition-based stats.
 Identities = 25/179 (14%), Positives = 57/179 (32%), Gaps = 10/179 (6%)

Query  129  IVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVG  188
             ++       A  L  A + +      Q  I     +     +S    ++  +I    + 
Sbjct  21   QIIWSIVFSLASFLILAIFFSGLIGIIQETIKKKRSSLKTFFISIKNYTLTNFILLVILT  80

Query  189  LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQ  248
            +  S+   +        L I   +      +LL      F ++F F         +  ++
Sbjct  81   ILYSLITLISLYLGGLFLKINDAMGQIAFLILLFGGLAGFMIFFSFANIFCVTHKLDVIK  140

Query  249  ALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL----------TARIPYVGEAANLAFS  297
            +L+ S  LV   + ++    V+  VI+  +S +             +PY+        +
Sbjct  141  SLKSSFNLVKKEYLSVLSISVIFFVINELISLIKEPYAEIIKTLIVLPYLALILVNFIN  199


>MBI2378740.1 hypothetical protein [Deltaproteobacteria bacterium]
Length=110

 Score = 40.1 bits (90),  Expect = 0.32, Method: Composition-based stats.
 Identities = 21/99 (21%), Positives = 36/99 (36%), Gaps = 3/99 (3%)

Query  205  LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWA  263
            +  +L+ +    G L  ++  ++   +  F  + L     IG + A+  S  LV  H   
Sbjct  1    MASVLVAIACMLGLLGFVVGAIVVGFFLMFTFHELVARPGIGAVDAMRGSVALVKAHPSE  60

Query  264  IFGRFVLLLVISLTLSFL--TARIPYVGEAANLAFSLLL  300
             F   V+ +VI    S +         G    L  SL  
Sbjct  61   SFVMLVVSIVIVALTSSVLPGIGGLIGGPFCMLLGSLFY  99


>HGI34077.1 hypothetical protein [Euryarchaeota archaeon]
Length=336

 Score = 42.4 bits (96),  Expect = 0.32, Method: Composition-based stats.
 Identities = 21/243 (9%), Positives = 61/243 (25%), Gaps = 16/243 (7%)

Query  83   RSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLL  142
                ++   ++          +  +     +       ++    L  +     +      
Sbjct  1    MFENVKIAWQYFKESWHYIRTNPKILKIPFVAMGLWLLIVFPLFLVTLFELWFLVITGSS  60

Query  143  KPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGS  202
                            +   T     +    +  +++        G   ++K     +  
Sbjct  61   SIGIIFIVTLIIDAVLLWFFTYLVSYVYTGMIVAAVYASEIGRKEGYLDNLKGNWGALAK  120

Query  203  FTLLLILLI---------------LVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
            F +L +L                 +V   G + + I   ++     F    +  + +   
Sbjct  121  FAVLNMLFTGALSAAGSAMRGLSEVVQLIGKVGISIVAWVYRYLTIFTICAIVIEEMKMS  180

Query  248  QALEKSRLLVSGHWWAI-FGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFL  306
            +AL++S  LV      + FG  +  L+  + +  + A             S+        
Sbjct  181  RALKRSVELVRRKPVVVLFGMLMSDLIGVVVVIVIFASFILFLSLGFFLASIYGENVLIW  240

Query  307  YYY  309
               
Sbjct  241  CIM  243


>OHD17722.1 hypothetical protein A2Y37_03925 [Spirochaetes bacterium GWB1_60_80]OHD28835.1 
hypothetical protein A2004_08365 [Spirochaetes 
bacterium GWC1_61_12]OHD42146.1 hypothetical protein A2087_08130 
[Spirochaetes bacterium GWD1_61_31]OHD45433.1 hypothetical 
protein A2Y35_06405 [Spirochaetes bacterium GWE1_60_18]OHD61525.1 
hypothetical protein A2Y32_09425 [Spirochaetes 
bacterium GWF1_60_12]HAP43253.1 hypothetical protein [Spirochaetaceae 
bacterium]
Length=365

 Score = 42.4 bits (96),  Expect = 0.32, Method: Composition-based stats.
 Identities = 30/360 (8%), Positives = 80/360 (22%), Gaps = 5/360 (1%)

Query  31   ECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPE  90
                  I D   +   +    + +  +  L     + +                  +   
Sbjct  1    MAGHGSIIDQRGADVKRQRYTLGSFLNSTLGFWPRALKKAWLPALACLLPSAVLMGIGFA  60

Query  91   REFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNP  150
            R             +  A            L    L   V   +       L  AT    
Sbjct  61   RMGPMLALFTEPGFEFGAWMIGRIVEPYAWLAAANLAAAVGYLSINIIISNLCFATQDGQ  120

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
                          ++  L    +   + +            +      + +      + 
Sbjct  121  DPVMAPTVRQQLRRSFWPLLGQSILLGLIMGGGLLIATGLIMLLAVGIGLATGVAGGSVA  180

Query  211  ILVVGGGSLLLIIPGLLFCVWFFFC-----QYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
             +++     + +  GL+  +++F          +  + +     + +S  L  G++W IF
Sbjct  181  GVLLMMLLTMTLSLGLVAVLYWFMTRVIIAPQAVIREGVKAWAGIVRSFRLTKGNFWRIF  240

Query  266  GRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHP  325
            G   LL ++      +                      +     L+   + +        
Sbjct  241  GNNFLLQLMLGFAISIVTGPIVFVTVLPGYLKFFSAALNNPQDSLVSLQILSGLFSSMSW  300

Query  326  PIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNR  385
             +    L    A + ++ I   L+ +                +          Q     +
Sbjct  301  GMAFSTLLSGLASWTFLPIFHCLVYTDLAVRHGEVAPDELTSETPPTDAASSIQAEPAPQ  360


>OGS06240.1 hypothetical protein A3J70_03020 [Elusimicrobia bacterium RIFCSPHIGHO2_02_FULL_61_10]
Length=249

 Score = 42.1 bits (95),  Expect = 0.33, Method: Composition-based stats.
 Identities = 17/179 (9%), Positives = 47/179 (26%), Gaps = 7/179 (4%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L     +          +            +        +     +       +    + 
Sbjct  29   LAAWLYIYTCGLLGLEGTYAANTALKSSPTKVLIAYLPGMAVIAWFAAGLAGRLIMDAYK  88

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG-------GGSLLLIIPGLLFCVWFF  233
               ++ +   R           F   L+ L + +           +      +   V   
Sbjct  89   GAPESMLVYARGWYFRKLGWDVFFAALMWLPMPLLASGMPGALLGVAWFFAVIWLGVRVS  148

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
                +   +N+G  ++L++S  L     W +     + ++ +  L++  +RI   G A 
Sbjct  149  LWLNISVTENLGLPESLKRSYALTKDRVWTLLMAGGIPMISANLLNWAISRIISGGPAL  207


>TXI13440.1 hypothetical protein E6Q66_09590 [Pedobacter sp.]
Length=241

 Score = 42.1 bits (95),  Expect = 0.34, Method: Composition-based stats.
 Identities = 15/106 (14%), Positives = 39/106 (37%), Gaps = 0/106 (0%)

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
               ++       I+    +     +LI       V   F    + D + G ++A+ +S +
Sbjct  118  YFFNMPQIDTTDIMTWKHLPLLLGILIPIVAYVSVRMCFAVCFIVDQDSGAIEAIRQSWI  177

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            L  GH+W +   F++++ I++  +             +    ++  
Sbjct  178  LSKGHFWFLLLLFLVIVGINMLGAMALFVGLLFTVPLSSLMMIVTY  223


>HAV92694.1 hypothetical protein [candidate division WOR-3 bacterium]
Length=161

 Score = 40.9 bits (92),  Expect = 0.34, Method: Composition-based stats.
 Identities = 9/63 (14%), Positives = 13/63 (21%), Gaps = 0/63 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP C  +       +P      RCP C               T             +
Sbjct  2   RIECPKCHKKYEVEDMYIPIGGGPVRCPNCKNIFGIYVEPMDIPMTEIIEEEGEAIADAK  61

Query  63  RIP  65
              
Sbjct  62  GPF  64


>NBO93895.1 hypothetical protein [Planctomycetia bacterium]
Length=231

 Score = 41.7 bits (94),  Expect = 0.35, Method: Composition-based stats.
 Identities = 15/218 (7%), Positives = 46/218 (21%), Gaps = 0/218 (0%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M TVRCP C             +     C                 +             
Sbjct  1    MLTVRCPSCSRLLRGFDDNAGKEVKCPACQTVFLAEPEGGTALPYQRPVTVPDPETPWKP  60

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
            +    +      +         +    + +          R +            +    
Sbjct  61   KSEASNSEPPTPTTQQANDSPFKLKNTKADVALARLQWMARIVLCHSFFFCCSGHKIYLY  120

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
             L  +   + L F       ++     +N    ++     L     ++  L ++      
Sbjct  121  RLDRFSTLLFLLFMIRPLISIIMNLLAVNSLKPDYTMRTKLFLNLILVHLLFYILELFDY  180

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
                +       ++  +      ++  ++   ++    
Sbjct  181  IRLLSGRLPLFIIEWIIPVCLYISVTSLICFFLIFFSW  218


>RKU28812.1 hypothetical protein C6497_08615 [Candidatus Poribacteria bacterium]
Length=301

 Score = 42.4 bits (96),  Expect = 0.35, Method: Composition-based stats.
 Identities = 28/210 (13%), Positives = 66/210 (31%), Gaps = 26/210 (12%)

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
            Y L + +A     + LL        P +   +   +L ++ + ++ +S    +      +
Sbjct  38   YKLYLSIALIYFIALLLEYSLKGFIPGSIQGEIIPVLISMPFGIIAISAGVYATGSLYLE  97

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFC---------------  229
             ++      K     +       +++  ++  G + +    L+                 
Sbjct  98   REISADDIFKRIFHRLVPLIGSHLIVRAILALGLMSVSFSMLMTLRLGLPSILPGIIIGF  157

Query  230  ----------VWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
                      V + F   V+  +      AL+KS  L    WW I     L+L+I   +S
Sbjct  158  IILLISIYIIVSWMFHIPVILYEIPKVGYALKKSYNLTKFSWWRIVFIVFLILIICYAIS  217

Query  280  FLTAR-IPYVGEAANLAFSLLLTPFSFLYY  308
             +    +  +    N+A +          +
Sbjct  218  TIITLSVSSIMYVFNIAGNTNYQDLLQWMF  247


>RLG17831.1 hypothetical protein DRN63_02535 [Nanoarchaeota archaeon]
Length=267

 Score = 42.1 bits (95),  Expect = 0.35, Method: Composition-based stats.
 Identities = 25/206 (12%), Positives = 66/206 (32%), Gaps = 14/206 (7%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
               + +LG+ +L  ++          +   T L    +       +      +     + 
Sbjct  65   WILYKILGLVVLISLIELPVSVFFTAILLLTLLELTERKRYRFTEVLKSVKEIYVQFLIL  124

Query  176  GSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC  235
              ++       +       +  + +     L++ L++       L+    L       F 
Sbjct  125  SLLWFAFYFLILFASIHFLMTGKFLILLPFLILGLLIFGLLFFFLIYPIQL-------FS  177

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-------PYV  288
                     G + AL+++ +L+   +  +    ++  +I +  + L   +       P +
Sbjct  178  TMEAVCGKAGVISALKQAWMLIKKDYLIVLLVIIVGPIIFVLPAVLLIVLLSLAFPNPIL  237

Query  289  GEAANLAFSLLLTPFSFLYYYLIYSD  314
                +L   +L TP  FL+  L + D
Sbjct  238  TTIFSLVIQILATPTFFLFTILCWKD  263


>OQC14498.1 hypothetical protein BWX73_01757 [Lentisphaerae bacterium ADurb.Bin082]
Length=400

 Score = 42.4 bits (96),  Expect = 0.35, Method: Composition-based stats.
 Identities = 36/362 (10%), Positives = 68/362 (19%), Gaps = 16/362 (4%)

Query  3    TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
               CPHC  +     S       SARCP C   +    AES                   
Sbjct  2    EFNCPHCCHKIEADDSF---AGGSARCPICNGEITVPVAESSAFGVCPKCGQ--------  50

Query  63   RIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLL  122
             +      I     +  R       +  R+ +     L S                   L
Sbjct  51   ALTMPDPVICINCGHRFRETPVDPEKKARQRKMRLQMLPSHILGTLVYLATVFIMFVMPL  110

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
                +  +         ++            +W+  +L   + ++               
Sbjct  111  FPLRIVFLSIMLAAVLGMIAAEWAKFIMLRADWRTGLLAIGIVFLAALTIVSYYIWNSVF  170

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
                  +  S+      V  +        +      L       +               
Sbjct  171  RLERYTMIESIINDPECVSQYPPEPARDEIHRAALGLYSQTYPEVIMSRL-KKGLRTDLC  229

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTP  302
               G Q+      L                ++    S     +          FSL    
Sbjct  230  KKYGKQSFPFDYQL--KRTIQCIREQFSYYLLVFLRS--LFGLSVFNMLLAYVFSLKAHS  285

Query  303  FSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQL  362
            F         S                        +F   L+   +   +         +
Sbjct  286  FFGPVMGFRASWDTELPSHWDDLSEDEAGSAAGDPMFVLALVAVTINALIVGVANVIIWV  345

Query  363  LS  364
            + 
Sbjct  346  IM  347


>MBI1275309.1 hypothetical protein [bacterium]
Length=382

 Score = 42.4 bits (96),  Expect = 0.35, Method: Composition-based stats.
 Identities = 26/206 (13%), Positives = 58/206 (28%), Gaps = 23/206 (11%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
            +  +   + +    WL   ++          V    L   + T  + I I    VGL   
Sbjct  180  YIFMGFVVNIHWKRWLLCGDERLFHGTTYMRVLGAALWWFFWTLLIDIVISVVIVGLTIP  239

Query  193  MKLGLRHVGSFT-----LLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGL  247
            + +   H  ++           +   +    L +    L+  +             +G  
Sbjct  240  VLVVWAHSHNWMRDEVTAASTAISGPITLIILAISSVWLIRTLLLLTSVT------VGDS  293

Query  248  QALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA------------RIPYVGEAANLA  295
             ++++   +  GH+W I    +++    L +  L                  +       
Sbjct  294  WSIKRIWRMTKGHYWRILLSALVVYAPLLIVGILFTAMFMAALKTAEPLHMLLLVICTSL  353

Query  296  FSLLLTPFSFLYYYLIYSDLKANYRG  321
                L      Y  ++Y +LK  Y  
Sbjct  354  IQGTLAFIGIGYAAILYRELKKEYER  379


>WP_137178817.1 zinc-ribbon domain-containing protein [Roseomonas sp. AR75]
Length=163

 Score = 40.9 bits (92),  Expect = 0.35, Method: Composition-based stats.
 Identities = 11/34 (32%), Positives = 13/34 (38%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            + CP+C AE   P S L       RC  C    
Sbjct  2   RIACPNCSAEYEVPESLLTGGARLLRCARCGHQF  35


>CVH77937.1 hypothetical protein BN3662_02518 [Clostridiales bacterium CHKCI006]
Length=269

 Score = 42.1 bits (95),  Expect = 0.36, Method: Composition-based stats.
 Identities = 33/206 (16%), Positives = 65/206 (32%), Gaps = 12/206 (6%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            + +       +   Y +    A   I    L     +    +      ++   + Y+ L 
Sbjct  52   FIVAPLYHGRITASYKVIKQDAPLDIRMDGLCGFVRFKELFSTYGWIELINLILLYLALF  111

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
              +        +   +  L       L    S   L  ++ ++        ++   L  +
Sbjct  112  GIYALVLGETNVLGLEASL-------LSGKVSEIALYRVVQILYLIALAADLVIRWLTNL  164

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL-----TARI  285
            +FF   Y+L    I GL +L+ S  L+ GH   +F   +  L  +    FL         
Sbjct  165  FFFAAPYLLETRQIRGLASLKASVRLMRGHKRDLFLLQLRFLAPAAICFFLNYFSAYYLS  224

Query  286  PYVGEAANLAFSLLLTPFSFLYYYLI  311
             ++   A LA +LL        Y + 
Sbjct  225  AWLYSFAGLAITLLEIYLYQAQYQVA  250


>TPX71151.1 hypothetical protein SpCBS45565_g01187 [Spizellomyces sp. 'palustris']
Length=1741

 Score = 43.2 bits (98),  Expect = 0.36, Method: Composition-based stats.
 Identities = 37/342 (11%), Positives = 90/342 (26%), Gaps = 1/342 (0%)

Query  149  NPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLI  208
                    W+  +    ++ +  S +   +     K   G+F        +  S+ +   
Sbjct  197  MFYAFMEGWSGGMIICVFLPMLYSIVAAILEEKQSKLREGMFMMGLSRASYNLSWLVTYA  256

Query  209  LLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRF  268
            +L L     S +++          F     L   ++  +       +L+           
Sbjct  257  VLFLPAWLISAVIMKATFYTKTNLFILFLWLVISSLPVIGWAFILEVLMKSPRSGGLFSI  316

Query  269  VLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIK  328
             L   +     F+      +G AA +A S  ++PFSF+Y   + +  +    G       
Sbjct  317  ALFTGVGAAAMFINNNQWDLGPAAKVALSF-ISPFSFVYANRVIAHQEGRLTGVHFANWS  375

Query  329  RQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLP  388
             ++  +       +L+   +L +     L      ++          +      ++ +  
Sbjct  376  EEYQGIRFTTCLLILVLDAILYACLGWYLEKIVTSASSGHQPWYSFWRGDPAHQIHNAPF  435

Query  389  EEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNL  448
                  +        S Q   +                  A         + E       
Sbjct  436  SPLSNDAPLIEDTPSSGQIGISIRDLSRSFRHPTTGKVQHAVQDLTLDAYEGETMVLLGK  495

Query  449  SLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQ  490
            + A K +    +  +L   A +   R           +  + 
Sbjct  496  NGAGKTTLISMLTGLLPPTAGEALLRGRKPRDARNAKLAEDT  537


>KKP29518.1 hypothetical protein UR12_C0006G0005 [candidate division TM6 
bacterium GW2011_GWF2_30_66]
Length=297

 Score = 42.1 bits (95),  Expect = 0.36, Method: Composition-based stats.
 Identities = 27/191 (14%), Positives = 63/191 (33%), Gaps = 0/191 (0%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            ++L  ++     I    ++     +     +  +                +     I   
Sbjct  103  LFLFFLIGVILFIACLKIIYDFVIIGWVKLSLAYYDHKKLSIESFFCKPIIYLKYIIATF  162

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
               +       + L      +   I+   +      + +I    F + F+F  Y + + +
Sbjct  163  IFLIISIIPSLIYLWLYLLISYFTIIPDNIAYMTYAISVIFSWYFMLRFWFYPYYIIEGD  222

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPF  303
               ++AL+KS  L  G    I   FV ++++ +   F+      +  A    F L+    
Sbjct  223  SSSVEALQKSYNLNLGFTNVIISLFVFIIILGVPGLFVYWYPNNITYAIFGVFLLVSWIS  282

Query  304  SFLYYYLIYSD  314
            S++ Y  IY +
Sbjct  283  SWMSYAFIYRN  293


>NCC86655.1 DUF975 family protein [Clostridia bacterium]
Length=241

 Score = 42.1 bits (95),  Expect = 0.36, Method: Composition-based stats.
 Identities = 24/210 (11%), Positives = 69/210 (33%), Gaps = 12/210 (6%)

Query  106  LLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVA  165
             L   W          +    +  +L    +   + +  +             +++  + 
Sbjct  23   WLLILWASLTPTFVQQVIGIFIFPLLYLPMMPMTIFMDGSDLYFIITIIVYSLLVILMIM  82

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
             I+     ++G M   +   +        +   +  +F    I+  L +   ++LLIIPG
Sbjct  83   AIIPFTVAVSGYMLKCLRNNEFYSESIFHIVKPNFWNFISTEIIKSLFLLLWTMLLIIPG  142

Query  226  LLFCVWFFFCQYVLADDN-IGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR  284
             +    +   +Y++ D+  +   +A+  S  +  G+ W +F  ++  +            
Sbjct  143  YIKAFAYSMTEYIICDNPSLSSKEAINLSSTITKGYKWDLFVMYLSFIPWY---------  193

Query  285  IPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
               +         + + P+  +   + Y +
Sbjct  194  --LLSIITLGFGYIYVIPYVRITEAMYYEN  221


>PYX10427.1 hypothetical protein DMG88_02280 [Acidobacteria bacterium]
Length=322

 Score = 42.4 bits (96),  Expect = 0.36, Method: Composition-based stats.
 Identities = 16/159 (10%), Positives = 47/159 (30%), Gaps = 10/159 (6%)

Query  153  QNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL  212
                    +A    I+         +   +      L   +++      S     +L   
Sbjct  106  WLLLLLFYVANYCVIVYFNVAFASIVLDRMAGGHATLDDGLQIAWARRYSVLQWALLAAT  165

Query  213  VVGGGSLLLIIPGL----------LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWW  262
            V     ++     +          ++ +  +F   +LA +++   +AL +S  ++   W 
Sbjct  166  VGVLLKMIRERSEIEAWIAGALGYIWRLATYFVMPLLALEDVRPGEALYRSAAILKRKWG  225

Query  263  AIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
             +        ++ + L+     + ++       F    T
Sbjct  226  EVIVAGFSFPLLFVVLAVPGVALIFIAGYLGQTFGFAAT  264


>OUW61959.1 hypothetical protein CBD58_02495 [bacterium TMED198]
Length=218

 Score = 41.7 bits (94),  Expect = 0.37, Method: Composition-based stats.
 Identities = 25/177 (14%), Positives = 59/177 (33%), Gaps = 2/177 (1%)

Query  118  GWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGS  177
             + ++ +   GI L        ++ K +   +   + + + +       I+     +   
Sbjct  31   FYIVVFLLQAGISLGIIKCCLQIIDKDSFEPSQIYKQFDFLLSYLFSIAIIAVFGALVFF  90

Query  178  MFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQY  237
              I +      L  S+      + S   +   L ++      +  +  L   + FFF  Y
Sbjct  91   PTIMLIYNKNILNLSLGSLQNTIDSIEKIATSLSVIDTLFLSVSSLMFLYISLRFFFVPY  150

Query  238  VLADDNIGG--LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
             + D       ++AL  S  L  G    +   F++L+V+    + +T     +    
Sbjct  151  FIVDRASPANNVEALSLSYQLTKGKTLQLVPMFLILIVLGYLPNIVTFFFLPLTILL  207


>XP_013329970.1 Signal transduction protein Syg1 [Rasamsonia emersonii CBS 393.64]KKA23358.1 
Signal transduction protein Syg1 [Rasamsonia 
emersonii CBS 393.64]
Length=1375

 Score = 42.8 bits (97),  Expect = 0.37, Method: Composition-based stats.
 Identities = 36/526 (7%), Positives = 91/526 (17%), Gaps = 28/526 (5%)

Query  26    SARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSF  85
                C    +T I      +                        L +             +
Sbjct  636   CLDCMIWSKTKINYTFVFEFDTRHVLDWRQLSELPCFFFLLLGLFMWLNFSWVDSMFLYW  695

Query  86    CLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPA  145
              +               +    +  W  F      L G+Y +     F            
Sbjct  696   PVVLIFITVLVMFFPARVFYHHSRKWWAFSNWRLLLAGLYPVEFRDFFLGDMYCSQTYAM  755

Query  146   TWLNPQNQNWQWAILL----------ATVAYILLGLSWMTG-----------SMFIYICK  184
               +      +                    +  L   W                   +  
Sbjct  756   GNIELFFCLYATHWQNPPVCNSSHSRLLGFFTTLPSIWRGFQCLRRYRDTKNVFPHLVNF  815

Query  185   TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
                       + L            ++ +V               + +  C    ++  +
Sbjct  816   GKYMFGILYYMTLSMYRIHQSTRFQVVFIVFAFINATYCSVWDLAMDWSLCNPYASNPFL  875

Query  245   GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFS  304
               L A  +         + +     +++  +     +  R        +   S       
Sbjct  876   RELLAFRRVW------IYYVAMVLDVVVRFNWIFYAIFTRDIQHSALLSFFVSFSEICRR  929

Query  305   FLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQLLS  364
              ++      +                      +                ++   +    +
Sbjct  930   GMWTVFRVENEHCTNVHLFRAMRDVPLPYKVPSPPSAGGFGPGEEALPLQEQPPSTPAAT  989

Query  365   AGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFA  424
                       T P  TP L    P     +S     +  +  +    +    L       
Sbjct  990   TTGAADPESATVPTATPSLRARRPSIAGTISRVGNMMATAHSQDFQRKRRSDLVSGEPVD  1049

Query  425   DRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFH  484
              +   D+         +    P+   A      +               R    + P   
Sbjct  1050  PQTPLDESTDEEEDDHDELRTPSREEADPLEEELPNQGSFGSSVAVQSARAPVTQPPQQM  1109

Query  485   WVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKLELTLPLAIE  530
               G             R+         E   +     +  +P   +
Sbjct  1110  SCGGRLPAPQRDSFEERAYSQAP-PSYEATTASPRTEDDNVPDDFK  1154


>MXV82240.1 hypothetical protein [Candidatus Poribacteria bacterium]MYA58232.1 
hypothetical protein [Candidatus Poribacteria bacterium]
Length=314

 Score = 42.1 bits (95),  Expect = 0.38, Method: Composition-based stats.
 Identities = 23/156 (15%), Positives = 52/156 (33%), Gaps = 10/156 (6%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
             GI  L + +           +     + +        +++ +   +L    +  + F  
Sbjct  84   FGIGFLWLAMCPLTFIIVHQYRGTDATSGEAWRQTRRKIVSVLGIWVLFELLVIAAFFTI  143

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +     G  +     +    S                +++ IP   F V +      +  
Sbjct  144  VIPILTGFSQVFIPYVPVAPSTP----------ILLLMIVGIPAFYFLVKWSLYNQGIII  193

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT  277
            +N+  + AL +S  LV G W   FG ++LL+  ++ 
Sbjct  194  ENLSAITALRRSSELVRGKWKQFFGMYLLLVWGTMV  229


>WP_179899663.1 hypothetical protein [Actinomyces bowdenii]MBF0696148.1 hypothetical 
protein [Actinomyces bowdenii]NYS68321.1 hypothetical 
protein [Actinomyces bowdenii]
Length=276

 Score = 42.1 bits (95),  Expect = 0.38, Method: Composition-based stats.
 Identities = 29/199 (15%), Positives = 57/199 (29%), Gaps = 6/199 (3%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
            W     +      +     +           L           N    +L+       L 
Sbjct  45   WRWAWSQIKDQPLVLAGLPLWLVPSAILFPSLFSLLESVEGGLNGLLVLLIVLAFVSALL  104

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
            LS    +    +      L       + H  S    +++  ++V  G+   I+ GL    
Sbjct  105  LSVTGLNHACLVIAQGRRLHVKDFFVIPHAPSAFTAMLITTVLVMLGNAFFIVAGLAIMY  164

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            +F F   + AD       A+ +S  LVS           + +     L ++   +  VG 
Sbjct  165  FFAFAAMIAADRGGSPFAAIRRSCSLVSRSNE------FMSVFCITVLLYMAGTLTLVGW  218

Query  291  AANLAFSLLLTPFSFLYYY  309
                   +L+   S++   
Sbjct  219  IILGPVQVLMLARSYVMLA  237


>TRY69347.1 hypothetical protein TCAL_06925 [Tigriopus californicus]
Length=1493

 Score = 42.8 bits (97),  Expect = 0.38, Method: Composition-based stats.
 Identities = 20/285 (7%), Positives = 48/285 (17%), Gaps = 17/285 (6%)

Query  345   PGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLS  404
             P       S      E   + G                                     +
Sbjct  981   PNQPYPPSSDYGGPPEHNSARGPYADDPSDYSNYDDSPDYGYPRPNSSPNYDESGPPNYN  1040

Query  405   KQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEIDKVL  464
                   +       P     D               +    PN +   + +   +     
Sbjct  1041  DGNGPPNYNDGDGPPNYNDGDGPPNYSDRDRPPNYRDRDGPPNYNYDSESTDYRQDGPPD  1100

Query  465   DDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQ-VHSILGKLEL  523
                  +     +S       +   +    +           +             G+ E 
Sbjct  1101  YRKDPEGNSPDYSDPSGLPDFTDYSDRGTDPFGPASDYGPSKDSYDENDGPGGSRGRPEY  1160

Query  524   TLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGDRTDLLNVHASNSHAEP  583
             +             +     +        +  G++       G R         +S    
Sbjct  1161  S-------QYSDYGNRPAPAESQD----FEDYGNDDAFGPHTGPRNRGPGY-DYDSAGFS  1208

Query  584   LREIGFTWQKSGDAFSLRQMFDGNI----ESITVLVAGDSMTQSY  624
                         + F    +FDG      +     ++  +    Y
Sbjct  1209  NAPESAQRNSPPNEFHEVLVFDGPPSDGEDEHQAFLSDKTPKPKY  1253


>MBI5804310.1 hypothetical protein [Candidatus Pacearchaeota archaeon]
Length=238

 Score = 41.7 bits (94),  Expect = 0.38, Method: Composition-based stats.
 Identities = 23/181 (13%), Positives = 45/181 (25%), Gaps = 4/181 (2%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
               I       A   +  A        +   +   +  +           L      + I
Sbjct  52   WKIIVWFVFSSAIFLVLLAYAFSGIIGMCRVSLKRKTNLGDFVANADKFFLRNFLVILII  111

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                  VG                 +     L +      L+   +    +F F  + L 
Sbjct  112  VAVGAAVGQVAYYLAFFIGKSLNFGVNAARALYLLIYFAELVGAVI----FFTFSNFCLV  167

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLL  300
                  + A++ S  +V   + A     VLL VI   L ++      + E   +   +  
Sbjct  168  IYETKLIGAIKHSFSIVKKEYLATLVLSVLLFVIIFLLGWVPGIFGELIEYILIIPLVSS  227

Query  301  T  301
             
Sbjct  228  I  228


>SQB60826.1 Protein of uncharacterised function (DUF975) [Clostridium perfringens]
Length=176

 Score = 40.9 bits (92),  Expect = 0.39, Method: Composition-based stats.
 Identities = 17/50 (34%), Positives = 23/50 (46%), Gaps = 0/50 (0%)

Query  252  KSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
            K++ L+ G W    G  +L  V +  LSFL + IP  G       S  LT
Sbjct  10   KAKELLRGRWENAVGACLLFFVSTFLLSFLLSPIPVFGWIIIALISTFLT  59


>MBA2256010.1 hypothetical protein [Thermoleophilaceae bacterium]
Length=235

 Score = 41.7 bits (94),  Expect = 0.39, Method: Composition-based stats.
 Identities = 32/165 (19%), Positives = 62/165 (38%), Gaps = 9/165 (5%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                   +  + +  +      +        +   L   G      +LL  VV  G L+L
Sbjct  67   VAPVIATVLYAGIVSAAVAARREGTRPSLPRLARTLP-YGRLVGADLLLAPVVAAGILML  125

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS--  279
            I+PGL+F  WF      +  ++ G + A  +SR LV  H+W +    +   ++   ++  
Sbjct  126  IVPGLVFLTWFALVAPAIEIEHRGVVDAFRRSRRLVRRHFWKVASLVLPAFLVEELVASA  185

Query  280  ------FLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKAN  318
                  +        G A  +  +LL  P   L   +++ +L+A 
Sbjct  186  AESGSVWALGDTFVGGWAGFVLGNLLAAPILALAMVILFCELRAR  230


>WP_161600755.1 zinc-ribbon domain-containing protein [Roseomonas oryzae]
Length=185

 Score = 41.3 bits (93),  Expect = 0.40, Method: Composition-based stats.
 Identities = 10/36 (28%), Positives = 13/36 (36%), Gaps = 1/36 (3%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIF  38
            ++CP C A    P + L       RC  C Q    
Sbjct  2   RIQCPDCAAAYEVPEAMLVP-GRPVRCARCGQRWQP  36


>KAF9588719.1 hypothetical protein IFM89_015156 [Coptis chinensis]
Length=601

 Score = 42.8 bits (97),  Expect = 0.40, Method: Composition-based stats.
 Identities = 13/179 (7%), Positives = 44/179 (25%), Gaps = 5/179 (3%)

Query  104  SQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLAT  163
                    ++       ++ +      +    +            +         +    
Sbjct  66   QYHPHICMQIKQFCMVVVVVLMEKVHSILVLVVVERSSGSGLGMHSDGGGGVGPKMCHPL  125

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
                 +              +T +     ++           L + L+L      +L   
Sbjct  126  YLLPPVHWRGSLHFCLHLHFQTSLFQLHPLRHSSCLQTPLHQLPLRLLLDAYIKRVLP--  183

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
                    +     +   + + GL A+++S  L+ G+        +  L+I + +  + 
Sbjct  184  ---YVLAMWHLASVLSVFEPVYGLAAMKRSNKLLKGNKKVAMTLVISYLLICMAIGIVF  239


>MBD3174297.1 hypothetical protein [Armatimonadia bacterium]
Length=249

 Score = 41.7 bits (94),  Expect = 0.40, Method: Composition-based stats.
 Identities = 21/243 (9%), Positives = 50/243 (21%), Gaps = 5/243 (2%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
               +CP C  E   P         + +C  C + +       ++                
Sbjct  9    IKFQCPSCRKEITVPDE---MAGRTGKCGGCGERVTVPTGAEKKAWPPTLPGGASTRKPA  65

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                   L    +    R                    L   +       +     G   
Sbjct  66   DEEELGDLAPPPEPERPRPAWAQQPPPEPERPETPPGVLPPPTPPPPPMPQGPPPPGPSP  125

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWA--ILLATVAYILLGLSWMTGSMF  179
            +G     + +                             +  A      +     T +  
Sbjct  126  IGATPPPMTMPTEDDDERGSPARDWCSMVVGWMGIVWPALAWALFLLSYVASYVGTRAAM  185

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
              +      +  S+  G   +  F +  +   + V G   L++   +    +  +  Y  
Sbjct  186  EGLEPAIWTVLTSLLAGPLLLAWFVIDAMDQGMSVWGLIALIVGLCVPCVGFLVWLWYWC  245

Query  240  ADD  242
            +  
Sbjct  246  SYR  248


>OLD50887.1 hypothetical protein AUI42_01215 [Actinobacteria bacterium 13_1_40CM_2_65_8]OLE80294.1 
hypothetical protein AUG06_05110 [Actinobacteria 
bacterium 13_1_20CM_2_65_11]
Length=238

 Score = 41.7 bits (94),  Expect = 0.40, Method: Composition-based stats.
 Identities = 15/96 (16%), Positives = 35/96 (36%), Gaps = 0/96 (0%)

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
               L+     L   V F F Q ++  ++    Q+L     L+  H+  +   +++L+ +S
Sbjct  92   VAILVSTAFWLALGVAFQFAQRLVVLEDGHVAQSLSTGFRLIRWHFKEVAFGWLILIALS  151

Query  276  LTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
            + +    A +  V      A          +   ++
Sbjct  152  IAVGIAFAILAVVVAIPAAALGFGGWAMGGMTGAIV  187


>KKP95651.1 hypothetical protein US03_C0003G0051 [candidate division TM6 
bacterium GW2011_GWF2_36_131]KKQ03370.1 hypothetical protein 
US13_C0003G0051 [candidate division TM6 bacterium GW2011_GWE2_36_25]KKQ18620.1 
hypothetical protein US32_C0024G0004 [candidate 
division TM6 bacterium GW2011_GWA2_36_9]HBR70852.1 hypothetical 
protein [Candidatus Dependentiae bacterium]HCU00545.1 
hypothetical protein [Candidatus Dependentiae bacterium]
Length=279

 Score = 42.1 bits (95),  Expect = 0.41, Method: Composition-based stats.
 Identities = 23/197 (12%), Positives = 57/197 (29%), Gaps = 7/197 (4%)

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
             G+  +             +L   T L           +    +  LL            
Sbjct  51   FGVVGIYKTFEQWTQALGKVLNVYTVLALVAVLIFNYWIKVIFSAALLRRVMEILHRKDM  110

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLIL------LILVVGGGSLLLIIPGLLFCVWFFFC  235
                     + +   L       ++++       + +       L+    + + +   F 
Sbjct  111  TFSESFDFGKQLNKKLFFWSVVFIVVVAFILVSSIYIPQITKYYLVYFLLMGWFIGTLFV  170

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY-VGEAANL  294
              VL   N+   +A++K+ ++V  H   I    V ++   L + F+       +   A  
Sbjct  171  LPVLIKRNLSLGKAIKKAFVMVWKHIIEIISATVAMIFYMLLIGFIVGVTGLGLTWLACY  230

Query  295  AFSLLLTPFSFLYYYLI  311
              ++ L     + + L+
Sbjct  231  LLNVPLGIGLIMPFILV  247


>WP_090816307.1 DUF975 family protein [Oribacterium sp. KHPX15]SEA40547.1 Uncharacterized 
membrane protein [Oribacterium sp. KHPX15]
Length=441

 Score = 42.4 bits (96),  Expect = 0.41, Method: Composition-based stats.
 Identities = 10/98 (10%), Positives = 27/98 (28%), Gaps = 12/98 (12%)

Query  216  GGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
              S +      +  ++     +  AD   +G  +A+  S  L+    + +    +  +  
Sbjct  162  ITSYIASFIMYIVALFLSMASFASADHPEMGAWEAIGVSLHLMRKRKFKLMCLELSFIGW  221

Query  275  SLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIY  312
             +                     L + P+      + Y
Sbjct  222  FVIF-----------VITFGIAGLWIIPYFNTSLSIFY  248


>NLE36775.1 zinc ribbon domain-containing protein [Pirellulaceae bacterium]
Length=417

 Score = 42.4 bits (96),  Expect = 0.41, Method: Composition-based stats.
 Identities = 35/388 (9%), Positives = 76/388 (20%), Gaps = 45/388 (12%)

Query  3    TVRCPHCGAERNTPSS------------------------KLPAKKSSARCPECCQTLIF  38
             + CPHCG E   P                           +PA+ S     +       
Sbjct  29   ELTCPHCGTEVEVPDRDAASSTAAEPPAPPASKPAELDAPVVPAEPSPGPPEDEYAIRGM  88

Query  39   DPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGS  98
              A  + +    +    P   L     +                     +        G 
Sbjct  89   TYAIREPSPDEHSGPVVPVVDLPDEYEAPGRRPDRVWEEDEVKPLDERPKLPPRPMVDGV  148

Query  99   GLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWA  158
                    +   W +       +LGI                    A      +      
Sbjct  149  FRIFGQMSVILCWFVLSIMSSVVLGIA---YYAIVLGSVDRGEGAAAAPSWVGSVLLAGV  205

Query  159  ILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGS  218
              ++++  +++  +++   +                L L        L+  L +      
Sbjct  206  ATISSLLIMVVANAYLMAILNDSAAGNARVENWPDVLFLTWATDALFLIASLCVPFLVAL  265

Query  219  LLLIIPGLLFCVWFFFCQYVLA----------DDNIGGLQALEKSRLLVSGHWWAIFGRF  268
             L    G L    ++                  +    L  +    +    H    +G F
Sbjct  266  ALAYPLGGLLAGAWWIGPPCYWLLFPIALLSTLEMQSPLVPVSPVVVQSLWHCGRTWGVF  325

Query  269  VL-------LLVISLTLSFLTARIPYVG-EAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
             L        L+ +         + + G        +     +  L   + +   +    
Sbjct  326  YLETAVLGGTLLFAGFCFHAAGFMNFAGVAILGGLATFAAMLYFRLLGRVAWVCDERFRE  385

Query  321  GPQHPPIKRQWLPLTAAIFGWMLIPGLL  348
                                   I    
Sbjct  386  LRAENEAGEDDDEGEFGDTTPSEIRPTP  413


>MBI4764717.1 zinc-ribbon domain-containing protein [Deltaproteobacteria bacterium]
Length=206

 Score = 41.3 bits (93),  Expect = 0.42, Method: Composition-based stats.
 Identities = 12/109 (11%), Positives = 26/109 (24%), Gaps = 1/109 (1%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M  +RC  C    +    ++    S  RC  C      +P     ++      T      
Sbjct  1    MI-IRCEKCETTYHLDQKQIEPFGSKVRCSRCGHIFWAEPPSFSFSEDPGLQKTPGVFLP  59

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLAD  109
                    +     +        +  L       A    ++ +    + 
Sbjct  60   FPEEGKAAVRPIQSSRKIFWTLGTIFLFILIALTARFFYIQYLHPDWSM  108


>NLH15281.1 hypothetical protein [Phycisphaerae bacterium]
Length=390

 Score = 42.4 bits (96),  Expect = 0.42, Method: Composition-based stats.
 Identities = 19/217 (9%), Positives = 47/217 (22%), Gaps = 0/217 (0%)

Query  1    MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGL  60
            M   RC HC  +   P      +    RC +              + T  ++        
Sbjct  1    MIKFRCIHCNKKYGVPDEWAGKRIRCKRCGDSSLVPHPVIDLQVSSTTPTSLEPPQGEKS  60

Query  61   QRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWG  120
            Q+      +          +   S  +        +    R   +               
Sbjct  61   QKDKKGFGISSDMLDFPVGKAPSSEEMARVSSKADNPHLDRGSLRDSERIGLRAAAYRNK  120

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
                  +   L F      +    + ++          +  A    + +G         +
Sbjct  121  KKWFRGVVFGLVFTLAAGLVWTAISYFVLFDLFILVIGVGWAAGLGVAVGSRKPGFLTGL  180

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGG  217
                  +G   + K+         L+  ++   +   
Sbjct  181  LGVGIGLGGMMAAKVMKAQFLYAPLVRTVMNQEIAYW  217


>XP_030942390.1 uncharacterized protein LOC115967438 [Quercus lobata]
Length=300

 Score = 42.1 bits (95),  Expect = 0.42, Method: Composition-based stats.
 Identities = 21/152 (14%), Positives = 47/152 (31%), Gaps = 5/152 (3%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L  I+ +  ++      +  +      +        ++   A     L         M  
Sbjct  74   LPIIHQIVFLVFTLVPSTIAMFLTIASVYTSEPIPFFSFFRAIFYIFLHYFDTFFLVMAF  133

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
                  + +       L +  +  LLL++  +V+     L           +    +V  
Sbjct  134  VSLYRALLINTICLSILAYHTNNPLLLLVACIVLFLFIALQFFTEH-----WLLASFVSM  188

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
             + I G+ AL++SR L+ G    +      L+
Sbjct  189  FEGIHGMAALKRSRELIKGRTDILVVLLFFLV  220


>CAB1348301.1 unnamed protein product [Coregonus sp. 'balchen']
Length=977

 Score = 42.8 bits (97),  Expect = 0.42, Method: Composition-based stats.
 Identities = 18/196 (9%), Positives = 47/196 (24%), Gaps = 12/196 (6%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
                        F   F    +              +      +      + ++   +  
Sbjct  730  YYHFSSPYCTDPFIHCFLVPFIHCFLVPFIHCYLVPFIHCFLVLFIHCFLVPFIHCFLVP  789

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
            +I    V       +   H      +   L+  +    +  I   L+  +  +   ++  
Sbjct  790  FIHCYLVPFIHCFLVPFIHCFLVLFIHCYLVPFIYCFLVPFIHCFLVQFIHCYLVPFIHC  849

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL-TARIPYVGEAANLAFSLL  299
                  L       L++  H       F++  +    + F+    +P++     L     
Sbjct  850  -----FLVPFIHCFLVLFIHC------FLVPFIHCFLVLFIHCFLVPFIHCFLVLFIHCY  898

Query  300  LTPFSFLYYYLIYSDL  315
            L PF   +        
Sbjct  899  LVPFIHCFLVPFIHCF  914


>WP_146582994.1 hypothetical protein [Rhodopirellula pilleata]
Length=330

 Score = 42.1 bits (95),  Expect = 0.43, Method: Composition-based stats.
 Identities = 42/324 (13%), Positives = 93/324 (29%), Gaps = 28/324 (9%)

Query  1    MPTVRCPHCGAERN----TPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP  56
            MP + CPHC A         S+      +  R     + +I   + +  ++         
Sbjct  1    MPRLGCPHCDALLETLAPLTSNAQVECGACDRSFLSNEAIIPAFSPAPFSRDLPAGFGRG  60

Query  57   HCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCR  116
                Q  + +  +                 +        S   +     +LA        
Sbjct  61   GKLDQVLLSAIEMLRSHVWALLVTSLFVNAVWFAVVGWPSNFLIGQWRFMLAGESTDLGS  120

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNP-----------QNQNWQWAILLATVA  165
                +   + +G+++A    ++ +++   +                  +       +   
Sbjct  121  FAALMTATFAVGMMIAPMSAYAWIVMARLSLHICRYGTPRPVSVAAAVSHWKVPFRSVWQ  180

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
              +L ++       I +    V +  S+        S TL+  L +  +  G L  +   
Sbjct  181  ISVLFVALGVMFATIVVGGLVVTIALSL---WTDPQSATLIGTLGMGTILIGGLFSMQWL  237

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            L   ++        ADD      A+  S  L   H        + L+ +   L+ L    
Sbjct  238  LWPSLFLI------ADDRANLTTAVRWSVRLAREHRK----LSLSLVTVYFLLATLGKLF  287

Query  286  PYVGEAANLAFSLLLTPFSFLYYY  309
             YVGE   +  +++     +L   
Sbjct  288  FYVGEVVTIPIAIIPMAIGYLKMT  311


>MBD3166946.1 hypothetical protein [bacterium]
Length=419

 Score = 42.4 bits (96),  Expect = 0.43, Method: Composition-based stats.
 Identities = 20/157 (13%), Positives = 50/157 (32%), Gaps = 11/157 (7%)

Query  153  QNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL  212
                   LLA +   ++ ++   G +   +         ++ +G   +    ++ +LL++
Sbjct  257  WPGLIWTLLAALITAVVMIALFKGHLKTTLENFIEKPLPTIGIGALGLIVTPIVSVLLMV  316

Query  213  VVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLL  272
            +V    L L++  L     +                 L        G      G      
Sbjct  317  LVIALPLGLMLLALYVSFLYLGWVLFAVIGGTWLWAVLR------KGDTNVWLG-----G  365

Query  273  VISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYY  309
            ++ + + +L A IP++G    +  ++          Y
Sbjct  366  LLGVFVLWLIALIPFLGALVCIFATIAGMGMLMTGVY  402


>RMD59649.1 hypothetical protein D6821_00790 [Candidatus Parcubacteria bacterium]
Length=315

 Score = 42.1 bits (95),  Expect = 0.45, Method: Composition-based stats.
 Identities = 19/206 (9%), Positives = 58/206 (28%), Gaps = 1/206 (0%)

Query  82   NRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALL  141
            +             +          ++ ++ +F      +  I L    L    +     
Sbjct  34   SWVLVAMVILWSYYNFDKFIQAKPPVSVAFVVFYLAIVAVTFIILFFNSLMLELVQQIES  93

Query  142  LKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVG  201
                +      +   + ++      +L  +     ++   I   +    R  +       
Sbjct  94   GTEVSLAKAWAEVINFNLITIFCLSLLWAVVIFILALLRAIFNRNSSGSRRNERFSWENS  153

Query  202  SFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW  261
            +  L              L ++  L+  + F      +  +N+   +A+++   +++ H 
Sbjct  154  ARVLAGDDGQPFRWSDFGLDVLSELIGLLVF-LILPAITWENMKTKEAIKRGWRILTKHP  212

Query  262  WAIFGRFVLLLVISLTLSFLTARIPY  287
               F  + L   +   LS   A + +
Sbjct  213  TQFFTTYGLGFFLVFLLSVPVALVFW  238


>RLG18588.1 hypothetical protein DRN67_04035, partial [Candidatus Micrarchaeota 
archaeon]
Length=269

 Score = 41.7 bits (94),  Expect = 0.46, Method: Composition-based stats.
 Identities = 23/187 (12%), Positives = 49/187 (26%), Gaps = 15/187 (8%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            ++LL ++         +            +      +   +  I+  +      +     
Sbjct  84   LFLLAVLAPGYFEALTMFSALKAKGFKSVKWTVGKYVNYLLLGIVHFIYAFFSLLNRPYR  143

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL------------LLIIPGLLFCVW  231
                G + S+   +      T    ++        L              I+  +   + 
Sbjct  144  PIVAGFYASLFALIGVFIVLTGGTAIVPAWALTILLVYLILLVGLFGTAYIVLFVYNGIR  203

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
               C  +  +  +G   A+  S  L  G    +F   V   VISL   F    I  + + 
Sbjct  204  QIMCFPLFLESEMGKRAAIRASWELTRGRALDVF---VASFVISLFWGFFIGIINTILQM  260

Query  292  ANLAFSL  298
                 S 
Sbjct  261  FTRLAST  267


>WP_084417361.1 hypothetical protein [Mariniblastus fucicola]QEG23063.1 hypothetical 
protein MFFC18_29550 [Mariniblastus fucicola]
Length=239

 Score = 41.7 bits (94),  Expect = 0.46, Method: Composition-based stats.
 Identities = 26/221 (12%), Positives = 54/221 (24%), Gaps = 2/221 (1%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
                C  CGA    P   +  K    +C E C            ++  +      +    
Sbjct  3    IKFNCHKCGASLRVPEQHIGKKARCPKCDEKCVVPATSQGSESGSEGGEEFEDIGNVLEH  62

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                +      +          +       +  A  S +    Q  + S           
Sbjct  63   PATATPVGLPPAIGAPTAVPTAAPLPPGSSDPPAMASPVAPPFQSNSASPYASPIGKPQG  122

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW--MTGSMF  179
              I    +   +       +    T++           ++    +I +G        ++ 
Sbjct  123  NAISGSIMHPLYEARNMMKIWGWLTFIVGILYCLTIVGIIVAWVFIWMGWLVKGAAEAVT  182

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLL  220
            I I   D+   R     L        +  ++ L + G  LL
Sbjct  183  IGIETGDMAQLRLANERLGTFFKILGVGAIIWLALVGIYLL  223


>MBC8872297.1 hypothetical protein [Planctomycetes bacterium]
Length=553

 Score = 42.4 bits (96),  Expect = 0.46, Method: Composition-based stats.
 Identities = 33/338 (10%), Positives = 74/338 (22%), Gaps = 34/338 (10%)

Query  5    RCPHCGAERNTPSSKLPAKKSSAR-CPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            RCP C A     ++       + +                +  Q  +         L + 
Sbjct  77   RCPSCDAPLAQNAALCVECGYNLQSGKFVKGMGGMGKKSPRGPQKAEGYEGVAEELLSKA  136

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLG  123
              +      +           +              L  +      + E           
Sbjct  137  ERALDAAPMADRGEKESWFIPYLTAGALIAIGGVGFLMWMGFSYLINQEESDSSDLARFV  196

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            ++ L   L                +    QN    +       I              + 
Sbjct  197  LFWLACGLNLVGGILTFSAAIMIAIYAFKQN--DTVHGILSLLIGFYAVVYACLQRGRLD  254

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLIL-----------VVGGGSLLLIIPGLLFCVWF  232
            +        +   L  +  F   ++L  L                 LL +   LL    +
Sbjct  255  RELKMWGLGLFCSLSALCIFFFDVVLSGLAFSTESMGQVAFGVLILLLFVGGMLLMFAGW  314

Query  233  FFCQYVLADDNI-GGLQAL-----EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-  285
             F   V   D +  G+ A+       +  +   +   +  +  L     +  SF+ + + 
Sbjct  315  IFSLVVCFMDRVYHGILAIFFPVYGATYCIARRNEEPVPAKLWLGGAGCIISSFVLSFLG  374

Query  286  -------------PYVGEAANLAFSLLLTPFSFLYYYL  310
                           +G    +   + + P   +   L
Sbjct  375  NMLIIATRTDADWSVIGTVLGMFALMFIAPPIVVMLVL  412


>HAN63642.1 hypothetical protein [Rhizobiales bacterium]
Length=55

 Score = 37.8 bits (84),  Expect = 0.47, Method: Composition-based stats.
 Identities = 8/38 (21%), Positives = 13/38 (34%), Gaps = 1/38 (3%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
            + CP C       ++  P +    RC +C       P
Sbjct  4   LIICPICDTRYE-TAAVFPPEGRKVRCSKCTHVWQAMP  40


>MBI1913873.1 hypothetical protein [Planctomycetes bacterium]
Length=124

 Score = 39.8 bits (89),  Expect = 0.47, Method: Composition-based stats.
 Identities = 12/39 (31%), Positives = 14/39 (36%), Gaps = 3/39 (8%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFD  39
           M    CPHC A  + P  K+        CP C Q     
Sbjct  1   MMRFACPHCKATVSAPEEKV---GVRVYCPRCRQPFQVP  36


>QDU80323.1 hypothetical protein Pla110_20500 [Polystyrenella longa]
Length=417

 Score = 42.1 bits (95),  Expect = 0.47, Method: Composition-based stats.
 Identities = 29/320 (9%), Positives = 76/320 (24%), Gaps = 21/320 (7%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
            ++CP C A  +  ++    K S   C    +          R             G    
Sbjct  98   IQCPDCKASFSADAADYGKKTSCPLCSTSVEVPDPSGMSQLRKARQKKKPHSLLDGEIAP  157

Query  64   IPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASG-----SGLRSISQLLADSWELFCRRG  118
             P +  E     +                            +  ++  L     +     
Sbjct  158  APREHKEKLPTGIMLYIDGVLTFPFQPEVIFRLCLFAALIFILDVTVDLCVYALMTVPYA  217

Query  119  WGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSM  178
               LG+  + + +      +          +  +++ +  +      Y    L+     +
Sbjct  218  LRALGLSAIVMFIITGGHAAVTFQTIFEETSAGSKDVRGWLEFDKYEYAFRLLTIGWLFL  277

Query  179  FIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYV  238
               +      +     L   +  S    + L +L++  G LL +       +  F     
Sbjct  278  IATVAGFVPVIVIGFILEKVNF-SLPGQMPLAMLLMPIGFLLSLAAFPFVALSTFEQHSY  336

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSL  298
             +  +    ++L +                    +I   +++       V          
Sbjct  337  FSVFSPRIFKSLFREWW---------------SWIIVTFVTWAFFLPWVVMRMVGSIAYP  381

Query  299  LLTPFSFLYYYLIYSDLKAN  318
             +  F    Y+     + + 
Sbjct  382  FINLFILSPYFAFIFFVYSR  401


>PWA73344.1 hypothetical protein CTI12_AA261870 [Artemisia annua]
Length=319

 Score = 42.1 bits (95),  Expect = 0.48, Method: Composition-based stats.
 Identities = 18/181 (10%), Positives = 51/181 (28%), Gaps = 3/181 (2%)

Query  131  LAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLF  190
               A I  +         +      +         +    ++ +   +         GL 
Sbjct  85   CDIATITYSTHHCVLGDPSSLLTILKSLKCTFFPLFFTSFVAAVLVVLITLTFLVSYGLV  144

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
              +   L  V S+          +   +++  +   +    +     V   ++  G  AL
Sbjct  145  LMLGQTLGFVNSYD---NYFWFSLVLHTVIFGVIVGVIITKWSLAYVVGLAESKWGFSAL  201

Query  251  EKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYL  310
            ++S  L          + +   ++S    F    +P      + + +  ++  SF+   +
Sbjct  202  KRSWHLFEDVSKRSLKKGLKYDILSGICIFCGGIVPAAVVYVSWSLNWTISDLSFILLTV  261

Query  311  I  311
             
Sbjct  262  F  262


>MBI2836781.1 zinc-ribbon domain-containing protein [Acidobacteria bacterium]
Length=587

 Score = 42.4 bits (96),  Expect = 0.48, Method: Composition-based stats.
 Identities = 12/31 (39%), Positives = 17/31 (55%), Gaps = 0/31 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECC  33
            + CP C A+   PSS++P + S   CP C 
Sbjct  10  LIACPSCHAKFKVPSSRIPPQGSRVACPRCK  40


>BAT06933.1 Os09g0129600, partial [Oryza sativa Japonica Group]
Length=157

 Score = 40.5 bits (91),  Expect = 0.49, Method: Composition-based stats.
 Identities = 17/134 (13%), Positives = 35/134 (26%), Gaps = 11/134 (8%)

Query  207  LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIF  265
              +  L++    L+  I  + F V   F   V   +    G  A  ++  LV        
Sbjct  15   YYVPFLLLSLFVLVGFIFLVYFSVLCSFSVVVSVAEPWCHGAGAFGRAWRLVKEKKRRAV  74

Query  266  GRFVLLLVISLTLSFL----------TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDL  315
                 + V++  +S +          +     +           +  F        Y + 
Sbjct  75   LFVAAISVLAAIVSAVYKLSMAGARSSIVAGLLLGLVYAILMGAVELFGVCSLTAFYYEC  134

Query  316  KANYRGPQHPPIKR  329
            K +          R
Sbjct  135  KGSNEVVTTDQYVR  148


>WP_172304196.1 FHA domain-containing protein [Pseudenhygromyxa sp. WMMC2535]NVB39161.1 
FHA domain-containing protein [Pseudenhygromyxa sp. 
WMMC2535]
Length=610

 Score = 42.4 bits (96),  Expect = 0.49, Method: Composition-based stats.
 Identities = 28/191 (15%), Positives = 52/191 (27%), Gaps = 20/191 (10%)

Query  132  AFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT-GSMFIYICKTDVGLF  190
              A      L+       P   +    ++           +      +        V   
Sbjct  231  FAAVTIVTALVNLVFGWIPVVGSIFAGLMGLVTLIAGPISAAALGYFVIKQRMGQPVTAV  290

Query  191  RSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQAL  250
             + K   +      +   L   + G GS+ LI+PG+      FF   +   +N   L   
Sbjct  291  DAWKTAFKSPIPVWVNFALAGFIAGLGSIFLIVPGIALG---FFIVPMFFVENKRLLAIN  347

Query  251  EKSRLLVSGHWWAIFGRFVL-------LLVISLTLSFLT---------ARIPYVGEAANL  294
              S       W  +    ++       + ++   LSFL            I  + +A + 
Sbjct  348  LGSLEYFKRDWAKVLLICLVGLAGMIAVGIVGGLLSFLFGLFSDYLSAGIINLLTQAGSG  407

Query  295  AFSLLLTPFSF  305
                LL    F
Sbjct  408  VVMTLLAVVGF  418


>HBE07214.1 hypothetical protein [Firmicutes bacterium]
Length=278

 Score = 41.7 bits (94),  Expect = 0.49, Method: Composition-based stats.
 Identities = 28/177 (16%), Positives = 51/177 (29%), Gaps = 19/177 (11%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             I L  I L  A I  A       +    +    + +  A V   +   + +  S     
Sbjct  26   FILLFCIALIGAVISLAGSTYAFLFGEEISGFTIFLLGAADVFITVWINAALIFSSASLH  85

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLI------------------LLILVVGGGSLLLIIP  224
                  L    K+      S  +  +                  ++  V     L     
Sbjct  86   YGNQFSLAEMKKVTRSKYWSAFIATMKLALIILVPILLIYYLAIVIPAVGVLVLLPAFCF  145

Query  225  GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
                   + F  + +  +    + A + S  LV G +W I G  +L   I +T+S +
Sbjct  146  LFYLSFRYLFVVHSVILEEH-AIGAFKNSSRLVDGRFWKIVGGTLLFSAIFITISLI  201


>RPI63797.1 hypothetical protein EHM48_01870, partial [Planctomycetaceae 
bacterium]
Length=429

 Score = 42.1 bits (95),  Expect = 0.49, Method: Composition-based stats.
 Identities = 56/425 (13%), Positives = 110/425 (26%), Gaps = 15/425 (4%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
              V CP C  E     S       S RC             +    +   +       L 
Sbjct  7    VQVLCPQCKHEYEFKDSL---AGKSVRCKCGNVFTFPSQPPASHQPSVPGVCPSCGATLA  63

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                       +     +  + S          A+ +   + S        LF     G 
Sbjct  64   ADAVLCVNCGLNLRTGLQITSASIAPPLSAAGSATRNSRGTRSVPALTLIGLFRIPFTGD  123

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            L    L   L        LL+     L           L+   A +   L+ +   M  Y
Sbjct  124  LPAEGLMQGLYLTLWSVVLLVLLFLTLVVLAAGAPMFGLVLAAATLAAALAVLGWLMRRY  183

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL---LLIIPGLLFCVWFFFCQYV  238
            I   +      +          T+ ++L+I ++  G +   +   P L+  +        
Sbjct  184  IGVVEQYASDGLTAMTDLSIRETVGIVLVIALIIAGPIVAGMFFSPLLIPAMLLAGLYVP  243

Query  239  LAD------DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
            +A         +  L  +++   +  G+   I     L L   +    L   +  V    
Sbjct  244  MAVSLAAVERTLNPLTVVKRIVEMFPGYILVILYVVPLQLAAEIVGDLLGGLLMSVLPRV  303

Query  293  NLAFSLLLTPFSFLYYYLIYSDLKA---NYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLL  349
             +    L+      Y  +    +          Q  P++ Q+LP    +    ++  +LL
Sbjct  304  AIVAGWLIAVSLAQYGVVAQYAMMGGLLRLYRQQASPLRGQFLPTLKWVGTACVLGSVLL  363

Query  350  VSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKT  409
                         ++     QQR            R    +    + A ++   +     
Sbjct  364  TVRLSPAWEYINPMARELAQQQRTKQAALADEQARRDRQSQLTAAAKAQFEQEWNGFHPA  423

Query  410  TSEGG  414
              +  
Sbjct  424  AEQDP  428


>RLI90893.1 hypothetical protein DRO65_02145 [Candidatus Altiarchaeales archaeon]
Length=208

 Score = 41.3 bits (93),  Expect = 0.49, Method: Composition-based stats.
 Identities = 22/145 (15%), Positives = 47/145 (32%), Gaps = 11/145 (8%)

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGG  246
            + +F  + LG   V  F+  LI LI     G  +  +      +      Y +  +N+ G
Sbjct  47   IPIFFFVGLGFILVKVFSFNLIPLIGAAIIGIGIYTLYIFCLGLLLAPVNYAIVIENLDG  106

Query  247  LQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF-------LTARIPYVGEAANLA----  295
            +  +++       +         LL  I+  LS        + + IP +G          
Sbjct  107  ISGIKRGVEFFMRNKLVCILLISLLSGINTLLSIIGNSVASIFSVIPLIGPILTTLTYVG  166

Query  296  FSLLLTPFSFLYYYLIYSDLKANYR  320
            F ++          ++++       
Sbjct  167  FLIVAIFIMSGISTVMWTKTYIERG  191


>WP_052298606.1 hypothetical protein [Syntrophobotulus glycolicus]
Length=175

 Score = 40.9 bits (92),  Expect = 0.50, Method: Composition-based stats.
 Identities = 16/129 (12%), Positives = 40/129 (31%), Gaps = 10/129 (8%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +           G +   +   +V +   M +  ++   F L   +  ++V       
Sbjct  43   LFMLAGAFLKGGFLGCVLAGVNDHEVRIGTFMAMAKKYFLRFILQSSITFILVIAFVPFF  102

Query  222  IIP----------GLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLL  271
             IP           ++   +  F  Y++  +N   + A   S  LV  +   +    + +
Sbjct  103  FIPGPLVLFLFIGIIILFFYLMFWDYIIVVENAKLIDAARISCSLVQSNLRKVLSLVIPV  162

Query  272  LVISLTLSF  280
             +I+     
Sbjct  163  SIITALFGI  171


>MBC2734268.1 hypothetical protein [Desulfobacteraceae bacterium]MBC2750296.1 
hypothetical protein [Desulfobacteraceae bacterium]
Length=451

 Score = 42.1 bits (95),  Expect = 0.50, Method: Composition-based stats.
 Identities = 41/301 (14%), Positives = 83/301 (28%), Gaps = 13/301 (4%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            + R GW L  I    +  A   +            +    + +          +L+    
Sbjct  49   YFRMGWVLQLIAFGFLWAAILWLAWNFERHLLLGASAGESDPRLRQRYGFNFLLLVVGGV  108

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +   +  ++ +  +     + L          +  +       G ++ I    LF V F 
Sbjct  109  LLAFVIWHLLQWVLAAIVFLHLAASDNNPLPSVAYV------LGWIVAISVVGLFPVRFV  162

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLT---LSFLTARIPYVGE  290
                  A    G   A ++   L  G+   +F   V+L  +      L+ ++ R      
Sbjct  163  LLLPAAA---AGHDIAPKRIWRLTRGNGLRLFLVLVILPGVLAGYGQLALMSGRGFPGSA  219

Query  291  AANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLV  350
                  +              Y  L A    P         L    A   W  +  +L++
Sbjct  220  IVAGLLTAYGGMVYLAVLAWAYRSLSA-MPLPDLKQGAGARLLRANARRLWAALGIVLIL  278

Query  351  SLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLLLSKQRKTT  410
                   +A    + G+ +Q     +PQ+      +  + P    S         Q + T
Sbjct  279  FGGVAVYNAVYRTAPGETVQILRFGKPQRITTDPGTRVKIPFVEESRPLPDGFQAQYQGT  338

Query  411  S  411
             
Sbjct  339  Y  339


>OHD13769.1 hypothetical protein A2Y41_03625 [Spirochaetes bacterium GWB1_36_13]HCL55959.1 
hypothetical protein [Spirochaetia bacterium]
Length=262

 Score = 41.7 bits (94),  Expect = 0.50, Method: Composition-based stats.
 Identities = 26/225 (12%), Positives = 69/225 (31%), Gaps = 14/225 (6%)

Query  151  QNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILL  210
                 Q+ +    V  +++ ++      F         L   ++     + +   +  L 
Sbjct  15   FRMINQYRLWHYLVLPMVITITLAYILRFSLSAFGSDYLLEELRRQFPQLETIPFVSTLF  74

Query  211  ILV---------VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHW  261
             ++              +++I   ++  V + F    L ++    +   E    L+    
Sbjct  75   SIMKIFISIAFYFLFSFIMVIFYNIIGNVVWSFFIGSLQEEIEFKMSGKEIRYPLIRNIK  134

Query  262  WAIFGRF-----VLLLVISLTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLK  316
            W +   +     +LL++I+   S +   IP++G    L   + L  +         S   
Sbjct  135  WILISIWDSVKDILLILIAYLFSLILFFIPFLGALIQLVLIIFLNSYIMGRVIFRMSLEN  194

Query  317  ANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAEQ  361
             ++   +   I R+     A   G++ +       +         
Sbjct  195  HSHTLSERNKILRRHFKGNALGIGFLAVISSYFPLIGIFLFFVFW  239


>HAS54469.1 hypothetical protein [Nitrospiraceae bacterium]
Length=219

 Score = 41.3 bits (93),  Expect = 0.50, Method: Composition-based stats.
 Identities = 9/67 (13%), Positives = 18/67 (27%), Gaps = 0/67 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           VRC +C      P  K+       +C  C + +       +  + +  +   P       
Sbjct  3   VRCWNCNKLFRVPDEKIGEAGVQFKCTSCSEIVKITRENFEEYKQSHAVPPPPPPPAPEP  62

Query  64  IPSDRLE  70
                  
Sbjct  63  PRQTPRP  69


>OAD55042.1 putative Dol-P-Man:Man(7)GlcNAc(2)-PP-Dol alpha-1,6-mannosyltransferase 
[Eufriesea mexicana]
Length=2353

 Score = 42.4 bits (96),  Expect = 0.52, Method: Composition-based stats.
 Identities = 61/556 (11%), Positives = 129/556 (23%), Gaps = 47/556 (8%)

Query  4     VRCPHCGAERN--TPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             ++CPHCG + N       +P               I   A   R + T  +       L 
Sbjct  1381  IQCPHCGRKFNKAAAERHIPKC-----EHMLHNKPIHSRAPKPRLRATLGLLIIGALKLY  1435

Query  62    RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
             R        +Q            +                 +  L    W       +  
Sbjct  1436  RDALQSIFGLQFTKWFVAITVTQYHFMYYLSRPLPNIMALPLVLLALHGWLKQNHMIFIW  1495

Query  122   LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                  + +  A   +   L L              + I +    + LL    +    +  
Sbjct  1496  SSAAAIIMFRAELAMLLGLFLLYDIANKKLTIPRLFKIAVPAGIFFLLLTVTIDSIFWRR  1555

Query  182   I-------------------CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
             +                     T   L+       R +    LL+ L +L       L +
Sbjct  1556  LLWPEGEVFYFNTILNRSSEWGTSPFLWYFYSALPRGLALSYLLIPLGMLWDARVRALTV  1615

Query  223   IPGLLFCVW-------FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVIS  275
                +   ++         F  YV    N+       +     +   W  F   ++L  + 
Sbjct  1616  PGIVFIGLFSFLPHKELRFIIYVFPLLNVSAAAVCHRIWENRAKSPWNGFLALIILSHLV  1675

Query  276   LTLSFLTARIPYVGEAANLAFSL---------LLTPFSFLYYYLIYSDLKANYRGPQHPP  326
                 F    +   G       ++          L P       L      + +    +  
Sbjct  1676  FNALFSMFLLCVAGSNYPGGLAIAKLHRLEKDSLYPVHVHIDILTAQTGVSRFTQTNNSW  1735

Query  327   IKRQWLPLTAAIFGWMLIPGLLL-----VSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTP  381
             I  +   LT      +    LL+      S + +       +    D    +       P
Sbjct  1736  IYSKQENLTIDSPEMLQFTHLLMEAKSKYSPNIKPYLKTHDILDSVDGFSHIALNYNILP  1795

Query  382   DLNRSLPEEPQRLSSADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLE  441
              +          +          K+    +    S                +    ++  
Sbjct  1796  PIKIKTRPIIFIMKRKPNIQYDPKKATLQTYLKQSNESFIEKRIERRHIINDTVKHMEPI  1855

Query  442   LSDFPNLSLAQKGSARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIR  501
             L +        +    I    V D + +D    ++           ++  +  ++ +   
Sbjct  1856  LDEIMESLEQFEKPLNINYVNVFDTEVQDTIKLENITYDSTEFVESLSIEEIPNILAKNN  1915

Query  502   SIYLRQGTQAEQVHSI  517
               +L   T  E++  I
Sbjct  1916  LSHLETSTNEEKLEII  1931


>PKO19890.1 hypothetical protein CVU38_21450 [Chloroflexi bacterium HGW-Chloroflexi-1]
Length=244

 Score = 41.3 bits (93),  Expect = 0.53, Method: Composition-based stats.
 Identities = 27/198 (14%), Positives = 58/198 (29%), Gaps = 2/198 (1%)

Query  112  ELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGL  171
               C      + + ++  +   A   S   ++         + ++           LL L
Sbjct  1    MCCCILFAVAIVMTIVQYMARAALYRSVDQIEETGVAPTGREGFRLGWTNRAFRLFLLEL  60

Query  172  SWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVW  231
                  +   +    V     + L  +      + +   I +     LLLI+  ++  V 
Sbjct  61   MVGIAVLLAALLLATVAAAPLLLLLTKSGVLKGIGIGSTIWLGLVWILLLIVAAVIQSVL  120

Query  232  FFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
              F    +   +     AL     LV G    + G ++L+  I      +   IP V   
Sbjct  121  GQFWSREIILADRSIGDALASGYHLVRGRLKDVGGMWLLMFAIGFGFGLV--MIPVVLVV  178

Query  292  ANLAFSLLLTPFSFLYYY  309
               A +  +     ++  
Sbjct  179  IVGAGAAGVAMGFAIHAA  196


>RIO89492.1 zinc ribbon domain-containing protein [Staphylococcus haemolyticus]
Length=295

 Score = 41.7 bits (94),  Expect = 0.53, Method: Composition-based stats.
 Identities = 20/286 (7%), Positives = 61/286 (21%), Gaps = 12/286 (4%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
            +CP+CG           +        +          ES  ++      +          
Sbjct  2    QCPNCGQTYQPGDQYCGSCGKKL--NDSTLQSTQSTTESVNSEIQSATHSKTVKQEGFNE  59

Query  65   PSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI  124
                   +      R  +               +           ++        G    
Sbjct  60   TQRDSAYEHGEFAHRYMSYEHRYAEGPFTLKVKTTFNESKSFFKQAFTAHDAVIKGEHSF  119

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICK  184
                +          L +    +       +        + ++L  L  +   + +    
Sbjct  120  SHTLLASLVVIGLLILGMFLHIFSASLFDGYFVDTATIILKFVLTILLALVFLLVVTFGV  179

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
              + +   +           +  + + ++  G  +  +   +   +   F   +L     
Sbjct  180  IRLMIVERILFKKVLSDYILINTLSVSVLFLGLVVFFLEFYIFSGILIAFSTILL-----  234

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
                 +  S  L+S +      R      I +    +   +   G 
Sbjct  235  -----ITSSIYLISKYSVNHTLRIASFYGIMIFFVIIAFAMHLFGS  275


>OIW10275.1 hypothetical protein TanjilG_28026 [Lupinus angustifolius]
Length=230

 Score = 41.3 bits (93),  Expect = 0.53, Method: Composition-based stats.
 Identities = 11/81 (14%), Positives = 24/81 (30%), Gaps = 0/81 (0%)

Query  207  LILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFG  266
              L         L ++       V +     V+  +   GL+AL +S  L+ G       
Sbjct  81   YFLGFSTAVLVVLPMLFVMFYLQVRWILVPVVVVLEPCWGLEALRRSASLIKGMKKVALI  140

Query  267  RFVLLLVISLTLSFLTARIPY  287
              +   ++     +    +  
Sbjct  141  LLLFFGLVEGLYLWCIPVLNI  161


>VVA18096.1 PREDICTED: LOC100276777 [Prunus dulcis]
Length=173

 Score = 40.5 bits (91),  Expect = 0.54, Method: Composition-based stats.
 Identities = 18/97 (19%), Positives = 35/97 (36%), Gaps = 3/97 (3%)

Query  227  LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR--  284
               V +     V   + I G++AL K+  L+ G     F   +L   +SL++        
Sbjct  52   YLAVVWNLALVVSILEEICGIEALGKAEQLIKGSKRRGFSLNILFGALSLSVFCGVLMSE  111

Query  285  -IPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYR  320
                +      +   L+  F    Y ++Y + K  + 
Sbjct  112  NAAMIPLLLLNSIYCLIEMFKLTTYTVLYHESKETHG  148


>MBE0634690.1 hypothetical protein [Candidatus Bipolaricaulota bacterium]
Length=245

 Score = 41.3 bits (93),  Expect = 0.54, Method: Composition-based stats.
 Identities = 16/176 (9%), Positives = 49/176 (28%), Gaps = 1/176 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            L+ +  L      A           +    ++  ++  ++ A    ++ G+   T +   
Sbjct  15   LILLAPLFSAAQIALYGHFRGDSIGSSRVVESFRFRNWVVSALTLLLIAGIFAATLAFV-  73

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             + +  V +  +            +      + +   + L ++P  +  +          
Sbjct  74   LLPQLGVSMIVASFALGALYPLIGVSPESSFIGIALTASLFMVPTSMITLGCLLAPLHAT  133

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAF  296
             D +G + AL +S     G    +    +   +  +           V +      
Sbjct  134  KDGVGPIVALRRSWNTTYGSKRRLMQVALPFYLAVIGFVLTGYLPSSVAQPILSIL  189


>WP_109274543.1 hypothetical protein [Brachybacterium endophyticum]PWH07656.1 
hypothetical protein DEO23_03260 [Brachybacterium endophyticum]
Length=302

 Score = 41.7 bits (94),  Expect = 0.54, Method: Composition-based stats.
 Identities = 33/276 (12%), Positives = 66/276 (24%), Gaps = 5/276 (2%)

Query  16   PSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRIPSDRLEIQSKT  75
            P +                            +      T                     
Sbjct  9    PDNGSGPSPWQQGPTIPTSPADAPQQPYGEQRPYGAQQTFGEQQAVGGQQPFGAPPFDGA  68

Query  76   VNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAP  135
                    +      R    + S   +   + +  W +      GL       +  +   
Sbjct  69   PAPGTDLGADLGAALRWMWKAFSRNVAAFLVPSIVWSVVSFVIIGLFVGIGFAVFYSAVE  128

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
              +     P            +A          L  S         +      + R+   
Sbjct  129  GAAGSDETPPLGPILGAYAIMFASAPVAGLVGALWQSGTARGGRTVLEGERPSIGRAFIG  188

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRL  255
              R V +  L+L+L+ + +    +    PGL+  V   F     A      ++AL++S  
Sbjct  189  SGRLVLTALLVLVLVGVGMVLLYI----PGLIVAVM-SFYALPAAARGARPVEALKESFA  243

Query  256  LVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA  291
            L   +       +++L+  S   SFL   I  +   
Sbjct  244  LAKQNLGTTIIAYLILMAASSVASFLLVGIFVLIPL  279


>HIF59940.1 hypothetical protein [Rhodospirillales bacterium]
Length=110

 Score = 39.4 bits (88),  Expect = 0.55, Method: Composition-based stats.
 Identities = 10/75 (13%), Positives = 17/75 (23%), Gaps = 0/75 (0%)

Query  4   VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRR  63
           + CP C    +   S +       RC  C  +    P   Q       +   P       
Sbjct  3   ISCPSCSTSFSVSGSDIGTGGQMVRCFNCSHSWHQYPLPPQAQAQYVPVQYVPPGQFSAP  62

Query  64  IPSDRLEIQSKTVNC  78
                + +       
Sbjct  63  PQVSNVGVSQPQYMP  77


>WP_125026058.1 hypothetical protein [Chryseobacterium carnis]AZI34427.1 hypothetical 
protein EIB73_03700 [Chryseobacterium carnis]
Length=283

 Score = 41.7 bits (94),  Expect = 0.56, Method: Composition-based stats.
 Identities = 33/232 (14%), Positives = 64/232 (28%), Gaps = 10/232 (4%)

Query  113  LFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLS  172
            +F          Y L        + S        +       +        V  +   + 
Sbjct  23   IFIYLREKSFKFYSLYNFFLLLYLMSRNDDYYNLFEGAVAYLFGAQQADVFVRILNFFIQ  82

Query  173  WMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
             +  + +       + L + +K     V     +L LL L  G    L+ IP     ++ 
Sbjct  83   IVFYNFYSIFALYFLDLDKHIKKYFNRVVLILKILGLLFLGFGIICYLMQIPDFYISLYT  142

Query  233  FFCQYVL-ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV---  288
            F    V+     +  L+A+  S     G     F   V   V+   +SF    IP +   
Sbjct  143  FLYLPVMLIIFILSVLKAIRYS-----GKHKNFFLVGVCFYVMCALISFAGTFIPSLNMN  197

Query  289  -GEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIF  339
               +      ++ T F  L        +        +  I+ +       + 
Sbjct  198  NPISFFYVGIIIETIFFSLGLAYKIKLINDEKNRVHNLVIQHRHQQQIGKMQ  249


>WP_152622641.1 zinc-ribbon domain-containing protein [Archangium violaceum]
Length=274

 Score = 41.7 bits (94),  Expect = 0.56, Method: Composition-based stats.
 Identities = 11/66 (17%), Positives = 14/66 (21%), Gaps = 0/66 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQR  62
            + CP C  +       LP      +C  C    I  P  S                  R
Sbjct  2   EIACPQCSMQYALDPRLLPPGGVPVQCTRCSHVFIATPPASAAAPAPQATQAFGAVPQPR  61

Query  63  RIPSDR  68
                 
Sbjct  62  PATQPN  67


>NLP29229.1 hypothetical protein [Clostridia bacterium]HCW05157.1 hypothetical 
protein [Clostridium sp.]
Length=199

 Score = 40.9 bits (92),  Expect = 0.57, Method: Composition-based stats.
 Identities = 25/132 (19%), Positives = 44/132 (33%), Gaps = 4/132 (3%)

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
            +      L  S    L H+G      I+         +L I       +    C      
Sbjct  1    MDNIRFSLTESFFYMLLHIGGVIATSIVWATTALISLMLTIARTFSVGII-EVCGGEFFK  59

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL-SFLTARIPYVGEAANLAFSL--  298
            + +   QA+ +S  LV  ++W +F   +L  +  + L + L + +  V     L   L  
Sbjct  60   NKVDTSQAIRRSWELVKHNYWKVFASMILFFLSVMALRTSLESLVSLVTSILYLVLRLLG  119

Query  299  LLTPFSFLYYYL  310
               PF     Y+
Sbjct  120  GRIPFMEFVLYI  131


>OJU53307.1 hypothetical protein BGN96_09595 [Bacteroidales bacterium 45-6]
Length=305

 Score = 41.7 bits (94),  Expect = 0.58, Method: Composition-based stats.
 Identities = 21/138 (15%), Positives = 49/138 (36%), Gaps = 2/138 (1%)

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +  S+        + L  S+ + L       ++  +L+  V G  +L      L   +  
Sbjct  119  ILKSIGHIAGDVLLFLVISLFIFLPLYIVLAIVSKVLMPFVVGYFILAFGFCFLLTWFNL  178

Query  234  FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAAN  293
                 + D   G  ++L  +  ++  ++  I G  V+L +I   +  L   +P     A+
Sbjct  179  SLIVYVRDYYSGYFESLGSAWKMIIQNFKHIIGANVVLFLILYIVQTLVTFVP--SMIAS  236

Query  294  LAFSLLLTPFSFLYYYLI  311
            L +           +++ 
Sbjct  237  LLYLSSGRHLYSNGFFVF  254


>OQY57355.1 hypothetical protein B6247_00535 [Beggiatoa sp. 4572_84]
Length=218

 Score = 40.9 bits (92),  Expect = 0.58, Method: Composition-based stats.
 Identities = 21/103 (20%), Positives = 48/103 (47%), Gaps = 2/103 (2%)

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVG--GGSLLLIIPGLLFCVWFFFCQYVLADD  242
                L  ++   L ++  + +L++LL   +     ++L      +  V   F   ++  D
Sbjct  54   QPAVLIAAVMDSLIYLFLYVVLMLLLQAAIFYRLSAILTQSDMGILAVSLRFFIPLILFD  113

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            N   L++L++S  LV G+WW       + L+I +++  +++ I
Sbjct  114  NATVLESLQRSHQLVWGNWWHTAIVLTIPLLIIISVGVMSSAI  156


>NIQ96043.1 NINE protein [Desulfuromonadales bacterium]NIS42136.1 NINE protein 
[Desulfuromonadales bacterium]
Length=121

 Score = 39.4 bits (88),  Expect = 0.58, Method: Composition-based stats.
 Identities = 12/35 (34%), Positives = 15/35 (43%), Gaps = 0/35 (0%)

Query  2   PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
             V+CPHC   +     K+ A      CPEC Q  
Sbjct  4   VQVQCPHCSFSKAVDREKVGAGNKKVTCPECSQHF  38


>NTU80423.1 hypothetical protein [Chloroflexales bacterium]
Length=332

 Score = 41.7 bits (94),  Expect = 0.59, Method: Composition-based stats.
 Identities = 16/137 (12%), Positives = 38/137 (28%), Gaps = 1/137 (1%)

Query  156  QWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVG  215
               I L  +   L  +  +              L      G        +  +L ++   
Sbjct  131  MGPIFLLNLILALPVIIIVGIVAATVAASVVGALSSLGPEGNPANPGAFIASLLGLIFCV  190

Query  216  GGSLLLIIPGLLFCVWFF-FCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
             G +LL+         +    Q     + +G + +L +   L+  +       ++   V+
Sbjct  191  LGIILLLALVAGLLGIWSRVAQRACVIEILGPVASLGRGWQLIRRNLGLTLLTWLFQGVL  250

Query  275  SLTLSFLTARIPYVGEA  291
               + F+ A        
Sbjct  251  GWIIGFILAIPALAIAV  267


>WP_169527880.1 hypothetical protein [Flavobacterium sp. SE-s28]NMH28767.1 hypothetical 
protein [Flavobacterium sp. SE-s28]
Length=280

 Score = 41.3 bits (93),  Expect = 0.59, Method: Composition-based stats.
 Identities = 23/138 (17%), Positives = 47/138 (34%), Gaps = 3/138 (2%)

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVL  239
                         +   +  + +  +   + +L   G S + I    LF  +      ++
Sbjct  124  FIHYGRRTFGNVLLLGVVVSLITTCVSTGVELLHFPGFSTVNIAFSFLFGCFSLLAIPLV  183

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISL---TLSFLTARIPYVGEAANLAF  296
                 G L+AL  S  L+S  +  IF  +VL + +++      F+         +     
Sbjct  184  IFGEFGPLEALSASFKLISKQFSTIFSLYVLAIFVAISGLLPMFIGIATALPALSIFSIG  243

Query  297  SLLLTPFSFLYYYLIYSD  314
             +   PF F + Y +Y  
Sbjct  244  FIFTLPFLFAFVYTLYKQ  261


>WP_005398774.1 hypothetical protein [Helcococcus kunzii]EHR33215.1 hypothetical 
protein HMPREF9709_01259 [Helcococcus kunzii ATCC 51366]
Length=217

 Score = 40.9 bits (92),  Expect = 0.60, Method: Composition-based stats.
 Identities = 28/195 (14%), Positives = 53/195 (27%), Gaps = 25/195 (13%)

Query  136  IFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKL  195
                LL+    +   +  ++    LL  +  ++                        ++ 
Sbjct  18   FIYYLLVAVFLYGLNEFFSYFNLDLLYMLFILVFLPFINLEFYHSIKENRKPKFINMVRY  77

Query  196  GLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ-YVLADDNIGGLQALEKSR  254
                     L   L    +    LL  IPG+   + +     +V  DD IG   AL+KS 
Sbjct  78   KNSRPLVMNL---LESFYLLFWYLLFFIPGMYKTLSYTLAIKFVSEDDEIGYNDALKKSD  134

Query  255  LLVSGHWWAIFGRFVLLLVISLTLSFLTARIP---------------------YVGEAAN  293
              + G    +F   + + +    + F+                           V    +
Sbjct  135  EEMKGLKSPLFIAQLCVFLPLFFILFMLTFPNEMRILNGESNLADIRMVNVVQSVIIVLS  194

Query  294  LAFSLLLTPFSFLYY  308
               SL   P  +  Y
Sbjct  195  SIISLAFIPVFYAKY  209


>AVV84616.1 glycerophosphoryl diester phosphodiesterase [Shewanella putrefaciens]
Length=459

 Score = 42.1 bits (95),  Expect = 0.60, Method: Composition-based stats.
 Identities = 19/189 (10%), Positives = 46/189 (24%), Gaps = 9/189 (5%)

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
                      L  +  L    +      A +   A              L+     + + 
Sbjct  64   MAFAMSPYGLLFMVITLFSSFSVFFTQRAGVTILAASGLYDIPIGPVRALIFAAGRLPIF  123

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLI---------LVVGGGSLLL  221
            +   +      +  +   +          +    +   L             +   +L+ 
Sbjct  124  IRLGSTYAACLLLLSVPFIGCGYLAFSLFLQGHDINYYLYYKPIEWYFALTSILVLALIY  183

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFL  281
             +  L       F   ++  + I  L AL++S  L +G+   I        +    L + 
Sbjct  184  ALIALYLWARTAFSLTIIVSETISVLNALKRSWQLSAGNERRIMWHAAGWWLRIGALVYP  243

Query  282  TARIPYVGE  290
               +   G 
Sbjct  244  LILLISWGA  252


>OGF20315.1 hypothetical protein A2Y83_03000 [Candidatus Falkowbacteria bacterium 
RBG_13_39_14]
Length=263

 Score = 41.3 bits (93),  Expect = 0.60, Method: Composition-based stats.
 Identities = 47/219 (21%), Positives = 88/219 (40%), Gaps = 13/219 (6%)

Query  116  RRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMT  175
              G  L+ +  + I+     +F                N  + I+L    + ++ +S  T
Sbjct  39   FFGMILIELASIIIMAVPVVMFVIFNYTFRGKKIIPILNIIFGIILLLALFAVIYISIAT  98

Query  176  GSMFIYICKTDVG---LFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF  232
             +    I K       +    K   ++V  F ++ +++ +VV   SLLLIIPG++F V++
Sbjct  99   KAGMYLILKKFPPQEKVKEIFKEARKYVWKFFVVGVIVFVVVLLWSLLLIIPGIIFSVYY  158

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVG---  289
             F  + L  +      AL++S+ LV G+WWA+ GRF +  +I      + +         
Sbjct  159  TFSGWALIFEGHENTNALKRSKELVKGYWWAVVGRFFIFGIIIGIFFLILSAPLLFLSKG  218

Query  290  ----EAANLA---FSLLLTPFSFLYYYLIYSDLKANYRG  321
                             ++P   +Y YLI+ +L      
Sbjct  219  TAPYALWQFIKQPAQFFISPIVVIYSYLIFKELVKIKGP  257


>XP_022895190.1 uncharacterized protein LOC111409367 [Olea europaea var. sylvestris]
Length=418

 Score = 41.7 bits (94),  Expect = 0.61, Method: Composition-based stats.
 Identities = 22/211 (10%), Positives = 54/211 (26%), Gaps = 10/211 (5%)

Query  80   RCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSA  139
                          +        +S L +  W  +           L+  +L  + +   
Sbjct  57   YWRIRRTEHQLHWTKTGTPKYERLSDLASSEWANYMIFRAVYFTFLLIFSLLCTSAVVYI  116

Query  140  LLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRH  199
            +            +       +     +    ++     +  I            L L  
Sbjct  117  VACIYTAREITFKKVMSVVPKVWKRLMVTFLCTFFAFFAYNII----------FALTLIL  166

Query  200  VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
             G          +V     ++  I  +   + +     V   ++  G +A+ KS+ L+ G
Sbjct  167  WGDTIADSTAGAVVFVIILIVYFIGFVYMTIIWQLASVVTVLEDSYGFKAMIKSKALIKG  226

Query  260  HWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
              +     F+ L      ++F+       G 
Sbjct  227  KMFITIIIFLKLNFTLAFINFVFKAYVVYGW  257


>AKF17187.1 membrane protein [uncultured bacterium Csd4]
Length=287

 Score = 41.3 bits (93),  Expect = 0.61, Method: Composition-based stats.
 Identities = 17/181 (9%), Positives = 46/181 (25%), Gaps = 7/181 (4%)

Query  120  GLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMF  179
             L     L  +                    +  +         +   L G   ++ ++ 
Sbjct  40   FLPSALGLATMTTINRNGMYYEDDVWISSLFKVVDAYSPFSYLLLFLGLWGAHLLSYTLL  99

Query  180  IYICKTDVGLFRSMKLGLRHVGSFTLLLILL-------ILVVGGGSLLLIIPGLLFCVWF  232
                   +   R      R      L+   +          +    +L  +  L+  V  
Sbjct  100  KASQDGRLPGKRLQFAEARKYMRPMLVKTFVVSLFVALTYFLLSLHVLFWLLFLVVGVPL  159

Query  233  FFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
                 +   ++ G + A  K   L    W+ +   F+ ++++   ++F       +    
Sbjct  160  MLLTPIWIIEDDGFIDAFNKCFRLGYVSWFQMVVIFLFMMMLGFLMTFSVFLFWSLSTMF  219

Query  293  N  293
             
Sbjct  220  M  220


>MBI2932895.1 DUF4013 domain-containing protein [Planctomycetes bacterium]
Length=401

 Score = 41.7 bits (94),  Expect = 0.61, Method: Composition-based stats.
 Identities = 14/171 (8%), Positives = 42/171 (25%), Gaps = 7/171 (4%)

Query  154  NWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILV  213
                 +            + +     I             +          ++ I L  +
Sbjct  210  MVFDLLKFTLAHVASFLPAALVIIGMILWLVDVAAASSYGEGEAALGRMAIIVWICLPPI  269

Query  214  VGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
            +  G + L    +       F   +   +    ++++        G +      +++L V
Sbjct  270  ILLGLMGLCYLPMALLANCIFGFPLSCFNPAFIVRSIWA----TRGDYLICLLAYLVLYV  325

Query  274  ISLTLSFLTA---RIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRG  321
            +   +SF+      + +        F++  T          Y   +     
Sbjct  326  VDTVVSFIAGLNDLVFFASVIPASIFTMYATVVQMRLLGQFYRYNQGRLGW  376


>MBI5143244.1 zinc-ribbon domain-containing protein [Nitrospirae bacterium]
Length=164

 Score = 40.1 bits (90),  Expect = 0.61, Method: Composition-based stats.
 Identities = 8/36 (22%), Positives = 14/36 (39%), Gaps = 1/36 (3%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
           M  V CP C  +      ++    +  +CP+C    
Sbjct  1   MV-VLCPKCKTKLKLDDDRVAESGTKFKCPKCTTAF  35


>MBI3407269.1 hypothetical protein [Planctomycetes bacterium]
Length=121

 Score = 39.4 bits (88),  Expect = 0.62, Method: Composition-based stats.
 Identities = 8/40 (20%), Positives = 14/40 (35%), Gaps = 3/40 (8%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDP  40
           M    CP C  + +  +S       +A C  C + +    
Sbjct  1   MIRFPCPKCKTQFSVDNSF---AGGTADCTWCGEKMTIPQ  37


>SFB38995.1 hypothetical protein SAMN03159300_10480 [Janthinobacterium sp. 
344]
Length=842

 Score = 42.1 bits (95),  Expect = 0.63, Method: Composition-based stats.
 Identities = 30/322 (9%), Positives = 57/322 (18%), Gaps = 26/322 (8%)

Query  6    CPHCGAE-------RNTPSSKLPA--------KKSSARCPECCQTLIFDPAESQRTQTTD  50
            CP C A            ++                  C   C       +         
Sbjct  36   CPVCSAPGSGSLAFLFVRAAACWCFGAFSCRRGGRRGACCVGCAGSALAFSLGFSAVGCF  95

Query  51   NIATCPHCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADS  110
                           +  L +     +               +   G   R  +  +   
Sbjct  96   VRFGAVCAVFFLWWLACWLLVFLVLSSGWLGGMPLWAAWLPCWWLCGVSARGWAWGVLGG  155

Query  111  WELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLG  170
               +   G G  G +  G                +                  VA+  L 
Sbjct  156  ARCWWGGGMGGRGRWPGGGFSGRWAWRWGSSCPSSFSCWCVQPLPFGCCSGPGVAFASLV  215

Query  171  LSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCV  230
                   +    C    G                 +       V G    +         
Sbjct  216  AGCWLSLVCCLGCVLGAGWLFLRSSAGGRSWRCRWVAC--CGRVAGWCCWVAFSLWFRSF  273

Query  231  WFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
            +  F    L    + GL +L +                ++   +     F        G 
Sbjct  274  FSPFLLCSLCAGGLVGLASLGRFFWWFQ---------CLVCCSLPCFFGFGACLPVLAGA  324

Query  291  AANLAFSLLLTPFSFLYYYLIY  312
                  S   + FS  ++   +
Sbjct  325  VVVCFGSFWPSRFSLWFWGRCW  346


>OPX25475.1 hypothetical protein B1H05_03755 [Candidatus Cloacimonas sp. 
4484_140]
Length=365

 Score = 41.7 bits (94),  Expect = 0.64, Method: Composition-based stats.
 Identities = 16/168 (10%), Positives = 53/168 (32%), Gaps = 7/168 (4%)

Query  146  TWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTL  205
                 +  N  +      ++ +   L  +     I+           ++  +    +   
Sbjct  11   FKDIFKAWNHSFRFHRLMISGVAYILMGIIAYFLIFGTFIKPMAMFGIRDSILIAQNLVS  70

Query  206  LLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIF  265
            +  ++ +++    + +    + F +        +     G     +     +    + + 
Sbjct  71   VKSIIGIILIAFIMFIARMCIAFSIRTEVVDDEIFV---GWKDVFKFLVKNIKTFVFYVL  127

Query  266  GRFVLLLVISL--TLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLI  311
            G  +L ++I+L     ++   IP+VG        L   PF+   + ++
Sbjct  128  GWILLFVLIALGYLFFYVIGMIPFVGSLWLSIIYL--IPFAISIFAVL  173


>WP_088109248.1 DUF975 family protein [Tyzzerella sp. An114]OUQ57968.1 hypothetical 
protein B5E58_08430 [Tyzzerella sp. An114]
Length=234

 Score = 40.9 bits (92),  Expect = 0.65, Method: Composition-based stats.
 Identities = 34/207 (16%), Positives = 68/207 (33%), Gaps = 11/207 (5%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
            +         I +L + L    + S +         P        I+ +     +     
Sbjct  34   WIYSAIVFFIIKILSLSLLTDILNSFVFKYIIILNIPLRIQIISLIISSISLLFISFPFE  93

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
            +    +    K +   F+ +  G +      +L I++ L+   G LLL++PG++    F 
Sbjct  94   VGILNYCINIKNNKSDFKDIFYGFKIYSKTLILGIIIGLLSAIGLLLLVVPGMIIIYGFS  153

Query  234  FCQYVL-ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA  292
               ++L  D  I    AL+ S  +  G    IF   +  +   +  S             
Sbjct  154  MTPFILLKDTEISSTDALKLSWDMTRGKKMDIFLFELSFIGWRILSSI----------IT  203

Query  293  NLAFSLLLTPFSFLYYYLIYSDLKANY  319
                 + L P+ +    L + D+   Y
Sbjct  204  LSLGDIFLNPYFYTAKSLYFEDIYNEY  230


>NLH07244.1 hypothetical protein [Chloroflexi bacterium]
Length=311

 Score = 41.3 bits (93),  Expect = 0.65, Method: Composition-based stats.
 Identities = 23/220 (10%), Positives = 59/220 (27%), Gaps = 7/220 (3%)

Query  117  RGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTG  176
                ++G+ L  + L                      +   A        I +G+     
Sbjct  90   CVAFVIGVVLWALGLVARGALVHGASVLDAGGATTFGDSWRAAWARGWRLIGIGILPAIP  149

Query  177  SMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQ  236
             + + I    +G+                    L +     + L  I   +  +   F +
Sbjct  150  MLILLIVGAGLGMAAFSMRSFSGDMFAGPGFGGLGVTFVAVACLAAIAAFVLGLLQTFAE  209

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEA-----  291
                 +N    ++  +   ++  +       F++ + I L +  LT  +  +        
Sbjct  210  RAAMLENTSVFESYGRGWQVLRDNLGPALVLFLIQVGIGLGIGVLTLILAPLLICLCLII  269

Query  292  --ANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKR  329
                L  +  +T +    + L +        GP+   +  
Sbjct  270  IPVMLLVNGTVTAYFSTLWTLAWRRWTGLAGGPEFVEMPP  309


>MBA4064256.1 hypothetical protein [Isosphaera sp.]
Length=215

 Score = 40.9 bits (92),  Expect = 0.67, Method: Composition-based stats.
 Identities = 9/29 (31%), Positives = 12/29 (41%), Gaps = 3/29 (10%)

Query  1   MPTVRCPHCGAERNTPSSKLPAKKSSARC  29
           M + RCP CG     P +K     +   C
Sbjct  1   MISFRCPKCGNSYRFPDTK---AGAKFVC  26


>WP_145428205.1 hypothetical protein [Symmachiella dynata]QDT50945.1 hypothetical 
protein Pan258_50280 [Symmachiella dynata]
Length=341

 Score = 41.7 bits (94),  Expect = 0.68, Method: Composition-based stats.
 Identities = 30/344 (9%), Positives = 65/344 (19%), Gaps = 30/344 (9%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHC---  58
               RC  CG+    P  +        RCP C +  +           + +  +       
Sbjct  3    IEFRCD-CGSRLRVPDQR---AGQVVRCPACEEQTLVPKLGEDPDAYSVDGKSPRSDDLV  58

Query  59   ----GLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELF  114
                   R     R +   +         S          +          L +  +   
Sbjct  59   SVRMNTSRPPAEKRTKSTYRAARSSEPPTSKAAPSAWRSASREKRRSQRPWLASFKFPFQ  118

Query  115  CRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW-  173
                + LLG  +    +A         L                        + +     
Sbjct  119  DENKFTLLGWGIGFAFVAIMMSIPIPSLVANIARLFVLLIAAGYFFHFLSEVVRVAAGGD  178

Query  174  -----------MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLI  222
                       +      +     +GL       L            +  ++        
Sbjct  179  VELPETSSGEDVLQDAVTWCGAIVLGLSPWWGFHLLRWWFAWETPAEVGEILLLVGAFYS  238

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
               LL    F              L A+ K                V+  ++   +  + 
Sbjct  239  PMALLAATLFNSILAANPL---YVLGAIFKLPRQYLTCCLLSASVIVVYFLLGAVIMSML  295

Query  283  ARIPYVGEA----ANLAFSLLLTPFSFLYYYLIYSDLKANYRGP  322
                 +        +    L ++         +Y   +      
Sbjct  296  GPDSLLMMLFVHTISALLLLYVSIVVMHQLGTVYYKNRTKIGWF  339


>OGO90750.1 hypothetical protein A3F10_06675 [Coxiella sp. RIFCSPHIGHO2_12_FULL_42_15]
Length=327

 Score = 41.3 bits (93),  Expect = 0.69, Method: Composition-based stats.
 Identities = 21/272 (8%), Positives = 67/272 (25%), Gaps = 19/272 (7%)

Query  128  GIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDV  187
                    +          WL        W + +     + +  +  +           V
Sbjct  64   PHWETINVVAMMPNSYTIAWLVAGLFLTIWFLQICNAIMVRVCWNVASIGHAKLGNAFAV  123

Query  188  GLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-----  242
            G        L+ +    ++ ++  +              +  +      Y +        
Sbjct  124  GFKYFYVYLLQLLVIIAVMAVMYGIQYLLSLFHQDWLNAIIAILTQMGIYYILLKLILVN  183

Query  243  -----NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFS  297
                 ++ G +   ++    SGHWW   G  V               + ++        +
Sbjct  184  ASAVIDLCGFRGFARAWNFTSGHWWRTLGSLVFSYSFY---------LMFIMMIGLGIIT  234

Query  298  LLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNL  357
            L+L P     +    + + A+             +  T   F  ++     +++ ++  +
Sbjct  235  LVLMPGLNPVFGSNLAHMMAHMPDSGPMWFGIIMMIFTILTFLAIIFTVPTMMAANQAMI  294

Query  358  SAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPE  389
              +         + +     +   + +     
Sbjct  295  YNDIKYRHNGHAEVQTEYSHETHAEHSGKYNP  326


>NEE54877.1 hypothetical protein [Streptomyces sp. SID8455]
Length=222

 Score = 40.9 bits (92),  Expect = 0.70, Method: Composition-based stats.
 Identities = 29/150 (19%), Positives = 50/150 (33%), Gaps = 4/150 (3%)

Query  124  IYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYIC  183
            ++LL   +  A + + L              W+ A          + L  +T  + + + 
Sbjct  77   LFLLASAVVQAAVPAVLQEAVLGRPARFGSVWRRAWSRVWAMIGTVFLLGLTAVVPMMLL  136

Query  184  KTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDN  243
               V       + L    S   LL   +L       L     +   V       V+  +N
Sbjct  137  MAAVAATTVYFVTLGDADSALPLLWTGLLGTLLLGPLA----VWIWVKLSLAPTVVVFEN  192

Query  244  IGGLQALEKSRLLVSGHWWAIFGRFVLLLV  273
             G   A+ +S  LV G WW +FG  +L + 
Sbjct  193  QGPFAAIRRSAQLVRGSWWRVFGIGLLAVG  222


>KZV19977.1 hypothetical protein F511_27652 [Dorcoceras hygrometricum]
Length=324

 Score = 41.3 bits (93),  Expect = 0.71, Method: Composition-based stats.
 Identities = 20/167 (12%), Positives = 64/167 (38%), Gaps = 2/167 (1%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +  ++ + ++L      +++          +   +  ++     ++  L  +    ++F+
Sbjct  70   IPVLHAVFVLLFTLFASASITYSTFHGFYGRPVKFVSSMKSILFSFFPLLATVSIAAIFL  129

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL--LLIIPGLLFCVWFFFCQYV  238
             +    VG    + +    +  F +    +  +V   S+  LL+   +   V ++    +
Sbjct  130  GLVCFVVGFIAWLAVNGLLLFGFEIDYEGVFFMVIAISIAVLLVCSVICLFVDWYLTIVI  189

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            +  ++  G + L++S  L+ G     F   +L  +    +SF  + +
Sbjct  190  VVVESKWGFEPLKRSSYLMKGMKNVGFFMILLFGLGLGLMSFSCSML  236


>MBI4857296.1 glycerophosphoryl diester phosphodiesterase membrane domain-containing 
protein [Acetobacterium woodii]
Length=605

 Score = 41.7 bits (94),  Expect = 0.72, Method: Composition-based stats.
 Identities = 42/403 (10%), Positives = 98/403 (24%), Gaps = 42/403 (10%)

Query  200  VGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSG  259
            +  F +  IL   ++     ++I+   L  +++ F  +    ++    +A++KSR L+ G
Sbjct  159  IPDFVMSHILASPMLSLIYFVVIVMLFLLVIFWIFSIHYFTLEDNNFFEAIKKSRALLKG  218

Query  260  HWWA-IFGRFVLLLVI---------------SLTLSFLT-------ARIPYVG-------  289
            H+W   F        I                  L+ +          I   G       
Sbjct  219  HFWHTTFWVAFWNTAILLVLMLLLGLLEVVAYFVLTQILEDTLAVSIFIGSFGFFVAAFF  278

Query  290  --------EAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGW  341
                           S+    +S +    I S      +      +  +           
Sbjct  279  TTFQLIETSITFALVSMFYYDYSQMAKIEIPSTALIVNKNNSDEAVSERKKLFIGVSMVS  338

Query  342  MLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKL  401
            + +  +L   +              +  +      P  +   N     +      ADY  
Sbjct  339  IFVIVVLNSYVIFTTFHETFNSHFIEGPKISAQNSPLSSTPENTLAFLQQAIDEGADYAE  398

Query  402  LLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIEID  461
            +   Q K                      +          +       L  +   ++ I 
Sbjct  399  IDVAQSKDGVIVVSKPTIKKTTDQPINVWEMTADELKNSNIPTLSEALLFCQDKIKLSIQ  458

Query  462  KVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVHSILGKL  521
             +                    +          D  S  ++  +    +   V  +    
Sbjct  459  -INPSGNESSLVASTIAVIQQTNSSAWCILASLDYPSLEQAARVDPQIRRAYVTGVALGE  517

Query  522  ELTLPL---AIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVT  561
              T+P+   +IE+  +T   +        +          ++T
Sbjct  518  IQTIPVDALSIEASFITPKKVTAIHGEHKEVFAWTVNSEESIT  560


>HIH51751.1 hypothetical protein [Nanoarchaeota archaeon]
Length=254

 Score = 40.9 bits (92),  Expect = 0.74, Method: Composition-based stats.
 Identities = 23/143 (16%), Positives = 52/143 (36%), Gaps = 4/143 (3%)

Query  166  YILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPG  225
              ++  +              + +   +        +   L   +        +L+++  
Sbjct  111  LSVMFSTGGKYWFRFLGATLSILVIVLVPFLAVLFTANKFLYDSVPAAQLIAIILILLVL  170

Query  226  LLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
            +LF V+F F  Y L   ++    ++++S   V  ++      F++L +     S L  +I
Sbjct  171  MLFFVFFAFAPYFLVMKDMRVFASIKESFAFVKKNYVDALLLFLILTI----TSILINQI  226

Query  286  PYVGEAANLAFSLLLTPFSFLYY  308
            PYVG   NL     +     +Y+
Sbjct  227  PYVGWVINLLLLGPVQALVLVYF  249


>WP_176066946.1 hypothetical protein [Anaeromyxobacter sp. R267]
Length=266

 Score = 40.9 bits (92),  Expect = 0.75, Method: Composition-based stats.
 Identities = 15/137 (11%), Positives = 35/137 (26%), Gaps = 7/137 (5%)

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
              +   +  L  +T +    +          +        S  +   L          + 
Sbjct  63   LALFLDVWLLVALTRAALFVLQGRSPSPNAIVATIRPRWRSILVFSALFAAGTFLLGAIG  122

Query  222  IIPGLL-------FCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
            +    L       +    +    VLA +   G+ A+ ++  LV   W        +    
Sbjct  123  VWSWALSFALSGAWSFSLYLGIPVLAREPATGVGAVRRAWALVRRCWARRLTGLFVTGAC  182

Query  275  SLTLSFLTARIPYVGEA  291
             +    L   + +   A
Sbjct  183  FVLACGLCGALAFALMA  199


>RME82318.1 hypothetical protein D6771_07200 [Zetaproteobacteria bacterium]
Length=439

 Score = 41.7 bits (94),  Expect = 0.75, Method: Composition-based stats.
 Identities = 36/366 (10%), Positives = 83/366 (23%), Gaps = 40/366 (11%)

Query  133  FAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRS  192
               ++   L+  A +                 + I+             +          
Sbjct  64   LGALYFLGLMWTAAFAIASLFPALRGENAPLSSTIMRASRATPLFALYALFGFVFIAACI  123

Query  193  MKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEK  252
              L    +    +   L+ + V  G+L++    +L     +     +  +     +A+++
Sbjct  124  GILAAASLLLAKVHPALMAVGVFVGALVI----MLITARLWMASVNIVAEGASPWRAIKR  179

Query  253  ------SRLLVSGHWWAIFGRFVLLLVISLTLSF--------------------------  280
                             +F + V ++VIS   S                           
Sbjct  180  AWRAFGWWAAARFWGNLLFQQIVFMVVISAIASLTAGSLVATLFGALQQNAAAQGAGLAL  239

Query  281  -LTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIF  339
             + + + ++   A +        F+ L    ++ +       P+  P  +     + A+ 
Sbjct  240  GIVSALGWLLLIAAIVGGF--MGFAMLATAAMFHEECIGAYRPEIEPPSKTSWVASGAVA  297

Query  340  GWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADY  399
                +   L    S    SA  L SA      +    P+         P           
Sbjct  298  LGFGLLFALPGVQSPHAPSAPALPSAPSPAPSQAVRTPKPENRRA-PAPSGDAAAHRRAA  356

Query  400  KLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGSARIE  459
                                   F       D+      K         +  + G  R  
Sbjct  357  WQAWLAGDDIRVIREADKALSASFDSAIRPGDEAQKRKAKAWTYAIRAWAYHRMGDERNA  416

Query  460  IDKVLD  465
               +  
Sbjct  417  RASLTS  422


>MBI5180183.1 zinc-ribbon domain-containing protein [Nitrospirae bacterium]
Length=188

 Score = 40.5 bits (91),  Expect = 0.75, Method: Composition-based stats.
 Identities = 15/181 (8%), Positives = 40/181 (22%), Gaps = 2/181 (1%)

Query  4    VRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNI--ATCPHCGLQ  61
            ++C  CGA+      ++       +C  C           +  ++      +      + 
Sbjct  3    IQCARCGAQYRLDDRRITHDFIRVKCTNCGNVFAAQKMVERIIESPMERGKSENIREQVI  62

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
             R        ++             +       A+      I  +  D    F       
Sbjct  63   ARRREQPAPARAAAGPGMADYIPGMIVMFFLTLAAIYFGNFIKSVSPDLVNRFHLNYVLF  122

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
            L +  +          S +          +       +     A   +G++ +       
Sbjct  123  LLVIGIAWRNTIGIPDSLMYGIGLGRPLLKIGIIIMGLRFGFGALADIGITGLAIIAIFV  182

Query  182  I  182
             
Sbjct  183  F  183


>WP_141556510.1 DUF975 family protein, partial [Bacillus pseudomycoides]PHE43515.1 
hypothetical protein COF52_31440, partial [Bacillus pseudomycoides]
Length=97

 Score = 38.6 bits (86),  Expect = 0.78, Method: Composition-based stats.
 Identities = 10/80 (13%), Positives = 30/80 (38%), Gaps = 1/80 (1%)

Query  212  LVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKSRLLVSGHWWAIFGRFVL  270
            +       + I+  ++    +    YV+ ++      QA+++S+ L+ GH   +F  ++ 
Sbjct  11   IAFFVLLAISIVGMIVMYFSYALTYYVMIENPEYSVSQAMKESKNLMKGHKLDLFLLWLS  70

Query  271  LLVISLTLSFLTARIPYVGE  290
             +  ++              
Sbjct  71   FIGWAILALLTFGIGFLWLS  90


>WP_146323698.1 hypothetical protein [Corynebacterium canis]TWT26880.1 hypothetical 
protein FRX94_03290 [Corynebacterium canis]
Length=332

 Score = 41.3 bits (93),  Expect = 0.78, Method: Composition-based stats.
 Identities = 17/153 (11%), Positives = 42/153 (27%), Gaps = 3/153 (2%)

Query  164  VAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLII  223
            V  +    + +  +  + I    +     +   L    +    L +  + V     L   
Sbjct  173  VFTLPHFGTALLLAFTMSIVDRAIFYLFMLGQILWLEVTTAASLTVFHIPVTMYYALPGF  232

Query  224  PGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF--L  281
                       C +    +      A+     +   H+  +F       +I+       +
Sbjct  233  LTTFLMPLITMCVFPA-FEGHKFRSAMSLGFKIGVKHYPILFIVGFSTWLITYVAGLVHV  291

Query  282  TARIPYVGEAANLAFSLLLTPFSFLYYYLIYSD  314
             A   +        F LL+ P   +++  +Y  
Sbjct  292  WAGFGFARVGIGQVFGLLMMPMVVVFFAHLYRQ  324


>OLL89795.1 putative integral membrane protein [Pseudonocardia sp. Ae331_Ps2]OLM09740.1 
putative integral membrane protein [Pseudonocardia 
sp. Ae706_Ps2]
Length=334

 Score = 41.3 bits (93),  Expect = 0.80, Method: Composition-based stats.
 Identities = 32/229 (14%), Positives = 59/229 (26%), Gaps = 22/229 (10%)

Query  123  GIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYI  182
             ++++        + +             +Q +  A       Y +  +      +   I
Sbjct  111  LLFVVAPPAHAVVLSTLRPAVLGRDRLTLSQAFTAARPHLRALYGVWAVVAALSLIPTLI  170

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD  242
              T              VG+ T+  ++ IL V    LL     +   +           +
Sbjct  171  ETTAAIPSWISPDIPDTVGTDTITAVVAILTVASALLL-----IYVSILIALAPAAAVIE  225

Query  243  NIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTA-------------RIPYVG  289
             +    AL +S  LV   WW  F     + VI L  + L                 P++ 
Sbjct  226  GLTAGAALRRSLELVHRSWWRCFAVLAAISVIILIPTTLITVPAIIIGIGIGMIAPPWIA  285

Query  290  EAANLAFSLLLTPFSFLYY----YLIYSDLKANYRGPQHPPIKRQWLPL  334
                      +   S  Y      L+Y D +          I       
Sbjct  286  ILTAATPLAGVFAISCSYSIGAAALLYHDQRIRRENYATDLIHDARQKP  334


>WP_157949979.1 hypothetical protein [Vallitalea okinawensis]
Length=304

 Score = 41.3 bits (93),  Expect = 0.82, Method: Composition-based stats.
 Identities = 27/284 (10%), Positives = 69/284 (24%), Gaps = 21/284 (7%)

Query  77   NCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPI  136
                    +         ++   +  +   L                     +  +    
Sbjct  14   MISTTFEKWTKNYSWILSSAAMFVVQLVMSLIIMVPAVIIVISMFAFGADAFMETSMTTP  73

Query  137  FSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLG  196
                +                AI+      + + L        +   +T    F    +G
Sbjct  74   IDNFVPPSIPTSRLMLIIIILAIITVFALVMGVYLRMGYARASLNFVRTGKYQFEDFFMG  133

Query  197  LRHVGSFTLLLILLILVVGGGSLLLIIPG--------------------LLFCVWFFFCQ  236
             +          L  L+V   ++  II                      ++  + +    
Sbjct  134  YKKFWLGFKASFLAGLIVFITAIPAIICMGLSYVHVGFLFLALPLLVIPIIVELRYAQIL  193

Query  237  YVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIP-YVGEAANLA  295
            Y++ D+     + ++KS  ++ G     FG   L+ +++L   F+      +        
Sbjct  194  YLVQDETTSATEIVKKSSEMMKGLKGKYFGLIFLISIVTLLPLFIITLFFAFASPIMGNL  253

Query  296  FSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIF  339
              L+           I S     Y+   +    +    +  A +
Sbjct  254  VQLITNFIGIFISSFIISIQAYFYQEIINNDQDKINHDIMDANY  297


>WP_016148070.1 DUF975 family protein [Butyricicoccus pullicaecorum]EOQ37245.1 
hypothetical protein HMPREF1526_01936 [Butyricicoccus pullicaecorum 
1.2]SKA58753.1 Protein of unknown function [Butyricicoccus 
pullicaecorum DSM 23266]
Length=274

 Score = 40.9 bits (92),  Expect = 0.82, Method: Composition-based stats.
 Identities = 24/138 (17%), Positives = 42/138 (30%), Gaps = 5/138 (4%)

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLIL--VVGGGSLLLIIPGL  226
            LGL      +   I            + LR          L  L   + G  L+++IP L
Sbjct  116  LGLCIFVHIVLWMIIPIAATAGYCYFMMLRDPAILENAEKLYQLILQMSGLYLVIMIPFL  175

Query  227  LFCVWFFFCQYV-LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTAR-  284
               + +F    + + D ++G  QA  ++     GH   +    V      L   +     
Sbjct  176  ARILRYFIAYPMIMRDRSLGAWQATRQAARAFKGHNLHLIVLVVSFFGWQLVGMWTMGIG  235

Query  285  -IPYVGEAANLAFSLLLT  301
             + Y G   +        
Sbjct  236  TLFYWGYLLSALLMYFWY  253


>HHS15334.1 hypothetical protein [Phycisphaerae bacterium]
Length=368

 Score = 41.3 bits (93),  Expect = 0.86, Method: Composition-based stats.
 Identities = 41/395 (10%), Positives = 76/395 (19%), Gaps = 30/395 (8%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
             T  C  CG +   P +         RC +C        A+       + IA        
Sbjct  3    ITAECTTCGRKYQAPDA---MAGKRVRCKQCGNIFQVPQADLDSGPDLNAIAELAELEKS  59

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                   +      +       +F            +G  ++    A + EL       L
Sbjct  60   FHPIDASISAGDSGLARAAPGETFTDAESPPIAPLRAGRTNVRFKFAYARELDYWTPIVL  119

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                LL +           +L                   A +      L        I 
Sbjct  120  TVGSLLCLGYQSLTRSETPVLWIKLTRL------------AILILAYSLLIAPLSLAMIR  167

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                            R          L   V+G    L+    L+  V        ++ 
Sbjct  168  AAGRRYRFQMPTADRWRAF-----ATYLPAWVIGVVMWLVGDGQLIALVLGCLAGVAIS-  221

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPY-VGEAANLAFSLLL  300
                   AL     L     + I    +   +       +   + + +   A        
Sbjct  222  -----TAALWLLFRL---QPYEIAPTALFAGIGFFLGLGIAGIVMWGLNTLALNIVVATK  273

Query  301  TPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSAE  360
             P       +       +    +     RQ             +P               
Sbjct  274  RPDVVPASPIAQGMAWISQEQQKQLLAARQPRTFRPTPANGKSLPPAAEPQTPPTPPPPT  333

Query  361  QLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLS  395
            +   +   + +     P          P       
Sbjct  334  ESPISSSPLVKLSRAAPIPPAFDEVIRPLTDSPYI  368


>EKD49365.1 hypothetical protein ACD_63C00169G0001, partial [uncultured bacterium]
Length=593

 Score = 41.7 bits (94),  Expect = 0.86, Method: Composition-based stats.
 Identities = 25/279 (9%), Positives = 63/279 (23%), Gaps = 21/279 (8%)

Query  102  SISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILL  161
              +Q     W            + L+ ++  F                         +  
Sbjct  68   MWNQEELIGWVFLQFAVALGKLLALISMMFEFVIKDDMFSNFLGNAAVDSGWKITVGVAN  127

Query  162  ATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLL  221
                 +LL + + T           + +     + L +       ++L    +       
Sbjct  128  MAFVLVLLAIGFATILRVPSYHIKKLIVPFIAAVLLINFSKLISGVVLDFCNII------  181

Query  222  IIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF-  280
                    V F                   +   LV G      G  +    I + + F 
Sbjct  182  -------MVTFTDHMKGGNIGGYFAKSMGVQDWFLVKGCPQDGCGTGIFFQTIFVVIVFE  234

Query  281  LTARIPYVGEAANLAFSL-------LLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLP  333
            +   I  +  +              +L+PF+++   L  +                    
Sbjct  235  IALMIALLFLSVFFVIRFITLLILIILSPFAYVAMILPSTKDFTGKWWKMFLQNAFYGPV  294

Query  334  LTAAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQR  372
                ++  + I   ++  +  +  +A   +       Q 
Sbjct  295  AVFMVYLAVDIVLQMVSDMPEEAENAAGTIFTNGINMQT  333


>NYZ79902.1 hypothetical protein [Candidatus Micrarchaeota archaeon]
Length=244

 Score = 40.9 bits (92),  Expect = 0.87, Method: Composition-based stats.
 Identities = 33/220 (15%), Positives = 77/220 (35%), Gaps = 12/220 (5%)

Query  110  SWELFCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL  169
             + +F      LL ++   +    A +    L +  T +          I L ++++ + 
Sbjct  21   YFSVFILFSLILLPLFSSYVNGGAAFLRFTSLYRDITLVTALVFVLVGLISLLSLSFFIS  80

Query  170  GLSWMTGSMFIYICKTDVGLFRSM-KLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL--  226
             +  +                 +  +  ++    F L+  L I V    S++     +  
Sbjct  81   AIISVVKLRETLDHVKFTKAVSTFTRYVVKVFMFFVLMSALSIAVGVLLSIVGAPVAITQ  140

Query  227  ----LFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTL----  278
                +  + F F   VL  +++    A+  +   V     A+   F+L  V +L L    
Sbjct  141  LALFVVWIPFVFTPQVLIIEDLDIADAMRDALKFVKKAPQALVLYFLLGFVFTLALVVIE  200

Query  279  SFLTARIPYVGEAANLA-FSLLLTPFSFLYYYLIYSDLKA  317
            ++L     +  + A++   SL + P+  ++   +Y    A
Sbjct  201  TYLGQYFIWEHKIASIVLLSLFVVPYLQMFATELYIRRYA  240


>RMD59350.1 hypothetical protein D6821_01435, partial [Candidatus Parcubacteria 
bacterium]
Length=558

 Score = 41.7 bits (94),  Expect = 0.87, Method: Composition-based stats.
 Identities = 58/484 (12%), Positives = 123/484 (25%), Gaps = 33/484 (7%)

Query  114  FCRRGWGLLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSW  173
                 + LL + +L +   F     +   K    L+     W   I         LG++ 
Sbjct  1    MDFNFYSLLTLAVLVVNFLFLRFVLSQARKTKFLLSYILAVWSLVIWQMVELVSSLGVAV  60

Query  174  MTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFF  233
                    +  + V       L    +     L   + ++VG   L ++           
Sbjct  61   GWLPFLWRVADSAVTFLMVFLLFFVLMMIAPKLSKYVYMLVGLSGLAMLALIWGNAYIIG  120

Query  234  FCQYVL----------------ADDNIGGLQALEKSRLLVSGHWWAIF--GRFVLLLVIS  275
               Y                      +G +           G   A+   GR++ + +  
Sbjct  121  VQPYFYGGWRPIYGPLGHWQIIWIVAMGIILVFSLMWWFWRGRQSALTAGGRYLAIGL--  178

Query  276  LTLSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLT  335
                 L   I  + +     F + + P S L   L      A         I+ +   + 
Sbjct  179  ----SLAVSIAIIFDLILPLFGIEIIPISGLASTLAVGSFLAEVYKFHFLDIRLRRFLIA  234

Query  336  AAIFGWMLIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLS  395
              +  + ++   +  S        E +    + I  +L +      +  R   E   +  
Sbjct  235  QKLLLFFVLIVAISASSVAYFAFKEGVNIIKEHILAQLESVAFLKENQLRGYLEREIKEM  294

Query  396  SADYKLLLSKQRKTTSEGGLSLGPVTLFADRFWADDQNPHLWLKLELSDFPNLSLAQKGS  455
             +D           T+             ++       P    +           A +  
Sbjct  295  QSDIVRADYIVDYLTNFHPEEQKEQEESKEQGHWHKNPPPEERRQSDQYEEEEEEAAEEE  354

Query  456  ARIEIDKVLDDDARDLYDRQHSFEHPAFHWVGINQTDENDLFSGIRSIYLRQGTQAEQVH  515
             R  +   L        + +  F       V I+   + +        Y + G +     
Sbjct  355  IRGVLQNEL-----LYKNLEEVFIITPDGKVDISTDPKQEGKFKNNEWYFQAGQKRAVSQ  409

Query  516  SILGKLELTLPLAIESLQLTRNDIGKTLQIGGKQLILQRLGSNAVTLRFLGD-RTDLLNV  574
             +   + +  P  + +  L  +         G    L     N + L  LG  ++    +
Sbjct  410  GLFYDISIRRPSIVIAAPLKDSAGRTVGVYAG---RLDPRKVNKLMLEQLGLGKSGETYL  466

Query  575  HASN  578
             A N
Sbjct  467  VARN  470


>WP_153043259.1 DUF975 family protein, partial [Bacillus cereus]
Length=108

 Score = 38.6 bits (86),  Expect = 0.89, Method: Composition-based stats.
 Identities = 16/83 (19%), Positives = 40/83 (48%), Gaps = 1/83 (1%)

Query  195  LGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADD-NIGGLQALEKS  253
               +++     L ++  + +   SLLLI+PG++    +    Y+L ++ +    +AL +S
Sbjct  26   FKKKNLFKSIKLGLMQAIFLFLWSLLLIVPGIIKYFSYSMSYYILVENPDYTASEALRES  85

Query  254  RLLVSGHWWAIFGRFVLLLVISL  276
            + ++ G    +F  ++  +   L
Sbjct  86   KRIMKGQKLKLFVLWLSFIGWFL  108


>HBC03416.1 hypothetical protein [Aequorivita sp.]
Length=238

 Score = 40.5 bits (91),  Expect = 0.89, Method: Composition-based stats.
 Identities = 22/235 (9%), Positives = 59/235 (25%), Gaps = 11/235 (5%)

Query  67   DRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYL  126
             + +                     +          IS                + G   
Sbjct  1    MQHKHIPLKQQRDVGEMITTYFEFFKQNFKPFLNIFISYNGLFILGFLGVSYLMVTGFIG  60

Query  127  LGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTD  186
                     I S                +   I+ A + Y L  +  M            
Sbjct  61   AYNASNNYGIESDNSGYFMMLGLGFMGFFILYIITAVLNYSLAAVYVMQYEKNRGAIVEK  120

Query  187  VGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFC-----------  235
              ++ ++K  L  +  F +L++   ++     ++L    ++    ++             
Sbjct  121  KKVWETVKQNLGKIILFIILMVFGYVIAMIAGIILGFIPIIGTFAYYLIVLAYTSWVGLS  180

Query  236  QYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGE  290
               + +DN     A  +   L+   +W      +++ ++   L  +   +P +  
Sbjct  181  FMAMINDNKDVTDAFGEGWKLMKSFFWKSVLSNLVIGMLLGILMMVVMMVPGILI  235


>NTU71802.1 hypothetical protein [Coriobacteriia bacterium]
Length=346

 Score = 41.3 bits (93),  Expect = 0.90, Method: Composition-based stats.
 Identities = 18/135 (13%), Positives = 40/135 (30%), Gaps = 3/135 (2%)

Query  160  LLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL  219
                     L       S+   +    V LF +     R         I   L++     
Sbjct  136  FRVWWRVAALYALGALPSLVSMLVMGLVVLFTTTIPLSRGELPSPAATIAGQLLMVPLQP  195

Query  220  LLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLS  279
            L  +  +L  +           +++G   AL +   LV  +  ++   ++L  +I   ++
Sbjct  196  LTAVASVLLGLLVNIAVRSAVLEDLGWRAALRRGFELVRRNLESVAVIYLLSTLI---VA  252

Query  280  FLTARIPYVGEAANL  294
             +      V    + 
Sbjct  253  SVLVAAAIVFGIVSS  267


>WP_145372228.1 hypothetical protein [Maioricimonas rarisocia]QDU40997.1 hypothetical 
protein Mal4_53620 [Maioricimonas rarisocia]
Length=466

 Score = 41.3 bits (93),  Expect = 0.90, Method: Composition-based stats.
 Identities = 39/385 (10%), Positives = 80/385 (21%), Gaps = 44/385 (11%)

Query  1    MPTV-RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCP---  56
            M  + RCPHC +               ARC +C +T   D A  +        +  P   
Sbjct  1    MSLIARCPHCDSRFRVREEL---AGKRARCRKCRETFTIDEASEEVVVLQPAESGEPAVS  57

Query  57   -HCGLQRRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFC  115
                       D                +       E     S + + +  +        
Sbjct  58   EDQHDDAPRLEDYFPADEDPDEVMATEAALPDIDADEDFQLKSPVDAGAATITSEPAAQA  117

Query  116  RRGWGLLGI---------YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAY  166
                              +         P                            VA 
Sbjct  118  APQMDAPFPVPPVAPSQEFAQPAPHQAMPSPHPGPAYGYPPAEGAFPTPGAVTPPPPVAA  177

Query  167  ILLGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGL  226
                         +    T   +       L       +    + + +  G  + ++ G+
Sbjct  178  PPGAFPAAAPVATLAGPGTAADVALHAPKLLHAAQWGLVWAAAMGVALLLGQYVHVLVGI  237

Query  227  LFCVWFFFCQYVLADDNIGGLQALEK-----------SRLLVSGHWWAIFGR--------  267
               V+  +  YV     +  L A +                V      +FG         
Sbjct  238  GLAVFATWTCYVGMISCVSYLAARQCETGVCPPQGQAWTFFVRNAVGLVFGTAALGAAVA  297

Query  268  ---FVLLLVISLT-----LSFLTARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANY  319
                ++   I++      L  +   +  +     +  +  +T   +L   +I  D +   
Sbjct  298  LAFAIVFGGIAMISQVETLGPIVGGLLVIPTFLLILIAASVTLNMYLLPVVIGVDDRGLL  357

Query  320  RGPQHPPIKRQWLPLTAAIFGWMLI  344
               +                    I
Sbjct  358  AAFRMICQLAVRQGFAMMYRYLQAI  382


>PJA37463.1 hypothetical protein CO181_03395, partial [candidate division 
WWE3 bacterium CG_4_9_14_3_um_filter_43_9]
Length=265

 Score = 40.9 bits (92),  Expect = 0.91, Method: Composition-based stats.
 Identities = 10/75 (13%), Positives = 29/75 (39%), Gaps = 2/75 (3%)

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF--LTARIPYVGEAANLAFSL  298
             + +   +AL +S  LV  H+       ++ + I   +    +   IP++     +    
Sbjct  129  IEGLTPRKALFESFALVKKHFLEQIALGLINVGIGCGVGCLTVLVVIPFIPVVLLIYILF  188

Query  299  LLTPFSFLYYYLIYS  313
             ++P   + + ++  
Sbjct  189  KVSPLIGVPFGVLVF  203


>EWC43568.1 hypothetical protein DRE_01455 [Drechslerella stenobrocha 248]
Length=426

 Score = 41.3 bits (93),  Expect = 0.91, Method: Composition-based stats.
 Identities = 26/273 (10%), Positives = 60/273 (22%), Gaps = 14/273 (5%)

Query  185  TDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNI  244
              + L   + L +  +        +    + GG+    I  +L          +     I
Sbjct  119  ELILLGMVLFLLVPQILKLAGSSRVYKAALFGGAAFFAIMCILAVARVIVYGVLSYQTYI  178

Query  245  GGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI-----PYVGEAANLAFSLL  299
              + A    R LV G+        +   V  +  +F            VG    L     
Sbjct  179  S-ISAGNSYRRLVKGYTVMYLLSNIFAAVFLVIAAFKVKMDRERRKGIVGWIPALIV--C  235

Query  300  LTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQNLSA  359
            L  +S     +++    A  R             +T   +    +   ++   +    + 
Sbjct  236  LIGYSIDNMVVVFITYGARRRSDLDKNGLLAAYCITLIFYFGAFLVLFIIARATGWEYAD  295

Query  360  EQLLSAGKDIQQRLGTQPQQTPDLN------RSLPEEPQRLSSADYKLLLSKQRKTTSEG  413
            +    A +        +                  +  +                     
Sbjct  296  QGHPPATQYAHNPADPRNHPVGGQPGAPYGGMQQMDPTKMTGYPQSSTPAPPYAAAQGMP  355

Query  414  GLSLGPVTLFADRFWADDQNPHLWLKLELSDFP  446
                G           +  N  ++    +   P
Sbjct  356  YSGNGMYEPQGQVHHTNTWNSGVYAPTPVHQIP  388


>CUB59715.1 hypothetical protein BN2127_JRS10_05231 [Bacillus subtilis]
Length=154

 Score = 39.8 bits (89),  Expect = 0.93, Method: Composition-based stats.
 Identities = 15/122 (12%), Positives = 42/122 (34%), Gaps = 0/122 (0%)

Query  169  LGLSWMTGSMFIYICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLF  228
            +  +       +    T V +  ++  GL            +I+     ++L I+  +L+
Sbjct  14   MMKTMWAMLAILLYTGTWVPMLGTIVFGLLGFLDENANPSFIIVFFILLAILFIVMAILY  73

Query  229  CVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYV  288
              +      ++        QA+++S+ L+ GH   +F  ++  +  ++            
Sbjct  74   FSYSMTYYVMVEKPEYSVSQAMKESKHLMKGHKLDLFLLWLSFIGWAILAILTLGIGFLW  133

Query  289  GE  290
              
Sbjct  134  LS  135


>HCK71050.1 hypothetical protein [Planctomycetaceae bacterium]
Length=433

 Score = 41.3 bits (93),  Expect = 0.94, Method: Composition-based stats.
 Identities = 33/309 (11%), Positives = 70/309 (23%), Gaps = 27/309 (9%)

Query  2    PTVRCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQ  61
               +C +C +    P      +     C +      +      R     ++A        
Sbjct  132  IEFQCDNCNSIVAVPQEFAGKQGKCPYCDQVMVIPEYSSVRRFRDDPLSSLAAGTMPNAA  191

Query  62   RRIPSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGL  121
                 D L + +          S         +   S +    Q   D    +      L
Sbjct  192  GPGSYDPLGLPTSNTIPTTAGGSSHSHRSTPKQFDESMVLPWEQEEYDGKRFWDSSKVLL  251

Query  122  LGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIY  181
                     + F    +  +                 I         +            
Sbjct  252  FSPSRAFGSMKFDESTTKAIGYSTKG---------HLIGWLLTVLTAIPWIAFLLMAAEQ  302

Query  182  ICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLAD  241
                             +   +T + I     + G  LL         + F F       
Sbjct  303  ASDEQ----------FDYPKIYTWIGIGAGAWIVGSQLL--------SLAFVFIFTTTFH  344

Query  242  DNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAANLAFSLLLT  301
              +    +  +   +       I G   L  V+SL ++ +++ I     A + A ++  T
Sbjct  345  AGLLITGSSPREIDVTFRITAYISGVMGLTTVLSLPIAMISSIIWIPALAYHGAVAVHKT  404

Query  302  PFSFLYYYL  310
            P    +  +
Sbjct  405  PPPQAFLAV  413


>WP_012633310.1 zinc-ribbon domain-containing protein [Anaeromyxobacter dehalogenans]ACL65458.1 
MJ0042 family finger-like protein [Anaeromyxobacter 
dehalogenans 2CP-1]
Length=530

 Score = 41.3 bits (93),  Expect = 0.94, Method: Composition-based stats.
 Identities = 7/34 (21%), Positives = 12/34 (35%), Gaps = 0/34 (0%)

Query  3   TVRCPHCGAERNTPSSKLPAKKSSARCPECCQTL  36
            +RC  C        + L  + S+ +C  C    
Sbjct  2   LIRCERCSTLYELDEALLAPEGSAVQCTRCQHVF  35


>PIR48170.1 hypothetical protein COU80_05915 [Candidatus Peregrinibacteria 
bacterium CG10_big_fil_rev_8_21_14_0_10_55_24]
Length=315

 Score = 40.9 bits (92),  Expect = 0.96, Method: Composition-based stats.
 Identities = 22/195 (11%), Positives = 63/195 (32%), Gaps = 17/195 (9%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +  +  L     +  + +    +    L  ++      + L ++   L   +W+   +  
Sbjct  120  IFTLIFLLWSGGYYLLLATTPERDILKLLQKSGRLLIPLFLLSLRIFLWSFAWLALLLIN  179

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLA  240
             +            + L          ++L++      +L ++    + +       +  
Sbjct  180  AVG--------MWIVRLPLGMLTKSAPVILLIPSFLALILSVLLMAPYAIA-----PLRL  226

Query  241  DDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSF----LTARIPYVGEAANLAF  296
             +  G +Q++ +S   V G    +   F  LLV+   +S     +   +P +G       
Sbjct  227  LEGHGVVQSMRESAQSVRGRQLHVGAMFCALLVLLWVVSLLTQTVAQILPLIGALLVGVV  286

Query  297  SLLLTPFSFLYYYLI  311
              L   F+  +  ++
Sbjct  287  GQLELAFAACFLVVL  301


>WP_156137542.1 hypothetical protein [Methyloceanibacter caenitepidi]
Length=308

 Score = 40.9 bits (92),  Expect = 0.96, Method: Composition-based stats.
 Identities = 36/231 (16%), Positives = 62/231 (27%), Gaps = 17/231 (7%)

Query  77   NCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGIYLLGIVLAFAPI  136
              R                                 LF    +    +  +   LA    
Sbjct  57   WWRMLLYEATGILVSMGMLVFCLHVCGGLDAGRLRALFRPHPYWSFAVVWIVWWLAELSG  116

Query  137  FSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFIYICKTDVGLFRSMKLG  196
             S +    + W          A  L    +I  G        F  I    VG   +  + 
Sbjct  117  VSVVAALFSAWDLEHTYTMSLAPWLIVFDFIGWGSLSPGTYAFPAIVFGAVGWSPAEIMN  176

Query  197  LRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLL  256
              H G      +L+             P  +  V+  F   ++ D N+G ++A+ +S  +
Sbjct  177  STHWGFVCFAAMLI-------------PYFVTRVFLAFPGLLVIDRNLGPIEAMRESIRM  223

Query  257  VSGHWWAIFGRFVLLLVISLTLSFLTAR---IPYVG-EAANLAFSLLLTPF  303
              G    + G  V   V+ + +  L      +P  G      A +L    F
Sbjct  224  TKGSRGRLLGMVVFWTVVGVVVRMLGFYWDEVPLEGRVVIFAALTLKWICF  274


>WP_107725008.1 hypothetical protein [Desmospora activa]PTM58174.1 hypothetical 
protein C8J48_0752 [Desmospora activa DSM 45169]
Length=359

 Score = 40.9 bits (92),  Expect = 0.96, Method: Composition-based stats.
 Identities = 33/283 (12%), Positives = 77/283 (27%), Gaps = 5/283 (2%)

Query  121  LLGIYLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILLGLSWMTGSMFI  180
            +L + L+G ++             A      N+     I      +      +    + +
Sbjct  76   ILIVLLIGTLIFVILSTFIYSFCYAGSYAMVNEIVLDGIASIRTYFGSGFRYFGRMFLHL  135

Query  181  YICKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSLLLIIPGLLFCVWF--FFCQYV  238
            ++    +  F    +    + +  L        +  G   L+    LF V         +
Sbjct  136  FLLGLVLFPFSIPAIVFEVLANIALFNGNETGRLLWGIAALVGWIFLFIVNLGAIHGPVI  195

Query  239  LADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARIPYVGEAA--NLAF  296
            L  +N G  ++L+ S  L    +  +F   + + + +     + +    V      N   
Sbjct  196  LTAENKGAWESLKLSYRLTFKSFGKVFLTVLCI-IAATITYMIFSIPMTVTSMFAENNIG  254

Query  297  SLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWMLIPGLLLVSLSRQN  356
             LLLT      +YL     +   +       K+                     S    +
Sbjct  255  LLLLTILLMGLFYLFLPFYQMAVQLIISLRYKQHLRKFVVPEEELQTDGSPYGGSFDSPS  314

Query  357  LSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADY  399
                 +  +  +       + +Q+    +  P+        D 
Sbjct  315  SPQSDVPQSATESAAGAAAEYEQSDTNEKEAPKNKYPQFPTDP  357


>OZA74597.1 hypothetical protein B7X77_08285, partial [Caulobacter sp. 39-67-4]
Length=175

 Score = 39.8 bits (89),  Expect = 0.97, Method: Composition-based stats.
 Identities = 18/71 (25%), Positives = 28/71 (39%), Gaps = 3/71 (4%)

Query  215  GGGSLLLIIPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVI  274
                + +    + F +   F   +  D    G  A+  S  L  GH  AI G +VL  V+
Sbjct  30   LISFVGVACLMIYFTIRLSFASTMTFD---TGRIAVRASWALTKGHVGAILGAYVLATVM  86

Query  275  SLTLSFLTARI  285
            +L +  L   I
Sbjct  87   ALIVYLLIMTI  97


>OQD88128.1 hypothetical protein PENANT_c004G04864 [Penicillium antarcticum]
Length=732

 Score = 41.3 bits (93),  Expect = 0.97, Method: Composition-based stats.
 Identities = 20/203 (10%), Positives = 48/203 (24%), Gaps = 0/203 (0%)

Query  223  IPGLLFCVWFFFCQYVLADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLT  282
                +  +      ++        ++AL  S+ L S            +  + L    + 
Sbjct  306  YGFAIPGLLVSTMLFIHLPGKYIFVRALRGSKHLTSNGIVHWSAWIGCVTGVGLISYIIA  365

Query  283  ARIPYVGEAANLAFSLLLTPFSFLYYYLIYSDLKANYRGPQHPPIKRQWLPLTAAIFGWM  342
            + IP      +L  +LL T   F     ++          +  P     +  +  +    
Sbjct  366  SAIPVFDSLVSLIGALLGTFMCFQPMGCMWLYDNWGKGKTERSPRWTAMVCWSGFVIIIG  425

Query  343  LIPGLLLVSLSRQNLSAEQLLSAGKDIQQRLGTQPQQTPDLNRSLPEEPQRLSSADYKLL  402
                +     S         L      +              R  P   +     ++   
Sbjct  426  TFMMVAGTYGSVALTPTCFFLRCFITTEHIRLPYNDNMAANARYEPAPQRDSFEDNHFSH  485

Query  403  LSKQRKTTSEGGLSLGPVTLFAD  425
                 + T++         +  D
Sbjct  486  APPSYQATADPEPRSEGDNVPDD  508


>WP_198181099.1 MULTISPECIES: zinc ribbon domain-containing protein [unclassified 
Lactobacillus]MBI0121988.1 zinc ribbon domain-containing 
protein [Lactobacillus sp. M0398]MBI0123864.1 zinc ribbon 
domain-containing protein [Lactobacillus sp. W8174]MBI0136032.1 
zinc ribbon domain-containing protein [Lactobacillus sp. 
W8173]
Length=283

 Score = 40.9 bits (92),  Expect = 0.98, Method: Composition-based stats.
 Identities = 32/286 (11%), Positives = 74/286 (26%), Gaps = 30/286 (10%)

Query  5    RCPHCGAERNTPSSKLPAKKSSARCPECCQTLIFDPAESQRTQTTDNIATCPHCGLQRRI  64
            +CP CGA      +   +     R              S +           +    +  
Sbjct  3    KCPQCGAIMGKDVNFCTSCGYDLR------------NVSVQKVEEAPTNLVENTAQVKSK  50

Query  65   PSDRLEIQSKTVNCRRCNRSFCLQPEREFRASGSGLRSISQLLADSWELFCRRGWGLLGI  124
             ++     +              Q   +   +                +     + +LG+
Sbjct  51   QTESQPAAASESINYGDQFQNYWQWCVDSWRNPGTSSPNVASWYGWLTILIEDAFVVLGL  110

Query  125  YLLGIVLAFAPIFSALLLKPATWLNPQNQNWQWAILLATVAYILL--GLSWMTGSMFIYI  182
            Y     +    I  A  +                +L   V  ++    +     + + Y+
Sbjct  111  YYCANTIVSTLINWANRMGANINGWENFHVPFNVMLEIFVVLVIFEAIVIGGFYAGYRYV  170

Query  183  CKTDVGLFRSMKLGLRHVGSFTLLLILLILVVGGGSL---LLIIPGLLFCVWFFFCQYVL  239
               ++  F  +  G        L+     L++  G      +I   ++    FF  Q+++
Sbjct  171  YDRNLSFFEFINRGAHACNFNLLISAAFFLLMLLGLNSIKFVIAVFIIMLALFFASQHII  230

Query  240  ADDNIGGLQALEKSRLLVSGHWWAIFGRFVLLLVISLTLSFLTARI  285
               + G +                I G  +   +I + L  L + I
Sbjct  231  LFGDQGAV-------------HDKILGFLIAFAIIFVCLMILDSII  263



Lambda      K        H
   0.318   0.0739    0.150 

Gapped
Lambda      K        H
   0.267   0.0226    0.140 

Effective search space used: 1353141561552


  Database: nr30
    Posted date:  Jan 28, 2021  9:08 AM
  Number of letters in database: 7,716,583,296
  Number of sequences in database:  33,704,358



Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
